BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781110|ref|YP_003065523.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] (420 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781110|ref|YP_003065523.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040787|gb|ACT57583.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 420 Score = 866 bits (2238), Expect = 0.0, Method: Compositional matrix adjust. Identities = 420/420 (100%), Positives = 420/420 (100%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA Sbjct: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN Sbjct: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL Sbjct: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD Sbjct: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ Sbjct: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN Sbjct: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG Sbjct: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 >gi|315122479|ref|YP_004062968.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495881|gb|ADR52480.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 427 Score = 481 bits (1237), Expect = e-133, Method: Compositional matrix adjust. Identities = 247/429 (57%), Positives = 322/429 (75%), Gaps = 13/429 (3%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+LL F F FKK I S KANFSI+F++ ++S LL IG LIYVLD+++KKN+ME+AN +A Sbjct: 1 MNLLRTFLFNFKKIILSPKANFSILFSVILISILLFIGILIYVLDYYHKKNAMENANTSA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 IL+GASK++S +S GD ++S+H RA++DD RFIK++IKESL S+VF +E Sbjct: 61 ILSGASKIISRISYFGD---NMSSHTHRAIVDDVTRFIKSYIKESLLMDSSVFDISEKNI 117 Query: 121 IVNSSRISMT--------HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 I +S++S+T H NN+ N FY++ V T YDY ++F ++LLN++ N K Sbjct: 118 ISQNSKVSITREPHPNVFHEFNNQSILQNKKTFYHISVETFYDYHIKFFDNLLNKKINSK 177 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC-QDKKRTKMAAL 231 I+SF+PAL++I+ GE P F ++LVVDLS SM C MNSDPE +C + KK +KM AL Sbjct: 178 IISFVPALVKIDTGEHPFFFVQLVVDLSASMSCLMNSDPEHATEFSVCGKSKKNSKMDAL 237 Query: 232 KNALLLFLDSIDLLSHVKEDV-YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K A+LLFLDS+D S ++D Y+GL GYTTRVEKNIEPSWGT KVR+Y+ ++D +L Sbjct: 238 KKAVLLFLDSVDRGSKTQKDTHYIGLTGYTTRVEKNIEPSWGTGKVRKYIVEEIDVNMLG 297 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 TDSTPAMK+AYQILTSDKKR+F N + +KIP LPFQKF+IFLTDGENN+ KS+V T Sbjct: 298 QTDSTPAMKKAYQILTSDKKRNFIRNILHKRIKIPPLPFQKFLIFLTDGENNDPKSDVKT 357 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 IKIC+KAK+N IKI+TISINAS NG+RLLK CVS+PEY+YNVV+ SL+ VFQ+IS L+ Sbjct: 358 IKICEKAKKNSIKILTISINASANGKRLLKKCVSAPEYYYNVVDTGSLLRVFQDISTLIT 417 Query: 411 HRKYSVILK 419 H KY VILK Sbjct: 418 HYKYQVILK 426 >gi|222087111|ref|YP_002545646.1| hypothetical protein Arad_3867 [Agrobacterium radiobacter K84] gi|221724559|gb|ACM27715.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 401 Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 84/328 (25%), Positives = 138/328 (42%), Gaps = 37/328 (11%) Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 A +F+ + LSG + N ++ + T + +SS T Y + V S Sbjct: 82 ATQFVTGQMSNYLSGDT---------NTADALKAGTTANVTSATNSSGGT-SYTVAVNAS 131 Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 YD + + LL + + ++ +E+ +D SGSM +N+D D Sbjct: 132 YDMSVNGMSQLLGIKTMHVSAASTSTSGSAAAAKQAALSMEIALDKSGSML--LNTDVID 189 Query: 214 VNSAPICQ-----------DKKRT-----KMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 + Q K ++ K+AALK A+ LD +D + V I Sbjct: 190 TSQKSCTQYYTEGNYLYQYPKAKSPCYIKKIAALKTAVGTLLDQLDSADPKSQYVRTAAI 249 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 +++ V+ + +WGT R V +++ T+S+ M AY K S + Sbjct: 250 AWSSEVDSSSALAWGTTTTRSNVISGLNAN--GGTESSAPMALAY------KNVSASSEA 301 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 Q K + FQK I+ +TDGENN S+ T+ C AK+ + I +++ A GQ Sbjct: 302 TAQAAK-GNTTFQKIIVLMTDGENNATSSDTKTLATCKAAKDAGVLIYSVAFMAPDRGQT 360 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNI 405 LLK C SSP +++ LI F+ I Sbjct: 361 LLKNCASSPSNYFDAQQMSDLIAAFKTI 388 >gi|163760496|ref|ZP_02167578.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] gi|162282447|gb|EDQ32736.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] Length = 363 Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 94/404 (23%), Positives = 184/404 (45%), Gaps = 58/404 (14%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 +R +K + +E NF++I A +V + + + K +++A ++A LA Sbjct: 4 TRITSKIRKLLRNENGNFALIAAAAVPVLFMAGSLAVDTTNAMSMKVRLQNAVDSAALAT 63 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 A+++ E+++ +A A +F+ +KE ++ F T NI Sbjct: 64 AARLSEE--------ENLTAAQAQAF---ALKFVNGQVKEDFGAFNG-FSVTPTVNI--- 108 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + +++ T+ + + V L + ++ + ++ VS + Sbjct: 109 ----------DPVETGGRTV-WKVAVSMEGSQSLTPMARIMGK--DKLTVSVVGK--SES 153 Query: 185 MGE-RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 GE + F + LV+D SGSM +N + K+ LK A+ ++ + Sbjct: 154 AGEAQGAFSMALVLDRSGSMDWNLNG---------------QKKINVLKTAVGGLIEQFE 198 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAY 302 ++ V +G Y +++ + + W K +++V D+L TDST A AY Sbjct: 199 EADPERKYVRLGASSYNSKLTGSTKLRWNPGKTKEFV----DALPASGGTDSTDAFDWAY 254 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 +T KR T+ + G ++P +KFI+F+TDG+NN ++ +T +CD AK++ I Sbjct: 255 TAVT--HKRENNTHDAKSG-QVP----KKFIVFMTDGDNNYSSADSSTKHLCDDAKDDGI 307 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 ++ T++ A G++LL C S+ E+ ++ N+ LI F+NI Sbjct: 308 EVYTVAFAAPNRGKQLLSYCASTEEHFFDAQNSAQLIEAFKNIG 351 >gi|241206334|ref|YP_002977430.1| hypothetical protein Rleg_3648 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240860224|gb|ACS57891.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 400 Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 68/233 (29%), Positives = 105/233 (45%), Gaps = 31/233 (13%) Query: 195 LVVDLSGSMH---CAMN-SDPED--------------VNSAPICQDKKR---TKMAALKN 233 LV+D SGSM +N SDP + V C D + TK+ ALK Sbjct: 166 LVLDRSGSMGEDTATVNASDPTEEYNYDCSEKDRYGNVTKKKTCTDTRPHYYTKIEALKL 225 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 A+ +D + KE V G + Y ++K WGT V +YV + TD Sbjct: 226 AVGTLTGELDAVDPEKEYVRTGAVSYNIEMQKAKALDWGTAHVTKYVNK---LTATDGTD 282 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 S A K AY L + + Q +P+ K+I+F+TDG+NN ++ T Sbjct: 283 SGEAFKTAYNKLADAAEDKAHVDKTGQ---VPT----KYIVFMTDGDNNYTSADTETKTW 335 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 CDKA++ +++ TI+ A GQ LL C ++P ++ + +L+ F+ I Sbjct: 336 CDKARDAKMQVYTIAFMAPARGQALLSYCATAPGNYFPAGDMTALLKAFKEIG 388 >gi|150397936|ref|YP_001328403.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150029451|gb|ABR61568.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 419 Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 67/228 (29%), Positives = 103/228 (45%), Gaps = 26/228 (11%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED--- 251 LV+D SGSM N+ P + +K LK ++ ID L D Sbjct: 192 LVLDRSGSMAWKTNTINTGKAKCPNYTEANWSKYPDLKATGPCYVTKIDALKTAVGDLLA 251 Query: 252 -----------VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMK 299 V G I Y + + SWGT YV D+L+ + T S A K Sbjct: 252 QLVTADPESAYVRTGAISYNSAQDAASSLSWGTRGAAGYV----DALVAIGGTASGNAFK 307 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A+Q +T+ + S G K +P K+I+F+TDGENN+ + T + CD AK Sbjct: 308 TAFQKVTNAAEDS------EHGAKNGQVP-TKYIVFMTDGENNHANDDTVTRQWCDTAKA 360 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + ++I +++ A GQ+LLK+C SS +++ A L+ F+ I + Sbjct: 361 SKVQIYSVAFMAPDRGQKLLKSCASSSSHYFEAEEASDLVAAFKAIGE 408 >gi|315122473|ref|YP_004062962.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495875|gb|ADR52474.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 403 Score = 77.0 bits (188), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 99/427 (23%), Positives = 193/427 (45%), Gaps = 67/427 (15%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 +L R F+ K ++ F I+ A + L+ + F+I + + KN ++S+ + AI+ Sbjct: 9 ILRRIYFFSK----NKSGVFHIMSASIIFVCLIFVSFVIDITHLLHMKNHIQSSLDNAII 64 Query: 63 AGASKMVSN--LSRLGDRFESISNHAKR-ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 +G S +VS+ ++ L + E I + K+ A ++ + F H + + F + Sbjct: 65 SGCSIVVSDPKINDLNPQEERIRDVIKKNAYVNMVQNFPAEHAAYIIENANISFS----K 120 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 ++ N +T A ++L N FI L I S Sbjct: 121 DLTNKYEYKITMEAKHQLSGKN------------------FILGFLMPNVITHISSISTG 162 Query: 180 LLRIEMGERPIFLIELVVDLSGSM------HCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 +++ + ++ F +E+V+D SGSM C ++S +K ++K+ ALK Sbjct: 163 IIQ-KPSDKKAFSVEMVLDCSGSMLDSMQESCDLSSGRGGYYFYSKNNNKPKSKIYALKT 221 Query: 234 ALLLFLDSIDLLSHVKEDVY--------MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 A S D ++ ++E V +GLI + + ++ + S +++ ++R Sbjct: 222 A------SSDFVNLIQETVQTFPQISARIGLITFNHYIMQDSKLSNNFNVIKKTISR--- 272 Query: 286 SLILKP---TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 +KP TD+ M AY+ L + + N ++P +++II +TDGENN Sbjct: 273 ---MKPKGGTDTFLPMNAAYEYLNNIPNETKAHNISD------NVPLKRYIILMTDGENN 323 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + ++ TI +CD A++N I I +I +N +G L + C SS ++ + N +L+ Sbjct: 324 HPSYDLKTINVCDNARKNGIIIYSIFLNYYEYTDGYELARKCASSEKHFFYANNTKALLD 383 Query: 401 VFQNISQ 407 F++I+ Sbjct: 384 SFKSIAH 390 >gi|327189644|gb|EGE56794.1| hypothetical protein RHECNPAF_570041 [Rhizobium etli CNPAF512] Length = 415 Score = 73.2 bits (178), Expect = 8e-11, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 79/186 (42%), Gaps = 9/186 (4%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 TK+ ALK A + + V G + Y +WGT V YV Sbjct: 222 TKIEALKMAAGNLFGQLTSADPDAQYVRTGAVSYDIDQYTPSTLAWGTSGVSSYVNALQA 281 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+S+ AM AY LT+ +K +P +K+I+F+TDG+NNN Sbjct: 282 G---GGTNSSGAMGTAYSSLTAKNAAGNDAEDAAHKLKTGQIP-KKYIVFMTDGDNNNDS 337 Query: 346 SNVN-----TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 S T CD AK I+I TI+ A P GQ LL+ C S +++ + L+ Sbjct: 338 SGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPPGGQALLQYCASDAAHYFQAEQMEDLLA 397 Query: 401 VFQNIS 406 F+ I Sbjct: 398 AFKAIG 403 >gi|227823417|ref|YP_002827390.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] gi|227342419|gb|ACP26637.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] Length = 413 Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 65/230 (28%), Positives = 102/230 (44%), Gaps = 30/230 (13%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKR--------------TKMAALKNALLLFLD 240 LV+D SGSM N+ S P + TK+ ALK A+ L Sbjct: 186 LVLDRSGSMAWKTNTINAAKKSCPNYTESNWSRYPNLWASSPCYVTKIDALKTAVTDLL- 244 Query: 241 SIDLLSHVKEDVYM--GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPA 297 LL + +Y+ I Y + + +WGT YV ++L+ T S A Sbjct: 245 -AQLLVADPDQIYVRTAAISYNSVQDTAGTLAWGTSGAAAYV----NALVATGGTASAGA 299 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 K AYQ + + + T + ++PS K+++F+TDGENN + T + CD A Sbjct: 300 FKTAYQKVIAATEN---TAHAAKNGQVPS----KYMVFMTDGENNYANDDTVTKQWCDTA 352 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 K N ++I +++ A GQ LLK C SS +++ L+ F+ I + Sbjct: 353 KANKVEIYSVAFMAPERGQALLKYCASSSSHYFEAEEVTDLVAAFKAIGE 402 >gi|15966595|ref|NP_386948.1| hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|307300370|ref|ZP_07580150.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|307319653|ref|ZP_07599079.1| TadE family protein [Sinorhizobium meliloti AK83] gi|15075867|emb|CAC47421.1| Hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|306894775|gb|EFN25535.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306904536|gb|EFN35120.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 410 Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 65/239 (27%), Positives = 109/239 (45%), Gaps = 39/239 (16%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT----------------KMAALKNALLLF 238 LV+D SGSM A +D D S P C + + K+ LK+A+ Sbjct: 174 LVLDRSGSM--AWKTDTVDT-SRPRCINWTASNWGESNVRATSPCYVDKITTLKSAVDKL 230 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + E + G Y R ++ + +WGT+ +V + +D+ TDS+ A Sbjct: 231 FTPLAKMDPGNEYLRAGAASYNDRQDRASKLTWGTKNASAHV-QGLDAT--GGTDSSSAF 287 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF----------KSNV 348 A + L D + + + G + P +K+I+F+TDGEN ++ K++ Sbjct: 288 AAAVEELLLDGENE--AHLAKNG-QTP----EKYIVFMTDGENTSYNGKTSPRDLEKADS 340 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 T C AK N I I T++ A G+ LLK C +SP+++ +A +L+ F+ I Q Sbjct: 341 VTKAACTTAKNNGIAIFTVAFMAPQRGKDLLKACATSPDHYKEADDAAALVSEFEKIGQ 399 >gi|254781108|ref|YP_003065521.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040785|gb|ACT57581.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 398 Score = 69.7 bits (169), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 39/118 (33%), Positives = 69/118 (58%), Gaps = 11/118 (9%) Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN---FKSNV 348 T++ PAM AY+ L ++K+ S T I S +KF+IF+TDGEN+ +++ + Sbjct: 275 TNTYPAMHHAYRELYNEKESSHNT--------IGSTRLKKFVIFITDGENSGASAYQNTL 326 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 NT++IC+ + +KI +++++A P GQ LL+ C S + V ++ L+ F I+ Sbjct: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 >gi|86359182|ref|YP_471074.1| hypothetical protein RHE_CH03592 [Rhizobium etli CFN 42] gi|86283284|gb|ABC92347.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 411 Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 79/186 (42%), Gaps = 9/186 (4%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 TK+ ALK A ++ + V G + Y +WG V YV Sbjct: 218 TKIEALKMAAGNLFSQLNSADPNAQYVRTGAVSYDINQYAPSSLAWGITGVSSYVNALQA 277 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T+S+ AM AY LT+ K +P +K+I+F+TDG+NNN Sbjct: 278 N---GGTNSSGAMNTAYTSLTAKNAAGNDVENSAHQQKTGQVP-KKYIVFMTDGDNNNDP 333 Query: 346 S-----NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 S + T K CD AK I+I TI+ A GQ LL C S +++ + L+ Sbjct: 334 SGGRSYDTATKKTCDDAKSKGIEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLA 393 Query: 401 VFQNIS 406 FQ I Sbjct: 394 AFQAIG 399 >gi|218662625|ref|ZP_03518555.1| hypothetical protein RetlI_26027 [Rhizobium etli IE4771] Length = 389 Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 46/142 (32%), Positives = 64/142 (45%), Gaps = 9/142 (6%) Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 +WG V YV T+S+ AM AY LT+ +K +P Sbjct: 240 AWGAIGVSSYVNALQAG---GGTNSSGAMSTAYLSLTAKNAAGNDAEDSAHKLKSGQIP- 295 Query: 330 QKFIIFLTDGENNNFKSNVN-----TIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 QK+I+F+TDG+NNN S T CD AK I+I TI+ A P GQ LL+ C S Sbjct: 296 QKYIVFMTDGDNNNDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPPGGQALLQYCAS 355 Query: 385 SPEYHYNVVNADSLIHVFQNIS 406 +++ + L F+ I Sbjct: 356 DASHYFQAEKMEDLFAAFKAIG 377 >gi|315122199|ref|YP_004062688.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495601|gb|ADR52200.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 463 Score = 67.8 bits (164), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 50/182 (27%), Positives = 93/182 (51%), Gaps = 20/182 (10%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT----RDMDS 286 +++AL F+ + + ++K+ + M + + R++ +WG ++ +Q V+ R ++ Sbjct: 277 VRDALATFIKRVRKIDNLKDKLRMSFMYFNERIDHYFPMTWGIKEFKQEVSSHYKRKHEN 336 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD P +++AY L S + + K S+ +KFI+ LTDG N Sbjct: 337 T---ATDIHPILQEAYNKLHSKNEDD-------EHKKKNSVEVKKFIVLLTDGAQNEGVH 386 Query: 347 NVNTI-KICDKAKENFIKIVTISINASPNGQR----LLKTCVSSPEYHYNVVNADSLIHV 401 +V+++ KICD AKE IKI TIS + + ++ L C +SP+ + +AD L + Sbjct: 387 SVDSVLKICDAAKEEGIKIFTISYSVDSSERKKANDFLSRC-ASPDKFFEAYDADKLNMI 445 Query: 402 FQ 403 F+ Sbjct: 446 FK 447 >gi|209550922|ref|YP_002282839.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536678|gb|ACI56613.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 411 Score = 67.4 bits (163), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 55/187 (29%), Positives = 80/187 (42%), Gaps = 11/187 (5%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 TK+ ALK A ++ E V G + Y +WGT V YV + Sbjct: 218 TKIEALKIAAGNLFGQLNSADPNAEYVRTGAVSYDINQYTPSNLAWGTAGVTSYV----N 273 Query: 286 SLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 +L T+S+ AM AY LT+ +K P +K+I+F+TDG+NN+ Sbjct: 274 ALQANGGTNSSGAMSTAYSSLTAKNAAGNDAEDSAHKLKTGQTP-KKYIVFMTDGDNNDD 332 Query: 345 KSNVN-----TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 S T CD AK I+I TI+ A GQ LL C S +++ + L+ Sbjct: 333 SSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPAGGQTLLHYCASDDSHYFQAEKMEDLL 392 Query: 400 HVFQNIS 406 F+ I Sbjct: 393 AAFKAIG 399 >gi|190893432|ref|YP_001979974.1| hypothetical protein RHECIAT_CH0003859 [Rhizobium etli CIAT 652] gi|190698711|gb|ACE92796.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 410 Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 44/142 (30%), Positives = 63/142 (44%), Gaps = 9/142 (6%) Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 +WGT V YV T+S+ AM AY LT+ +K P Sbjct: 261 AWGTSGVSTYVNALQAG---GGTNSSGAMSTAYSSLTAKNAAGNDAEDAAHKLKTGQTP- 316 Query: 330 QKFIIFLTDGENNNFKSNVN-----TIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 +K+I+F+TDG+NN+ S T CD AK I+I TI+ A GQ LL C S Sbjct: 317 KKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPEGGQALLHYCAS 376 Query: 385 SPEYHYNVVNADSLIHVFQNIS 406 +++ + L+ F+ I Sbjct: 377 DDSHYFQAEKMEDLLAAFKAIG 398 >gi|218515283|ref|ZP_03512123.1| hypothetical protein Retl8_17130 [Rhizobium etli 8C-3] Length = 329 Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 45/143 (31%), Positives = 66/143 (46%), Gaps = 11/143 (7%) Query: 270 SWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 +WGT V YV ++L T+S+ AM AY LT+ +K P Sbjct: 180 AWGTSGVSTYV----NALQAGGGTNSSGAMSTAYSSLTAKNAAGNDAEDAAHKLKTGQTP 235 Query: 329 FQKFIIFLTDGENNNFKSNVN-----TIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 +K+I+F+TDG+NN+ S T CD AK I+I TI+ A GQ LL C Sbjct: 236 -KKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPEGGQALLHYCA 294 Query: 384 SSPEYHYNVVNADSLIHVFQNIS 406 S +++ + L+ F+ I Sbjct: 295 SDDSHYFQAEKMEDLLAAFKAIG 317 >gi|254780833|ref|YP_003065246.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] gi|254040510|gb|ACT57306.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] Length = 371 Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 50/189 (26%), Positives = 92/189 (48%), Gaps = 15/189 (7%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ ++ LD I + V V GL+ +++++ + +WG + +++ + R Sbjct: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR---- 246 Query: 287 LIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 LI T STP ++ AY + K++ + + ++K+IIFLTDGEN++ Sbjct: 247 LIFGSTTKSTPGLEYAYNKIFDAKEK-------LEHIAKGHDDYKKYIIFLTDGENSSPN 299 Query: 346 -SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 N ++ C++AK + I + A Q LK C +SP+ Y+V N+ L F Sbjct: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNC-ASPDRFYSVQNSRKLHDAFLR 357 Query: 405 ISQLMVHRK 413 I + MV ++ Sbjct: 358 IGKEMVKQR 366 >gi|254780934|ref|YP_003065347.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] gi|254040611|gb|ACT57407.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] Length = 374 Score = 62.4 bits (150), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 86/400 (21%), Positives = 169/400 (42%), Gaps = 51/400 (12%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 K +I+ A+ + L++G +I V + K + S + +++ A+++++ Sbjct: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN-------- 67 Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSG--YSAVFYNTEIQNIVNSSRISMTHMANNR 136 E N+ K+ D IKN S F N +I +IV S+ + + + N Sbjct: 68 -EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-DIDDIVRSTSLDIVVVPQNE 125 Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 Y++ ++ Y L+F + ++ IV I + +++ + +V Sbjct: 126 --------GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 +D+S SM D TK+ ++ L+ + L+ V V GL Sbjct: 178 LDVSRSMESFF--------------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 Query: 257 IGYTTRVEKNIEPSWGTEKVR---QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 + ++ ++E+ WG ++ +Y+++ S T+STP +K AY + F Sbjct: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-----TNSTPGLKYAYN-------QIF 271 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 RQ ++K I+F+TDGEN + K + ++ C++AK+ + I I Sbjct: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-I 330 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 L+ C +SP Y V N S+ F +I + +V ++ Sbjct: 331 RSHEFLRAC-ASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 >gi|15891094|ref|NP_356766.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] gi|15159433|gb|AAK89551.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] Length = 412 Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 52/197 (26%), Positives = 93/197 (47%), Gaps = 25/197 (12%) Query: 227 KMAALKNALLLFLDSID------LLSHVKEDVYMGLIGYT--TRVEKNIEPSWGTEKVRQ 278 K A+LK A+ + +++ ++ E V G YT T V ++I WGT V Sbjct: 214 KAASLKTAVGFLVATLNKADPTYTVNGGSELVRTGASVYTHETYVAQSI--GWGTSGVTS 271 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 YV + + TD+ ++ AY L + + R + S F+++I+ +TD Sbjct: 272 YVDKQIPEFPSGGTDARSSLNAAYNAL-----KKANPDEARYHKEKGSESFERYIVLMTD 326 Query: 339 GE--------NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 GE N++ +V T C+ AK++ IKI +++ A G+ LL+ C SS + +Y Sbjct: 327 GEMTGNSAAWNSSIDQSVRTT--CETAKKDGIKIFSVAFMAPDKGKSLLQYCASSADNYY 384 Query: 391 NVVNADSLIHVFQNISQ 407 N + ++ F I++ Sbjct: 385 APENMEQIVTAFGEIAR 401 >gi|222149754|ref|YP_002550711.1| hypothetical protein Avi_3756 [Agrobacterium vitis S4] gi|221736736|gb|ACM37699.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 437 Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 56/198 (28%), Positives = 86/198 (43%), Gaps = 28/198 (14%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVK------EDVYMGLIGYTTRVEKNIEPSWGTEKV 276 K TK+ +LK A+ DL + +K E V G Y + SWGT V Sbjct: 244 KNYTKIESLKLAV------ADLAAQLKKADPNSEYVRTGADSYNASADTAQAMSWGTANV 297 Query: 277 RQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 YV ++L TD+ A+ AY L + K + KI ++I+F Sbjct: 298 VTYV----NALSATGGTDARGALSAAYSALQTSNKTEITAHNVSSVSKIG-----RYIVF 348 Query: 336 LTDGE---NNNFKSNVNTIKI---CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 +TDGE N++ S+ + C K + I+I T++ A NG+ LL C S ++ Sbjct: 349 MTDGEMTGNSSSWSSSIDSAVRSQCTSIKADGIQIYTVAFMAPANGKSLLSACASDASHY 408 Query: 390 YNVVNADSLIHVFQNISQ 407 Y +A SL+ F I + Sbjct: 409 YEATDAASLVAAFGEIGK 426 >gi|218506715|ref|ZP_03504593.1| hypothetical protein RetlB5_03444 [Rhizobium etli Brasil 5] Length = 269 Score = 59.7 bits (143), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 52/186 (27%), Positives = 79/186 (42%), Gaps = 9/186 (4%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 TK+ ALK A ++ E V G + Y + +WG V YV ++ Sbjct: 76 TKIEALKIAAGNLFSQLNSADPNAEYVRTGAVSYDLVEYTPSKLAWGITAVTSYVNA-LE 134 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 S T+S+ A+ AY LT+ +K LP +K+I+F+TDG+NN+ Sbjct: 135 SG--GGTNSSGAVNTAYTSLTAKNAAGNDAEDAAHKLKTGQLP-KKYIVFMTDGDNNDDS 191 Query: 346 SNVN-----TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 T CD AK I+ TI+ A GQ LL C S +++ + L+ Sbjct: 192 RGGRSYDTLTKATCDTAKAKGIETYTIAFMAPEGGQALLHYCASDDAHYFQAEKMEDLLA 251 Query: 401 VFQNIS 406 F+ I Sbjct: 252 AFKAIG 257 >gi|116253849|ref|YP_769687.1| hypothetical protein RL4112 [Rhizobium leguminosarum bv. viciae 3841] gi|115258497|emb|CAK09601.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 398 Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 69/238 (28%), Positives = 103/238 (43%), Gaps = 39/238 (16%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR----TKMAALKNALLLFLDSIDLL--- 245 +ELV+D SGSM E + K TK+ ALK A D++D Sbjct: 166 MELVLDQSGSMG-------EKTTTCATYNGKNCKTYVTKIDALKKAADALFDALDTADPD 218 Query: 246 -SHVKEDVYM---GLI--GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 S V+ Y GLI T+++ +WGT YV+ S TD+T M+ Sbjct: 219 HSLVRTGAYSYNNGLIYNSQKTQIKSMSGMAWGTATTATYVSGITAS---GGTDATEPMR 275 Query: 300 QAYQILTSDKKRSFFTNFFRQG--VKIPSLPFQKFIIFLTDGE--------NNNFKSNVN 349 QA S K S ++ Q VK ++ ++II +TDGE ++F NV Sbjct: 276 QA---TLSIAKASDGSDVETQAHAVKGNTI-VSRYIILMTDGEMTGNTGVWQSSFDQNVR 331 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 CD K IKI T++ A G++LL+ C S +Y + L+ F +I++ Sbjct: 332 --NQCDATKTAGIKIFTVAFMAPDKGKQLLQYCASPGGNYYEAETMEKLVASFTSIAK 387 >gi|332716587|ref|YP_004444053.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] gi|325063272|gb|ADY66962.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] Length = 412 Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 65/240 (27%), Positives = 101/240 (42%), Gaps = 33/240 (13%) Query: 195 LVVDLSGSMH------------CA------MNSDPEDVNSAPICQDKKRTKMAALKNALL 236 LV+D SGSM C S P N++P C K T + L+ Sbjct: 168 LVLDRSGSMSFKTDTLNTKKTSCQNYTVDNWGSYPNLKNTSP-CYVNKATSLKTAVGYLV 226 Query: 237 LFLDSID---LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 L+ D + E V G YT +WGT V YV + + TD Sbjct: 227 ATLNKADPTYTANGGSELVRTGASVYTHETYAAQPITWGTSSVATYVDKQIPEFPSGGTD 286 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTI- 351 + ++ AY L KK + + K S F+++I+ +TDGE N S ++I Sbjct: 287 ARSSLNAAYNAL---KKANTVEAKEHKDKKSES--FERYIVLMTDGEMTGNSSSWSSSID 341 Query: 352 ----KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 CD AK++ IKI +++ A G+ LL+ C SS + +Y N + ++ F I++ Sbjct: 342 QTVRNTCDTAKKDGIKIFSVAFMAPDKGKSLLQHCASSLDNYYAPENMEQIVTAFGEIAR 401 >gi|307945905|ref|ZP_07661241.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] gi|307771778|gb|EFO31003.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] Length = 432 Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 55/216 (25%), Positives = 95/216 (43%), Gaps = 32/216 (14%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D SGSM ++ ++M+ LK A+ + + + + Sbjct: 237 MALVLDRSGSMGW-----------------ERPSRMSELKKAVRSLIKELQTVDPDDQFT 279 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G Y E +W VR +V S T + PA+++A L ++ + + Sbjct: 280 RLGAYAYHWYYAGKKELTWNKNSVRSWVNSLPAS---GGTRAAPAIQKAKNDLLTNSELN 336 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISINA 371 N Q P L FI+++TDG + N K C AK I I T++ A Sbjct: 337 AHINKNEQE---PDL----FILYMTDG----IDGDPNWAKRECTSAKNAGITIYTVAFKA 385 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 +G+ LLK C +S ++Y+ NA+ L VF++I++ Sbjct: 386 PASGRNLLKACATSDAHYYDAKNANELNKVFKDIAR 421 >gi|218678237|ref|ZP_03526134.1| hypothetical protein RetlC8_04927 [Rhizobium etli CIAT 894] Length = 120 Score = 57.0 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 32/89 (35%), Positives = 48/89 (53%), Gaps = 6/89 (6%) Query: 322 VKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENFIKIVTISINASPNGQ 376 +K +P +K+I+F+TDG+NNN S + T K CD AK I+I TI+ A GQ Sbjct: 20 LKTGQIP-KKYIVFMTDGDNNNDSSGGRSYDTATKKTCDDAKSKGIEIYTIAFMAPAGGQ 78 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 LL C S +++ + L+ F+ I Sbjct: 79 ALLHYCASDDSHYFQAEKMEDLLAAFEAI 107 >gi|254780388|ref|YP_003064801.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] gi|254040065|gb|ACT56861.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] Length = 458 Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 50/183 (27%), Positives = 84/183 (45%), Gaps = 22/183 (12%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD--MDSLI 288 +++AL + SI + +V + V MG + RV + SWG K+ + + + +D Sbjct: 274 VRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENE 333 Query: 289 LKPTDSTPAMKQAYQILTS---DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T AM+ AY + S D+ N L +K+I+ LTDGEN + Sbjct: 334 MGSTAINDAMQTAYDTIISSNEDEVHRMKNN----------LEAKKYIVLLTDGENT--Q 381 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQR----LLKTCVSSPEYHYNVVNADSLIHV 401 N I IC+KAK I+I+TI+ + + Q L C +SP + + L + Sbjct: 382 DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNC-ASPNSFFEANSTHELNKI 440 Query: 402 FQN 404 F++ Sbjct: 441 FRD 443 Score = 47.0 bits (110), Expect = 0.006, Method: Compositional matrix adjust. Identities = 32/125 (25%), Positives = 66/125 (52%), Gaps = 7/125 (5%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 ++F FY KK I S +F II AL + L + G L+ V+ W Y +++++ A AI+ Sbjct: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + ++ +L + R ++ K+ + + R +N++K++ F + E+++IV Sbjct: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKN-------FTDREVRDIVRD 117 Query: 125 SRISM 129 + + M Sbjct: 118 TAVEM 122 >gi|154250683|ref|YP_001411507.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] gi|154154633|gb|ABS61850.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] Length = 436 Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 36/119 (30%), Positives = 59/119 (49%), Gaps = 20/119 (16%) Query: 312 SFFTNFFRQGVKIPSLP-----FQKFIIFLTDGEN--NNFKSNVNTIK-----ICDKAKE 359 ++ N QG + S K I+FLTDG+N N + +N NTI IC+ K Sbjct: 322 AWGWNMLTQGGPLSSAAAPAANLDKVIVFLTDGDNTRNRWSNNSNTINARTTLICNNIKA 381 Query: 360 NFIKIVTISI---NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 IK+ ++ + NA+ L++ C + P +Y+V A L VF +I+Q + + + S Sbjct: 382 AGIKVYSVRVIEGNAT-----LIRNCATEPGMYYSVTTASELTSVFASIAQSLSNLRIS 435 >gi|126730251|ref|ZP_01746062.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] gi|126708984|gb|EBA08039.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] Length = 614 Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust. Identities = 30/119 (25%), Positives = 59/119 (49%), Gaps = 4/119 (3%) Query: 296 PAMKQAYQILTSDKKRSFFTN-FFRQGVKIPSLPFQKFIIFLTDGEN--NNFKSNVNTIK 352 P ++ +Q L + K +++ N ++ Q + + E + ++N N Sbjct: 492 PPVRLTWQELFGNVKTTYYANAWYWQAYMDGRASYNDYYNAYYSWETTVDASQANTNLAT 551 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 IC KAK+ + I TI + A G ++ C SS ++YN V+++ L+ F++IS ++V Sbjct: 552 ICAKAKQQDVTIFTIGVEAPQAGLNAMRNCASSASHYYN-VSSNQLVDTFRSISDVVVE 609 >gi|315498201|ref|YP_004087005.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416213|gb|ADU12854.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 570 Score = 47.0 bits (110), Expect = 0.006, Method: Compositional matrix adjust. Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 20/99 (20%) Query: 325 PSLPFQ-----------KFIIFLTDGEN--NNFKSNVNTIKI-----CDKAKENFIKIVT 366 P LPF K++I +TDGEN N + ++ +TI C AK+ I + T Sbjct: 463 PELPFNTAKPYSDKTNYKYMIVITDGENTQNRWSTSASTINARTLLACQAAKDLGITVYT 522 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + +LK+C S PEY Y+V + L + Sbjct: 523 IRV--MEGNSDMLKSCASRPEYFYDVTASSQLTSTLAKV 559 >gi|259416688|ref|ZP_05740608.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259348127|gb|EEW59904.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 583 Score = 45.1 bits (105), Expect = 0.020, Method: Compositional matrix adjust. Identities = 22/62 (35%), Positives = 34/62 (54%), Gaps = 3/62 (4%) Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---ADSLIHVFQNIS 406 T+ +CD AKE I + TI A GQ++L+ C SS ++Y+V +D+ + I Sbjct: 518 TLAVCDAAKEKGIVVFTIGFEAPWRGQQVLQQCASSASHYYDVDGLEISDAFASIASAIR 577 Query: 407 QL 408 QL Sbjct: 578 QL 579 >gi|329850249|ref|ZP_08265094.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328840564|gb|EGF90135.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 412 Score = 45.1 bits (105), Expect = 0.021, Method: Compositional matrix adjust. Identities = 100/459 (21%), Positives = 170/459 (37%), Gaps = 121/459 (26%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWH---YKKNSMESANN 58 H + FR + + N +IFALSV ++ GF+ +D+ Y + ++ A + Sbjct: 9 HRFAGFRRRLGESCRDQSGNVIMIFALSV---FVIFGFVGAAIDFSRVDYARRRLQDAAD 65 Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF-YNTE 117 +A+L + +S ++ ++ D K F +N G+ V+ N Sbjct: 66 SAVLRAMA------------LKSATDESRGVAAD--KAFAENF------GHPGVYDLNGA 105 Query: 118 IQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 ++ VN + IS T+ + + SS F+ D Y +VS Sbjct: 106 LKREVNENIISQTYTVHATV-SSYFGAFFGKD------------------SYPVTVVSQA 146 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL-KNALL 236 L +F I V+D +GSM + N P + + MA L +N Sbjct: 147 KTSL-------DVFEIAFVLDTTGSM--------AEANKMPNLKSSVDSAMAGLLQNGKN 191 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 L I ++ V + TT + + WG D D ++ Sbjct: 192 LSGSKIAVVPF-NTQVRLSDATVTTMSSQGLSSGWGN------CVHDRDLATSHDVSASA 244 Query: 297 AMKQAYQIL----TSDKK---------------RSF--------FTNF---FRQGVKI-- 324 A K Q L T D+ R+F +TN + G+++ Sbjct: 245 AQKGKAQTLYPLETCDEASLKPVQGLSDNISSARNFIKTLQPGGYTNVTMGVQWGMEVLS 304 Query: 325 PSLPF-----------QKFIIFLTDGENN-NFKS------NVNTIKICDKAKENFIKIVT 366 P+ PF +KF+I +TDG+N +F S + T C+ AK I + T Sbjct: 305 PNQPFSDATEFGSTKARKFMIVVTDGDNTKSFTSWSASVIDKRTALACENAKAKGITVYT 364 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + I +L+ C S+PEY Y++ +A+ L I Sbjct: 365 VKI--IQGNSNMLRKCASAPEYFYDLTSANQLNAAMSGI 401 >gi|218528586|ref|YP_002419402.1| hypothetical protein Mchl_0543 [Methylobacterium chloromethanicum CM4] gi|218520889|gb|ACK81474.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 518 Score = 43.1 bits (100), Expect = 0.087, Method: Compositional matrix adjust. Identities = 27/122 (22%), Positives = 51/122 (41%), Gaps = 33/122 (27%) Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNF-----------------------KSNVNTIK-- 352 F+ GV + +KFI+ +TDG+N + SN N ++ Sbjct: 387 FKDGVAYGEIKTKKFIVLMTDGQNQSAVSSSDNRSYYSGLGFIWQNRIGTTSNDNAVRTK 446 Query: 353 --------ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +CD ++ I++ + + + +LK C +SP ++V N+ L VF+ Sbjct: 447 AIDTRLTLLCDNIRKARIQVFAVRVEVNDGDSAVLKACATSPNMFFDVKNSSGLPAVFRA 506 Query: 405 IS 406 I+ Sbjct: 507 IA 508 >gi|167644155|ref|YP_001681818.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] gi|167346585|gb|ABZ69320.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] Length = 562 Score = 42.7 bits (99), Expect = 0.10, Method: Compositional matrix adjust. Identities = 18/55 (32%), Positives = 30/55 (54%) Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + C AK ++I I + S + + +L+ C S PE +Y+V +A L VF I+ Sbjct: 498 RACTNAKAAGVQIYAIGVGVSSHSRGILQDCASKPEMYYDVTDAAQLASVFNTIA 552 >gi|294498950|ref|YP_003562650.1| 3-ketoacyl-acyl carrier protein reductase [Bacillus megaterium QM B1551] gi|294348887|gb|ADE69216.1| 3-ketoacyl-acyl carrier protein reductase [Bacillus megaterium QM B1551] Length = 239 Score = 42.4 bits (98), Expect = 0.15, Method: Compositional matrix adjust. Identities = 32/133 (24%), Positives = 61/133 (45%), Gaps = 3/133 (2%) Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSA--VFYNTEIQNIVNSSRISMTHMANNRLDS 139 I++ A R L +D K I+ H+ + +G +FY + +N+ + ++ N LD Sbjct: 35 INDEALRQLAEDTKGHIETHVVDVKNGKDVKDLFYQIKDKNLYGLVNNAGVYLGKNLLDY 94 Query: 140 SNNTIFYNMDV-MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD 198 N I + MD+ + Y Y ++ LL ++ + + I ++ +E I+ + Sbjct: 95 EENEIDFVMDINIKGYVYFSKYFGELLMKKETEGAIINISSVSGMEGSSDAIYGMSKAAV 154 Query: 199 LSGSMHCAMNSDP 211 L + CAMN P Sbjct: 155 LGLTKSCAMNFSP 167 >gi|99081991|ref|YP_614145.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] gi|99038271|gb|ABF64883.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] Length = 582 Score = 41.6 bits (96), Expect = 0.22, Method: Compositional matrix adjust. Identities = 20/65 (30%), Positives = 33/65 (50%), Gaps = 3/65 (4%) Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---ADSLIHVFQ 403 + T+ IC+ AK + + TI A GQ +L+ C SS ++Y+V +D+ + Sbjct: 514 DARTLDICEAAKAKGVVVFTIGFEAPSRGQEVLQACASSASHYYDVDGLEISDAFASIAS 573 Query: 404 NISQL 408 I QL Sbjct: 574 AIRQL 578 >gi|260576512|ref|ZP_05844501.1| conserved hypothetical protein [Rhodobacter sp. SW2] gi|259021235|gb|EEW24542.1| conserved hypothetical protein [Rhodobacter sp. SW2] Length = 529 Score = 41.6 bits (96), Expect = 0.27, Method: Compositional matrix adjust. Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 1/67 (1%) Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + +V +ICD AK++ I I +I A NG+ L+ C S P ++N + F+ Sbjct: 459 QKDVRLQQICDAAKDSGIVIFSIGFEAPENGRNQLRDCASQPSNYFNATGV-QITTAFRA 517 Query: 405 ISQLMVH 411 I+ + H Sbjct: 518 IATQLSH 524 >gi|315122347|ref|YP_004062836.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495749|gb|ADR52348.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 362 Score = 41.2 bits (95), Expect = 0.28, Method: Compositional matrix adjust. Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 3/85 (3%) Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 +K +IFLTDGEN + TI++C+ K+ K V SI + + + +L+ C SS Sbjct: 276 KKIVIFLTDGENMIINATRKTIELCNDMKKK--KAVIYSIALAVDNKEVLQGCSSSGNV- 332 Query: 390 YNVVNADSLIHVFQNISQLMVHRKY 414 Y +A SL+ + I + ++ +Y Sbjct: 333 YAADDAQSLVQAYSLIGKDVMKDEY 357 >gi|294678572|ref|YP_003579187.1| hypothetical protein RCAP_rcc03056 [Rhodobacter capsulatus SB 1003] gi|294477392|gb|ADE86780.1| conserved hypothetical protein [Rhodobacter capsulatus SB 1003] Length = 647 Score = 40.8 bits (94), Expect = 0.45, Method: Compositional matrix adjust. Identities = 21/60 (35%), Positives = 33/60 (55%), Gaps = 1/60 (1%) Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + T K+CD AK I I +++ +A G+ LLK C S Y+Y V ++ L F +I+ Sbjct: 579 DARTKKLCDLAKSKGIYIFSVAADAPSGGKTLLKYCSSGTSYYYEVQGSN-LSTAFASIA 637 >gi|254501086|ref|ZP_05113237.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] gi|222437157|gb|EEE43836.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] Length = 465 Score = 40.4 bits (93), Expect = 0.51, Method: Compositional matrix adjust. Identities = 20/57 (35%), Positives = 28/57 (49%), Gaps = 2/57 (3%) Query: 353 ICDKAKENFIKIVTI--SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 +CD K I+I T+ S G L+ C S +Y+Y N D LI F NI++ Sbjct: 400 LCDAMKAENIEIFTVFFDTGGSAFGDDLMSYCASGSDYYYEADNKDELITAFSNIAK 456 >gi|254477542|ref|ZP_05090928.1| conserved hypothetical protein [Ruegeria sp. R11] gi|214031785|gb|EEB72620.1| conserved hypothetical protein [Ruegeria sp. R11] Length = 523 Score = 40.0 bits (92), Expect = 0.74, Method: Compositional matrix adjust. Identities = 22/65 (33%), Positives = 32/65 (49%), Gaps = 3/65 (4%) Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---ADSLIHVFQ 403 N T ICD AK I + TI A G +LK C SS ++++V +D+ + Sbjct: 455 NARTKAICDAAKARGIVVYTIGFEAPSGGVSVLKDCASSDAHYFDVQGLEISDAFASIAT 514 Query: 404 NISQL 408 +I QL Sbjct: 515 SIRQL 519 >gi|56696619|ref|YP_166980.1| hypothetical protein SPO1742 [Ruegeria pomeroyi DSS-3] gi|56678356|gb|AAV95022.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 558 Score = 39.7 bits (91), Expect = 0.96, Method: Compositional matrix adjust. Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 3/49 (6%) Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 T +CD AK+ I + T+ A +G+R+LK C SS ++Y +AD L Sbjct: 493 TDHVCDAAKDEGIIVYTVGFEAPYSGRRVLKRCASSDSHYY---DADGL 538 >gi|329850248|ref|ZP_08265093.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328840563|gb|EGF90134.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 575 Score = 39.3 bits (90), Expect = 1.2, Method: Compositional matrix adjust. Identities = 28/95 (29%), Positives = 39/95 (41%), Gaps = 9/95 (9%) Query: 318 FRQGVKIPSLPFQKFIIFLTDGEN-------NNFKSNVNTIKICDKAKENFIKIVTISIN 370 F +G K++I LTDG N NN + N C AK I + T+ + Sbjct: 472 FSEGGAFTDKAVLKYMIVLTDGINTQNRWTTNNSQINARLALACTNAKNLGITVFTVRVE 531 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + L+ C S Y+YN+ NAD L I Sbjct: 532 QGDS--TTLQNCASQTAYYYNLSNADQLPATMSKI 564 >gi|163742980|ref|ZP_02150363.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] gi|161383663|gb|EDQ08049.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] Length = 560 Score = 39.3 bits (90), Expect = 1.3, Method: Compositional matrix adjust. Identities = 21/70 (30%), Positives = 33/70 (47%), Gaps = 3/70 (4%) Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---ADSL 398 N + T +C+ AK I + TI A +G +LK C SS +H++V D+ Sbjct: 487 NTSTKDARTRAVCNAAKNQGIVVYTIGFEAPSSGTAVLKDCASSDAHHFDVRGLEIRDAF 546 Query: 399 IHVFQNISQL 408 + +I QL Sbjct: 547 ASIATSIRQL 556 >gi|86137906|ref|ZP_01056482.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] gi|85825498|gb|EAQ45697.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] Length = 543 Score = 39.3 bits (90), Expect = 1.3, Method: Compositional matrix adjust. Identities = 21/71 (29%), Positives = 34/71 (47%), Gaps = 3/71 (4%) Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---ADS 397 + N N T +C+ AK I + TI A NG +L+ C SS ++++V D+ Sbjct: 469 HGNSTKNARTRSVCEAAKAKGIVVYTIGFEAPSNGVAVLRDCASSDAHYFDVDGLEIKDA 528 Query: 398 LIHVFQNISQL 408 + +I QL Sbjct: 529 FASIATSIRQL 539 >gi|254450361|ref|ZP_05063798.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|254450938|ref|ZP_05064375.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198264767|gb|EDY89037.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198265344|gb|EDY89614.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 75 Score = 38.9 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 17/55 (30%), Positives = 28/55 (50%) Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 N +++ IC A+ + I T++ A GQ L+ C SSP +H++V D Sbjct: 2 NGTEADARLSDICAAARAQGVVIYTVAFEAPSGGQSALQDCASSPSHHFDVNGTD 56 >gi|315499132|ref|YP_004087936.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315417144|gb|ADU13785.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 519 Score = 38.9 bits (89), Expect = 1.4, Method: Compositional matrix adjust. Identities = 23/83 (27%), Positives = 36/83 (43%), Gaps = 9/83 (10%) Query: 330 QKFIIFLTDGEN-------NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 +K++I +TDG N +N N T C AK I + + + LL+ C Sbjct: 428 KKYMIIVTDGANTQNRWSTSNSAINARTALACTAAKAQGITLFVVRVE--DGDSSLLEAC 485 Query: 383 VSSPEYHYNVVNADSLIHVFQNI 405 S Y+Y++ A L Q+I Sbjct: 486 ASQSSYYYDLSQASDLTKTMQDI 508 >gi|163738634|ref|ZP_02146048.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] gi|161387962|gb|EDQ12317.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] Length = 558 Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust. Identities = 21/70 (30%), Positives = 33/70 (47%), Gaps = 3/70 (4%) Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---ADSL 398 N + T +C+ AK I + TI A +G +LK C SS +H++V D+ Sbjct: 485 NTSTKDARTRAVCNAAKNQGIVVYTIGFEAPSSGTAVLKDCASSDAHHFDVRGLEIRDAF 544 Query: 399 IHVFQNISQL 408 + +I QL Sbjct: 545 ASIATSIRQL 554 >gi|110679843|ref|YP_682850.1| hypothetical protein RD1_2614 [Roseobacter denitrificans OCh 114] gi|109455959|gb|ABG32164.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 488 Score = 38.1 bits (87), Expect = 2.7, Method: Compositional matrix adjust. Identities = 26/93 (27%), Positives = 37/93 (39%), Gaps = 30/93 (32%) Query: 330 QKFIIFLTDG--------ENNNFKSN----------------------VNTIKICDKAKE 359 +KFI+ +TDG E+ N N N IC+KAK Sbjct: 373 RKFIVLMTDGQITDQFRPEDKNDPKNDEIALNQRIGDRDTYATQSTNVANFYSICNKAKA 432 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 I + TI+ A N ++TC +SP + Y V Sbjct: 433 AGITVYTIAFEAPANAITQMRTCATSPAFFYKV 465 >gi|320584188|gb|EFW98399.1| carboxypeptidase Y receptor, putative [Pichia angusta DL-1] Length = 1512 Score = 37.7 bits (86), Expect = 3.7, Method: Compositional matrix adjust. Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 3/58 (5%) Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSN--NTIFYNMDVMTSYDYRLQFIEHLL 165 SAVF ++ NI+ + +I M+H+ + RLD S NT Y++D S++ +F +H+L Sbjct: 461 SAVFTFADLGNIIVAIQIDMSHLLDVRLDPSTLPNTFLYSLDQGDSWE-EAEFGDHIL 517 >gi|218462234|ref|ZP_03502325.1| hypothetical protein RetlK5_23393 [Rhizobium etli Kim 5] Length = 66 Score = 37.4 bits (85), Expect = 4.9, Method: Composition-based stats. Identities = 18/52 (34%), Positives = 26/52 (50%) Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 CD AK I+I TI+ A GQ LL C S +++ + L+ F+ I Sbjct: 2 CDTAKSKGIEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFKAI 53 >gi|84688081|ref|ZP_01015939.1| hypothetical protein 1099457000215_RB2654_05415 [Maritimibacter alkaliphilus HTCC2654] gi|84663909|gb|EAQ10415.1| hypothetical protein RB2654_05415 [Rhodobacterales bacterium HTCC2654] Length = 595 Score = 37.4 bits (85), Expect = 5.1, Method: Compositional matrix adjust. Identities = 22/79 (27%), Positives = 40/79 (50%), Gaps = 5/79 (6%) Query: 335 FLTDGEN--NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 FL D + N + N N +IC AK + + TI S + ++++C S+P Y+++V Sbjct: 513 FLADAHDYFNYSEKNDNLDEICTAAKNAGMVVFTIGFEVSGSQHDIMRSCASAPAYYFDV 572 Query: 393 VNAD---SLIHVFQNISQL 408 D + + + IS+L Sbjct: 573 DGLDISAAFAAIAREISKL 591 >gi|281351544|gb|EFB27128.1| hypothetical protein PANDA_007647 [Ailuropoda melanoleuca] Length = 478 Score = 37.0 bits (84), Expect = 5.5, Method: Compositional matrix adjust. Identities = 22/67 (32%), Positives = 33/67 (49%), Gaps = 1/67 (1%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLI 288 +N L FL DL+S + E +G G + N+ S G KV+QYV+ D+ S I Sbjct: 307 GYRNEPLFFLSKQDLISTIGESAALGAAGIVIWGDMNLTSSEGNCTKVKQYVSSDLGSYI 366 Query: 289 LKPTDST 295 + T + Sbjct: 367 VNVTKAA 373 >gi|126738776|ref|ZP_01754472.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] gi|126719957|gb|EBA16664.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] Length = 530 Score = 37.0 bits (84), Expect = 5.6, Method: Compositional matrix adjust. Identities = 17/52 (32%), Positives = 26/52 (50%) Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 + N N T +C AK I + TI A NG +L+ C SS ++++V Sbjct: 456 HGNSTKNTRTSNVCSAAKAQGIIVYTIGFEAPSNGVAVLQDCASSDSHYFDV 507 >gi|86141261|ref|ZP_01059807.1| hypothetical protein MED217_04567 [Leeuwenhoekiella blandensis MED217] gi|85831820|gb|EAQ50275.1| hypothetical protein MED217_04567 [Leeuwenhoekiella blandensis MED217] Length = 484 Score = 37.0 bits (84), Expect = 5.8, Method: Compositional matrix adjust. Identities = 39/147 (26%), Positives = 68/147 (46%), Gaps = 22/147 (14%) Query: 53 MESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKR--FIKNHIKESLSGYS 110 +E AILAG SK + + E++SN A AL DA + F++NHI Sbjct: 59 LEETGYDAILAG-SKTYTVWAPTN---EALSNVASEALASDAAKRLFVENHIAL------ 108 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY- 169 T + ++ + +++ ++N L+ N T+ + V+T+ Y + H++++ Sbjct: 109 -----TAVSSVTEADTVTVQMLSNKYLEFKNGTVMSDAQVVTADQYASNGLFHIVDKALA 163 Query: 170 -NQKIVSFIPALLRIEMGERPIFLIEL 195 Q I +I A +M FLIEL Sbjct: 164 PKQNIWEYINATADNKMSS---FLIEL 187 >gi|301767312|ref|XP_002919077.1| PREDICTED: hyaluronidase-4-like [Ailuropoda melanoleuca] Length = 481 Score = 37.0 bits (84), Expect = 6.3, Method: Compositional matrix adjust. Identities = 22/65 (33%), Positives = 33/65 (50%), Gaps = 1/65 (1%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILK 290 +N L FL DL+S + E +G G + N+ S G KV+QYV+ D+ S I+ Sbjct: 309 RNEPLFFLSKQDLISTIGESAALGAAGIVIWGDMNLTSSEGNCTKVKQYVSSDLGSYIVN 368 Query: 291 PTDST 295 T + Sbjct: 369 VTKAA 373 >gi|149202124|ref|ZP_01879097.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] gi|149144222|gb|EDM32253.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] Length = 584 Score = 37.0 bits (84), Expect = 6.7, Method: Compositional matrix adjust. Identities = 39/138 (28%), Positives = 57/138 (41%), Gaps = 13/138 (9%) Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 EP+ G E+V ++ R DS + TD + S+ F ++ L Sbjct: 426 EPT-GEEQVAYFLRRPSDSRMFNVTDGGSVTWSTFNNYNSNTLPGFISSERLSWETAWGL 484 Query: 328 PFQKFIIFLTD--GENNNFKSN-VNTIK-------ICDKAKENFIKIVTIS--INASPNG 375 +F T G NNF +N +N K IC +AK I I TI+ + + P G Sbjct: 485 MSPRFYGNTTGNWGPWNNFLNNPINRSKKDERLDDICREAKSEGIVIYTIAFEMGSQPTG 544 Query: 376 QRLLKTCVSSPEYHYNVV 393 +K C SS +HYN Sbjct: 545 ADKIKKCASSVNHHYNAT 562 >gi|145479603|ref|XP_001425824.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124392896|emb|CAK58426.1| unnamed protein product [Paramecium tetraurelia] Length = 344 Score = 36.6 bits (83), Expect = 8.3, Method: Compositional matrix adjust. Identities = 38/156 (24%), Positives = 71/156 (45%), Gaps = 11/156 (7%) Query: 82 ISNHAKRALIDDAKRFIKNHIKE-SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 I NH+ I + R +K +KE L EIQN N ++ + + ++ + Sbjct: 106 IGNHSGEEEIHENLRIVKEKLKELDLKVNCQEKEIQEIQNTYNQTKEELQQLKSD----N 161 Query: 141 NNTIFYNMDV---MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 N NM++ + SYD +++ ++HLL+ +K ++ ++I+ +R IEL Sbjct: 162 ENYKKDNMEIRQELQSYDSQIKKVQHLLSLLIQEKQLNGYDPPIQIQ-NKRSKTTIELGK 220 Query: 198 DLSGSMHCAMNSDPEDVNSAPI--CQDKKRTKMAAL 231 ++ + H D N +PI C+ + AAL Sbjct: 221 EIMNNSHTQQKKKQTDSNGSPIKQCETPLSQQQAAL 256 >gi|170751925|ref|YP_001758185.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] gi|170658447|gb|ACB27502.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] Length = 568 Score = 36.6 bits (83), Expect = 8.7, Method: Compositional matrix adjust. Identities = 23/61 (37%), Positives = 31/61 (50%), Gaps = 5/61 (8%) Query: 350 TIKICDKAKENFIKIVTI--SINASP---NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 T + C K I I TI S+ P GQ LL+ C SSP+ Y ++D LI F++ Sbjct: 497 TAEACANTKAVNISIYTIGFSVPTDPIDSAGQTLLRNCASSPDQFYLANSSDDLIKAFKS 556 Query: 405 I 405 I Sbjct: 557 I 557 >gi|295704271|ref|YP_003597346.1| 3-ketoacyl-acyl carrier protein reductase [Bacillus megaterium DSM 319] gi|294801930|gb|ADF38996.1| 3-ketoacyl-acyl carrier protein reductase [Bacillus megaterium DSM 319] Length = 239 Score = 36.2 bits (82), Expect = 8.9, Method: Compositional matrix adjust. Identities = 30/133 (22%), Positives = 59/133 (44%), Gaps = 3/133 (2%) Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSA--VFYNTEIQNIVNSSRISMTHMANNRLDS 139 I + A + L +D K I+ H+ + G +FY + +++ + ++ N LD Sbjct: 35 IKDGALKQLAEDTKGHIETHVVDVKKGNDVKDLFYQIKDKSLYGLVNNAGVYLGENLLDY 94 Query: 140 SNNTIFYNMDV-MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD 198 N I + MD+ + Y Y ++ LL ++ + + I ++ +E I+ + Sbjct: 95 EENEIDFVMDINIKGYVYFSKYFGELLMKKEIEGAIINISSVSGMEGSSDAIYGMSKAAI 154 Query: 199 LSGSMHCAMNSDP 211 L + CAMN P Sbjct: 155 LGLTKSCAMNFSP 167 Searching..................................................done Results from round 2 >gi|254781110|ref|YP_003065523.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040787|gb|ACT57583.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 420 Score = 488 bits (1256), Expect = e-136, Method: Composition-based stats. Identities = 420/420 (100%), Positives = 420/420 (100%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA Sbjct: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN Sbjct: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL Sbjct: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD Sbjct: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ Sbjct: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN Sbjct: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG Sbjct: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 >gi|315122479|ref|YP_004062968.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495881|gb|ADR52480.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 427 Score = 430 bits (1104), Expect = e-118, Method: Composition-based stats. Identities = 247/429 (57%), Positives = 322/429 (75%), Gaps = 13/429 (3%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+LL F F FKK I S KANFSI+F++ ++S LL IG LIYVLD+++KKN+ME+AN +A Sbjct: 1 MNLLRTFLFNFKKIILSPKANFSILFSVILISILLFIGILIYVLDYYHKKNAMENANTSA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 IL+GASK++S +S GD ++S+H RA++DD RFIK++IKESL S+VF +E Sbjct: 61 ILSGASKIISRISYFGD---NMSSHTHRAIVDDVTRFIKSYIKESLLMDSSVFDISEKNI 117 Query: 121 IVNSSRISMT--------HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 I +S++S+T H NN+ N FY++ V T YDY ++F ++LLN++ N K Sbjct: 118 ISQNSKVSITREPHPNVFHEFNNQSILQNKKTFYHISVETFYDYHIKFFDNLLNKKINSK 177 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC-QDKKRTKMAAL 231 I+SF+PAL++I+ GE P F ++LVVDLS SM C MNSDPE +C + KK +KM AL Sbjct: 178 IISFVPALVKIDTGEHPFFFVQLVVDLSASMSCLMNSDPEHATEFSVCGKSKKNSKMDAL 237 Query: 232 KNALLLFLDSIDLLSHVKEDV-YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K A+LLFLDS+D S ++D Y+GL GYTTRVEKNIEPSWGT KVR+Y+ ++D +L Sbjct: 238 KKAVLLFLDSVDRGSKTQKDTHYIGLTGYTTRVEKNIEPSWGTGKVRKYIVEEIDVNMLG 297 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 TDSTPAMK+AYQILTSDKKR+F N + +KIP LPFQKF+IFLTDGENN+ KS+V T Sbjct: 298 QTDSTPAMKKAYQILTSDKKRNFIRNILHKRIKIPPLPFQKFLIFLTDGENNDPKSDVKT 357 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 IKIC+KAK+N IKI+TISINAS NG+RLLK CVS+PEY+YNVV+ SL+ VFQ+IS L+ Sbjct: 358 IKICEKAKKNSIKILTISINASANGKRLLKKCVSAPEYYYNVVDTGSLLRVFQDISTLIT 417 Query: 411 HRKYSVILK 419 H KY VILK Sbjct: 418 HYKYQVILK 426 >gi|254780934|ref|YP_003065347.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] gi|254040611|gb|ACT57407.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] Length = 374 Score = 323 bits (827), Expect = 4e-86, Method: Composition-based stats. Identities = 86/410 (20%), Positives = 170/410 (41%), Gaps = 51/410 (12%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + + K +I+ A+ + L++G +I V + K + S + +++ A++++ Sbjct: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG--YSAVFYNTEIQNIVNSSRI 127 + E N+ K+ D IKN S F N +I +IV S+ + Sbjct: 67 N---------EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-DIDDIVRSTSL 116 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + + N Y++ ++ Y L+F + ++ IV I + +++ Sbjct: 117 DIVVVPQNE--------GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + +V+D+S SM D TK+ ++ L+ + L+ Sbjct: 169 DARLDMMIVLDVSRSMESFF--------------DSSITKIDMAIKSINAMLEEVKLIPD 214 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVR---QYVTRDMDSLILKPTDSTPAMKQAYQI 304 V V GL+ ++ ++E+ WG ++ +Y+++ S T+STP +K AY Sbjct: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-----TNSTPGLKYAYNQ 269 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + F RQ ++K I+F+TDGEN + K + ++ C++AK+ + Sbjct: 270 I-------FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 I I L+ C SP Y V N S+ F +I + +V ++ Sbjct: 323 YAIGIRV-IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 >gi|163760496|ref|ZP_02167578.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] gi|162282447|gb|EDQ32736.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] Length = 363 Score = 316 bits (808), Expect = 5e-84, Method: Composition-based stats. Identities = 88/402 (21%), Positives = 177/402 (44%), Gaps = 54/402 (13%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 +R +K + +E NF++I A +V + + + K +++A ++A LA Sbjct: 4 TRITSKIRKLLRNENGNFALIAAAAVPVLFMAGSLAVDTTNAMSMKVRLQNAVDSAALAT 63 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 A+++ E+++ +A A +F+ +KE ++ F T NI Sbjct: 64 AARLSEE--------ENLTAAQAQAF---ALKFVNGQVKEDFGAFN-GFSVTPTVNI--- 108 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + +++ T+ + + V L + ++ + ++ VS + Sbjct: 109 ----------DPVETGGRTV-WKVAVSMEGSQSLTPMARIMGK--DKLTVSVVGKSESAG 155 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + F + LV+D SGSM +N + K+ LK A+ ++ + Sbjct: 156 EA-QGAFSMALVLDRSGSMDWNLNG---------------QKKINVLKTAVGGLIEQFEE 199 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ V +G Y +++ + + W K +++V S TDST A AY Sbjct: 200 ADPERKYVRLGASSYNSKLTGSTKLRWNPGKTKEFVDALPAS---GGTDSTDAFDWAYTA 256 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 +T ++ + K +P +KFI+F+TDG+NN ++ +T +CD AK++ I++ Sbjct: 257 VTHKRENN------THDAKSGQVP-KKFIVFMTDGDNNYSSADSSTKHLCDDAKDDGIEV 309 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T++ A G++LL C S+ E+ ++ N+ LI F+NI Sbjct: 310 YTVAFAAPNRGKQLLSYCASTEEHFFDAQNSAQLIEAFKNIG 351 >gi|315122473|ref|YP_004062962.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495875|gb|ADR52474.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 403 Score = 309 bits (792), Expect = 5e-82, Method: Composition-based stats. Identities = 95/427 (22%), Positives = 190/427 (44%), Gaps = 49/427 (11%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 +L R F+ K ++ F I+ A + L+ + F+I + + KN ++S+ + AI+ Sbjct: 9 ILRRIYFFSK----NKSGVFHIMSASIIFVCLIFVSFVIDITHLLHMKNHIQSSLDNAII 64 Query: 63 AGASKMVSN--LSRLGDRFESISNHAKR-ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 +G S +VS+ ++ L + E I + K+ A ++ + F H + + F + Sbjct: 65 SGCSIVVSDPKINDLNPQEERIRDVIKKNAYVNMVQNFPAEHAAYIIENANISFS----K 120 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 ++ N +T A ++L N FI L I S Sbjct: 121 DLTNKYEYKITMEAKHQLSGKN------------------FILGFLMPNVITHISSISTG 162 Query: 180 LLRIEMGERPIFLIELVVDLSGSM------HCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 +++ + ++ F +E+V+D SGSM C ++S +K ++K+ ALK Sbjct: 163 IIQ-KPSDKKAFSVEMVLDCSGSMLDSMQESCDLSSGRGGYYFYSKNNNKPKSKIYALKT 221 Query: 234 ALLLFLDSIDLLSHV--KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 A F++ I + +GLI + + ++ + S +++ ++R Sbjct: 222 ASSDFVNLIQETVQTFPQISARIGLITFNHYIMQDSKLSNNFNVIKKTISRMK---PKGG 278 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 TD+ M AY+ L + + N ++P +++II +TDGENN+ ++ TI Sbjct: 279 TDTFLPMNAAYEYLNNIPNETKAHNISD------NVPLKRYIILMTDGENNHPSYDLKTI 332 Query: 352 KICDKAKENFIKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 +CD A++N I I +I +N +G L + C SS ++ + N +L+ F++I+ + Sbjct: 333 NVCDNARKNGIIIYSIFLNYYEYTDGYELARKCASSEKHFFYANNTKALLDSFKSIAHAI 392 Query: 410 VHRKYSV 416 + + Sbjct: 393 QDKAVRI 399 >gi|150397936|ref|YP_001328403.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150029451|gb|ABR61568.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 419 Score = 269 bits (688), Expect = 5e-70, Method: Composition-based stats. Identities = 91/411 (22%), Positives = 158/411 (38%), Gaps = 27/411 (6%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 F + + NF ++ AL L + G + V + KN ++ A +AA LA AS +VS Sbjct: 11 FMRMLRDRGGNFGMMTALVAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASALVS 70 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + + ++ + A S+ + E+ ++ T Sbjct: 71 DARPDIEEAKDLARKFLKTQAAAATASDLPDEGPSIGARGGGNADDEVPATPRWEDVNAT 130 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + + + + V + + + LL + + Sbjct: 131 EIDITATPNGAKGKSFQVTVANKHLLQFNAMTRLLGPESIEIETRSTAESATES---KNA 187 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR--------------TKMAALKNALL 236 + LV+D SGSM N+ P + TK+ ALK A+ Sbjct: 188 LSMYLVLDRSGSMAWKTNTINTGKAKCPNYTEANWSKYPDLKATGPCYVTKIDALKTAVG 247 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 L + V G I Y + + SWGT YV + + T S Sbjct: 248 DLLAQLVTADPESAYVRTGAISYNSAQDAASSLSWGTRGAAGYVDALVA---IGGTASGN 304 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K A+Q +T+ + S G K +P K+I+F+TDGENN+ + T + CD Sbjct: 305 AFKTAFQKVTNAAEDS------EHGAKNGQVP-TKYIVFMTDGENNHANDDTVTRQWCDT 357 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 AK + ++I +++ A GQ+LLK+C SS +++ A L+ F+ I + Sbjct: 358 AKASKVQIYSVAFMAPDRGQKLLKSCASSSSHYFEAEEASDLVAAFKAIGE 408 >gi|327189644|gb|EGE56794.1| hypothetical protein RHECNPAF_570041 [Rhizobium etli CNPAF512] Length = 415 Score = 268 bits (684), Expect = 1e-69, Method: Composition-based stats. Identities = 87/425 (20%), Positives = 145/425 (34%), Gaps = 60/425 (14%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + NF I+ A+ L G I V D K ++ A ++A Sbjct: 10 LRRMFSDRGGNFGIMTAILAPVLLGAAGLAIQVGDMLLSKQQLQEAADSA---------- 59 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + A+ F+ + L +I N++ +++ Sbjct: 60 --ALATATALGNGTIQTSQAEAFARNFVAGQMANYLQN---------GVDIKNATAVNV- 107 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE---MGE 187 + +S + Y + V SYD + + + S + Sbjct: 108 -----QTSNSGKSASYQVTVTPSYDLTVNPLMQAVGFSTQHLSTSSTTVSGPSQTPGSNS 162 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR---------------------T 226 + + L +D SGSM + +D + D T Sbjct: 163 QGSVSMFLALDKSGSMGDPTETVNKDQPTETFTYDCNPHLNKKGKWVYDTCTGSRTNYYT 222 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ ALK A + + V G + Y +WGT V YV Sbjct: 223 KIEALKMAAGNLFGQLTSADPDAQYVRTGAVSYDIDQYTPSTLAWGTSGVSSYVNALQAG 282 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+S+ AM AY LT+ +K +P +K+I+F+TDG+NNN S Sbjct: 283 ---GGTNSSGAMGTAYSSLTAKNAAGNDAEDAAHKLKTGQIP-KKYIVFMTDGDNNNDSS 338 Query: 347 -----NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + T CD AK I+I TI+ A P GQ LL+ C S +++ + L+ Sbjct: 339 GGRSYDTLTKATCDTAKSKGIEIYTIAFMAPPGGQALLQYCASDAAHYFQAEQMEDLLAA 398 Query: 402 FQNIS 406 F+ I Sbjct: 399 FKAIG 403 >gi|190893432|ref|YP_001979974.1| hypothetical protein RHECIAT_CH0003859 [Rhizobium etli CIAT 652] gi|190698711|gb|ACE92796.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 410 Score = 267 bits (682), Expect = 2e-69, Method: Composition-based stats. Identities = 90/422 (21%), Positives = 152/422 (36%), Gaps = 59/422 (13%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + NF I+ A+ L G I V D K ++ A ++A LA A+ + + Sbjct: 10 LRRMFSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLISKQQLQEAADSAALATATALAN 69 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +I A A+ F+ + L +I +++ +++ Sbjct: 70 G---------TIQTSQAEAF---ARNFVAGQMANYLQS---------GVDIKSATGVTV- 107 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + ++S N+ Y + V SYD + + + S + + Sbjct: 108 -----QTNTSGNSTSYQVTVSPSYDLTVNPLMQAVGFTTQHLSTS--GTTIGGHSQTQGS 160 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK---------------------RTKMA 229 + L +D SGSM + E+ + D TK+ Sbjct: 161 ISMYLALDKSGSMGEDTATVNEEDPTESYTYDCNGHYNKKGKWIYDTCTGSRANYYTKIE 220 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ALK A + + V G + Y +WGT V YV Sbjct: 221 ALKMAAGNLFGQLSSADPNAQYVRTGAVSYDIVQYTPSALAWGTSGVSTYVNALQAG--- 277 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS--- 346 T+S+ AM AY LT+ +K P +K+I+F+TDG+NN+ S Sbjct: 278 GGTNSSGAMSTAYSSLTAKNAAGNDAEDAAHKLKTGQTP-KKYIVFMTDGDNNDDSSGGR 336 Query: 347 --NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + T CD AK I+I TI+ A GQ LL C S +++ + L+ F+ Sbjct: 337 SYDTLTKATCDTAKSKGIEIYTIAFMAPEGGQALLHYCASDDSHYFQAEKMEDLLAAFKA 396 Query: 405 IS 406 I Sbjct: 397 IG 398 >gi|209550922|ref|YP_002282839.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536678|gb|ACI56613.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 411 Score = 262 bits (668), Expect = 1e-67, Method: Composition-based stats. Identities = 90/427 (21%), Positives = 148/427 (34%), Gaps = 60/427 (14%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 ++ + NF I+ A+ L + G I V D K ++ A ++A LA A+ Sbjct: 6 LYPRLRRMFSDRGGNFGIMTAIMAPVLLGVAGVAIQVGDMMLSKQQLQEAADSAALATAT 65 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + +I A A+ F+ + + + Sbjct: 66 ALANG---------TIQTSQAEAF---AQNFVAGQMANY---------------VQSGVD 98 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 N + +S + Y + V SYD + + + + S + Sbjct: 99 FKSGTSVNVQTSTSGKSTSYQVTVSPSYDLTVNPLMQAVGFKTQHLSTS--GTTVGGHSQ 156 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK---------------------- 224 + + L +D SGSM A + D + D Sbjct: 157 TQGSISMFLALDKSGSMGEATATVNADDPTESYTYDCNLHYNSKNNKWVYDKCTGSRTNY 216 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 TK+ ALK A ++ E V G + Y +WGT V YV Sbjct: 217 YTKIEALKIAAGNLFGQLNSADPNAEYVRTGAVSYDINQYTPSNLAWGTAGVTSYVNALQ 276 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T+S+ AM AY LT+ +K P +K+I+F+TDG+NN+ Sbjct: 277 AN---GGTNSSGAMSTAYSSLTAKNAAGNDAEDSAHKLKTGQTP-KKYIVFMTDGDNNDD 332 Query: 345 KS-----NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 S + T CD AK I+I TI+ A GQ LL C S +++ + L+ Sbjct: 333 SSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPAGGQTLLHYCASDDSHYFQAEKMEDLL 392 Query: 400 HVFQNIS 406 F+ I Sbjct: 393 AAFKAIG 399 >gi|86359182|ref|YP_471074.1| hypothetical protein RHE_CH03592 [Rhizobium etli CFN 42] gi|86283284|gb|ABC92347.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 411 Score = 260 bits (664), Expect = 3e-67, Method: Composition-based stats. Identities = 90/423 (21%), Positives = 146/423 (34%), Gaps = 60/423 (14%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + NF I+ A+ L G I V D K ++ A ++A LA A+ + + Sbjct: 10 LRRMFSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLLSKQQLQEAADSAALATATALAN 69 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +I A A+ F+ + L + I T Sbjct: 70 G---------TIQTTEAEAF---ARNFVAGQMANYLQS---------------GTDIKST 102 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 N + +S + Y + V +Y + + + S + + Sbjct: 103 TSVNVQTTTSGKSTSYQVTVSPAYVLTVNPLMQAVGFTTQHLSTS--GTTIGGHSQTQGS 160 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK----------------------RTKM 228 + L +D SGSM + E+ + D TK+ Sbjct: 161 ISMFLALDKSGSMGEDTATVNEESPTESYTYDCNLHYNTKNNKWVYDKCTGSRTNYYTKI 220 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ALK A ++ + V G + Y +WG V YV + Sbjct: 221 EALKMAAGNLFSQLNSADPNAQYVRTGAVSYDINQYAPSSLAWGITGVSSYVNALQAN-- 278 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF---- 344 T+S+ AM AY LT+ K +P +K+I+F+TDG+NNN Sbjct: 279 -GGTNSSGAMNTAYTSLTAKNAAGNDVENSAHQQKTGQVP-KKYIVFMTDGDNNNDPSGG 336 Query: 345 -KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + T K CD AK I+I TI+ A GQ LL C S +++ + L+ FQ Sbjct: 337 RSYDTATKKTCDDAKSKGIEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFQ 396 Query: 404 NIS 406 I Sbjct: 397 AIG 399 >gi|241206334|ref|YP_002977430.1| hypothetical protein Rleg_3648 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240860224|gb|ACS57891.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 400 Score = 256 bits (653), Expect = 5e-66, Method: Composition-based stats. Identities = 91/423 (21%), Positives = 156/423 (36%), Gaps = 60/423 (14%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + NF I+ A+ + G I V D K ++ A ++A LA A+ + + Sbjct: 11 LRRMLGDRGGNFGIMTAIVLPVLFGAAGMAIQVGDLLLSKQQLQEAADSAALATATALAN 70 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +I A D F+ + L I T Sbjct: 71 G---------TIQTSQAEAFARD---FVAGQMANYLQS---------------GIDIKST 103 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + R +S + Y + V Y+ + + + +S + Sbjct: 104 TGVDVRTTTSGKSTSYQVTVSPDYNIAVNPLMQTIGFTTQN--ISTSSTTTSGNSQTQGS 161 Query: 191 FLIELVVDLSGSMHCAMNSDP------------------EDVNSAPICQDKKR---TKMA 229 + LV+D SGSM + +V C D + TK+ Sbjct: 162 VSMFLVLDRSGSMGEDTATVNASDPTEEYNYDCSEKDRYGNVTKKKTCTDTRPHYYTKIE 221 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ALK A+ +D + KE V G + Y ++K WGT V +YV + Sbjct: 222 ALKLAVGTLTGELDAVDPEKEYVRTGAVSYNIEMQKAKALDWGTAHVTKYVNKL---TAT 278 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TDS A K AY L + + K +P K+I+F+TDG+NN ++ Sbjct: 279 DGTDSGEAFKTAYNKLADAAEDKAHVD------KTGQVP-TKYIVFMTDGDNNYTSADTE 331 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 T CDKA++ +++ TI+ A GQ LL C ++P ++ + +L+ F+ I Sbjct: 332 TKTWCDKARDAKMQVYTIAFMAPARGQALLSYCATAPGNYFPAGDMTALLKAFKEIGMKA 391 Query: 410 VHR 412 ++ Sbjct: 392 SNQ 394 >gi|254780833|ref|YP_003065246.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] gi|254040510|gb|ACT57306.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] Length = 371 Score = 254 bits (648), Expect = 2e-65, Method: Composition-based stats. Identities = 82/410 (20%), Positives = 168/410 (40%), Gaps = 48/410 (11%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 + + K + SI+ A+ + +++G +I + K + + ++L A+K Sbjct: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG--YSAVFYNTEIQNIVNSS 125 +++ + N+ K+ D + R IKN + F +I NI S+ Sbjct: 63 ILN---------QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFA-QDINNIERST 112 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 +S+ ++ YN+ ++ Y+ F + I + ++I Sbjct: 113 SLSIIIDDQHK--------DYNLSAVSRYEMPFIFCT-FPWCANSSHAPLLITSSVKISS 163 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + +V+D+S SM+ + K+ ++ LD I + Sbjct: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMD--------------KLGVATRSIREMLDIIKSI 209 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 V V GL+ +++++ + +WG + +++ + R + T STP ++ AY + Sbjct: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKI 266 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-SNVNTIKICDKAKENFIKI 364 K++ + + ++K+IIFLTDGEN++ N ++ C++AK + Sbjct: 267 FDAKEK-------LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 I + A Q LK C SP+ Y+V N+ L F I + MV ++ Sbjct: 320 YAIGVQAEAADQ-FLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 >gi|218662625|ref|ZP_03518555.1| hypothetical protein RetlI_26027 [Rhizobium etli IE4771] Length = 389 Score = 251 bits (641), Expect = 1e-64, Method: Composition-based stats. Identities = 91/414 (21%), Positives = 150/414 (36%), Gaps = 58/414 (14%) Query: 26 FALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNH 85 A+ L G ++V D K ++ A ++A LA A+ + + I Sbjct: 1 TAILAPVLLGAAGMAVHVGDMLLSKQQLQEAADSAALATATALANG---------KIQTS 51 Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 A A+ F+ + L +I +++ +S+ + ++S N+ Sbjct: 52 EAEAY---ARNFVAGQMANYLQS---------GVDIKSATGVSV------QTNTSGNSTS 93 Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP-ALLRIEMGERPIFLIELVVDLSGSMH 204 Y + V SYD + + + S + + + L +D SGSM Sbjct: 94 YQVTVSPSYDLTVNPLMQAVGFTTQHLSTSGTTIGGGHSQTQGQGSISMYLALDKSGSMG 153 Query: 205 CAMNS----DPEDVNSAPI--------------CQDKK---RTKMAALKNALLLFLDSID 243 + DP + + P C + TK+ ALK A + Sbjct: 154 EDTATVNEEDPTESYTYPCNPHYNRKGKEVWDTCTGSRANYYTKIEALKMAAGNLFAQLS 213 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + V G + Y +WG V YV T+S+ AM AY Sbjct: 214 GADPNAQYVRTGAVSYDIVQYAPSSLAWGAIGVSSYVNALQAG---GGTNSSGAMSTAYL 270 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAK 358 LT+ +K +P QK+I+F+TDG+NNN S + T CD AK Sbjct: 271 SLTAKNAAGNDAEDSAHKLKSGQIP-QKYIVFMTDGDNNNDSSGGRSYDTLTKATCDTAK 329 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I+I TI+ A P GQ LL+ C S +++ + L F+ I + Sbjct: 330 SKGIEIYTIAFMAPPGGQALLQYCASDASHYFQAEKMEDLFAAFKAIGAKASTQ 383 >gi|227823417|ref|YP_002827390.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] gi|227342419|gb|ACP26637.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] Length = 413 Score = 245 bits (626), Expect = 7e-63, Method: Composition-based stats. Identities = 90/411 (21%), Positives = 152/411 (36%), Gaps = 33/411 (8%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 F + NF ++ A++ L G I + + KN ++ A +AA LA AS +VS Sbjct: 11 FITMLKDRGGNFGMMTAVAAPLLLAAGGVSIDMANMLMTKNQLQDATDAAALAAASALVS 70 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + I+ + S +T + VN+ +++T Sbjct: 71 DEQPDIAAAKEIARKFLKTQAGGTTT-PDAPADSGEGASSGAASSTPDWDDVNTLEVNIT 129 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 N + + V+ + LL + S + Sbjct: 130 ETPN-----GTKGKIFQVTVINKRVTEFNAMTRLLGTDSIELEASSTAESATES---KNA 181 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR--------------TKMAALKNALL 236 + LV+D SGSM N+ S P + TK+ ALK A+ Sbjct: 182 LSMYLVLDRSGSMAWKTNTINAAKKSCPNYTESNWSRYPNLWASSPCYVTKIDALKTAVT 241 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 L + + + V I Y + + +WGT YV + T S Sbjct: 242 DLLAQLLVADPDQIYVRTAAISYNSVQDTAGTLAWGTSGAAAYVNALVA---TGGTASAG 298 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K AYQ + + + + K +P K+++F+TDGENN + T + CD Sbjct: 299 AFKTAYQKVIAATENT------AHAAKNGQVP-SKYMVFMTDGENNYANDDTVTKQWCDT 351 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 AK N ++I +++ A GQ LLK C SS +++ L+ F+ I + Sbjct: 352 AKANKVEIYSVAFMAPERGQALLKYCASSSSHYFEAEEVTDLVAAFKAIGE 402 >gi|218515283|ref|ZP_03512123.1| hypothetical protein Retl8_17130 [Rhizobium etli 8C-3] Length = 329 Score = 244 bits (623), Expect = 2e-62, Method: Composition-based stats. Identities = 72/337 (21%), Positives = 120/337 (35%), Gaps = 47/337 (13%) Query: 96 RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 F+ + L +I +++ +++ + ++S N+ Y + V SYD Sbjct: 2 NFVAGQMANYLQS---------GVDIKSATGVTV------QTNTSGNSTSYQVTVSPSYD 46 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + + + S + + + L +D SGSM + E+ Sbjct: 47 LTVNPLMQAVGFTTQHLSTS--GTTIGGHSQTQGSISMYLALDKSGSMGEDTATVNEEDP 104 Query: 216 SAPICQDKK---------------------RTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + D TK+ ALK A + + V Sbjct: 105 TESYTYDCNGHYNKKGKWIYDTCTGSRANYYTKIEALKMAAGNLFGQLSSADPNAQYVRT 164 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 G + Y +WGT V YV T+S+ AM AY LT+ Sbjct: 165 GAVSYDIVQYTPSALAWGTSGVSTYVNALQAG---GGTNSSGAMSTAYSSLTAKNAAGND 221 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENFIKIVTISI 369 +K P +K+I+F+TDG+NN+ S + T CD AK I+I TI+ Sbjct: 222 AEDAAHKLKTGQTP-KKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAF 280 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 A GQ LL C S +++ + L+ F+ I Sbjct: 281 MAPEGGQALLHYCASDDSHYFQAEKMEDLLAAFKAIG 317 >gi|222087111|ref|YP_002545646.1| hypothetical protein Arad_3867 [Agrobacterium radiobacter K84] gi|221724559|gb|ACM27715.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 401 Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats. Identities = 84/414 (20%), Positives = 154/414 (37%), Gaps = 45/414 (10%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + I NF I+ A+++ G + V + + ++ + Sbjct: 13 QLIHDRTGNFGILTAIAIPVVAATAGVAVDVTNMTVSNSQLQ------------QATDAA 60 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + N A +F+ + LSG + N ++ + T Sbjct: 61 ALATATALANGNATTSNAQQLATQFVTGQMSNYLSGDT---------NTADALKAGTTAN 111 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + +SS T Y + V SYD + + LL + + ++ Sbjct: 112 VTSATNSSGGT-SYTVAVNASYDMSVNGMSQLLGIKTMHVSAASTSTSGSAAAAKQAALS 170 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR--------------TKMAALKNALLLF 238 +E+ +D SGSM + S + K+AALK A+ Sbjct: 171 MEIALDKSGSMLLNTDVIDTSQKSCTQYYTEGNYLYQYPKAKSPCYIKKIAALKTAVGTL 230 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 LD +D + V I +++ V+ + +WGT R V +++ T+S+ M Sbjct: 231 LDQLDSADPKSQYVRTAAIAWSSEVDSSSALAWGTTTTRSNVISGLNAN--GGTESSAPM 288 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 AY+ +++ + + + FQK I+ +TDGENN S+ T+ C AK Sbjct: 289 ALAYKNVSASSEATAQA-------AKGNTTFQKIIVLMTDGENNATSSDTKTLATCKAAK 341 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + I +++ A GQ LLK C SSP +++ LI F+ I + Sbjct: 342 DAGVLIYSVAFMAPDRGQTLLKNCASSPSNYFDAQQMSDLIAAFKTIGNQASKQ 395 >gi|15966595|ref|NP_386948.1| hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|307300370|ref|ZP_07580150.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|307319653|ref|ZP_07599079.1| TadE family protein [Sinorhizobium meliloti AK83] gi|15075867|emb|CAC47421.1| Hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|306894775|gb|EFN25535.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306904536|gb|EFN35120.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 410 Score = 233 bits (594), Expect = 3e-59, Method: Composition-based stats. Identities = 89/417 (21%), Positives = 156/417 (37%), Gaps = 43/417 (10%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + NF ++ AL L + G + V + KN ++ A +AA LA AS +VS+ Sbjct: 3 MLRDRGGNFGMMTALIAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASALVSDAR 62 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 D E+ + K A + E++ +A + VN+S + + Sbjct: 63 P--DIEEAKAIARKFLKTQMAATSSADVPGEAVGTMAAAGSTAPSWDDVNTSEVVIVETP 120 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 N + + V + + + LL + + + + Sbjct: 121 N-----GTKGKSFQVSVANKHLLQFNAMTRLLGKESIELETRSTADSATES---KNAISM 172 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR-------------TKMAALKNALLLFLD 240 LV+D SGSM ++ K+ LK+A+ Sbjct: 173 YLVLDRSGSMAWKTDTVDTSRPRCINWTASNWGESNVRATSPCYVDKITTLKSAVDKLFT 232 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + + E + G Y R ++ + +WGT+ +V TDS+ A Sbjct: 233 PLAKMDPGNEYLRAGAASYNDRQDRASKLTWGTKNASAHVQGLDA---TGGTDSSSAFAA 289 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF----------KSNVNT 350 A + L D + K P +K+I+F+TDGEN ++ K++ T Sbjct: 290 AVEELLLDGENE------AHLAKNGQTP-EKYIVFMTDGENTSYNGKTSPRDLEKADSVT 342 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 C AK N I I T++ A G+ LLK C +SP+++ +A +L+ F+ I Q Sbjct: 343 KAACTTAKNNGIAIFTVAFMAPQRGKDLLKACATSPDHYKEADDAAALVSEFEKIGQ 399 >gi|15891094|ref|NP_356766.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] gi|15159433|gb|AAK89551.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] Length = 412 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. Identities = 83/424 (19%), Positives = 164/424 (38%), Gaps = 61/424 (14%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ +A NF ++ A+ + L + G + + + K M++ ++A LA A++ Sbjct: 11 RRFLADTSGNFGMMTAILLPVLLGVAGAGMELANVMQVKADMQNTADSAALAAATEARLR 70 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 +L D + + AK FI ++++L+ E + +S +T Sbjct: 71 EGKLSDEQ----------IKEIAKNFIAAQMEKNLTAE-------EKIELEKNSPTRVTT 113 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 N R + Y ++ + +L + + + V + + + Sbjct: 114 TENARGKT------YAVETTIKHQIQLNPMLGFIGAKTLDLSV---TGTAKSTINKGAPI 164 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM--------------AALKNALLL 237 + L +D SGSM ++ S +K A+LK A+ Sbjct: 165 SMYLALDRSGSMSFKTDTVDTTKTSCQNYTSDNWSKYPNLAKTSPCYVNKAASLKTAVGF 224 Query: 238 FLDSIDLLSH------VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + +++ E V G YT WGT V YV + + Sbjct: 225 LVATLNKADPTYTVNGGSELVRTGASVYTHETYVAQSIGWGTSGVTSYVDKQIPEFPSGG 284 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE--------NNN 343 TD+ ++ AY L + + R + S F+++I+ +TDGE N++ Sbjct: 285 TDARSSLNAAYNAL-----KKANPDEARYHKEKGSESFERYIVLMTDGEMTGNSAAWNSS 339 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +V T C+ AK++ IKI +++ A G+ LL+ C SS + +Y N + ++ F Sbjct: 340 IDQSVRT--TCETAKKDGIKIFSVAFMAPDKGKSLLQYCASSADNYYAPENMEQIVTAFG 397 Query: 404 NISQ 407 I++ Sbjct: 398 EIAR 401 >gi|218506715|ref|ZP_03504593.1| hypothetical protein RetlB5_03444 [Rhizobium etli Brasil 5] Length = 269 Score = 223 bits (568), Expect = 4e-56, Method: Composition-based stats. Identities = 61/259 (23%), Positives = 93/259 (35%), Gaps = 31/259 (11%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR--------- 225 S + + + L +D SGSM + D + P D Sbjct: 3 STSGRTVSGHSQSQGSISMFLALDKSGSMGDPTATVNADDPTEPFTYDCNPHLNKKGTKI 62 Query: 226 -------------TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 TK+ ALK A ++ E V G + Y + +WG Sbjct: 63 IYDTCTGSRAHYYTKIEALKIAAGNLFSQLNSADPNAEYVRTGAVSYDLVEYTPSKLAWG 122 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V YV T+S+ A+ AY LT+ +K LP +K+ Sbjct: 123 ITAVTSYVNALESG---GGTNSSGAVNTAYTSLTAKNAAGNDAEDAAHKLKTGQLP-KKY 178 Query: 333 IIFLTDGENNNFK-----SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 I+F+TDG+NN+ + T CD AK I+ TI+ A GQ LL C S Sbjct: 179 IVFMTDGDNNDDSRGGRSYDTLTKATCDTAKAKGIETYTIAFMAPEGGQALLHYCASDDA 238 Query: 388 YHYNVVNADSLIHVFQNIS 406 +++ + L+ F+ I Sbjct: 239 HYFQAEKMEDLLAAFKAIG 257 >gi|307945905|ref|ZP_07661241.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] gi|307771778|gb|EFO31003.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] Length = 432 Score = 217 bits (553), Expect = 2e-54, Method: Composition-based stats. Identities = 74/390 (18%), Positives = 146/390 (37%), Gaps = 52/390 (13%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISN 84 +F + +M L ++ I + ++ +++A + A + +++ E I+ Sbjct: 85 LFGILIMLLLAVVTIGIDMSQTFGERTRLQTAADMAAVQTGRALLA---------EEITI 135 Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 A DA I + + S S+ + + +I+ T NT Sbjct: 136 AQANAYAKDAFNRIASGLSAS-GDGSSGTSIFGTMTVKPAVQITETVD--------GNTT 186 Query: 145 FYNMDVMTSYDYRLQFIEHLL--NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 Y + V + + + + I + + + LV+D SGS Sbjct: 187 NYVVKVNGTAKIPASPLSFMFFDGETGKNTISLGFESETTAKAEAGASLSMALVLDRSGS 246 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 M ++ ++M+ LK A+ + + + + +G Y Sbjct: 247 MGW-----------------ERPSRMSELKKAVRSLIKELQTVDPDDQFTRLGAYAYHWY 289 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 E +W VR +V S T + PA+++A L ++ + + N Q Sbjct: 290 YAGKKELTWNKNSVRSWVNSLPAS---GGTRAAPAIQKAKNDLLTNSELNAHINKNEQEP 346 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKT 381 + FI+++TDG + N K C AK I I T++ A +G+ LLK Sbjct: 347 DL-------FILYMTDG----IDGDPNWAKRECTSAKNAGITIYTVAFKAPASGRNLLKA 395 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 C +S ++Y+ NA+ L VF++I++ Sbjct: 396 CATSDAHYYDAKNANELNKVFKDIARETTK 425 >gi|332716587|ref|YP_004444053.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] gi|325063272|gb|ADY66962.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] Length = 412 Score = 217 bits (552), Expect = 3e-54, Method: Composition-based stats. Identities = 73/422 (17%), Positives = 149/422 (35%), Gaps = 57/422 (13%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ +A NF ++ A+ + L G + + + K +++ + + Sbjct: 11 RRFLADTGGNFGMMTAILLPVLLGFAGAGMELANVMQVKADLQN----------TADSAA 60 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 L+ + + + AK FI + ++++L E + + +S +++ Sbjct: 61 LAAATEARLKEGALTDEQIKEIAKAFIASQMEKTL-------TEEEKKALEKNSPVNIG- 112 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 Y + +Y +L + + + + Sbjct: 113 -----TTDDARGKTYTIQTTINYQMQLNPLLGFFGAKTLDLAA---TGTAVSTVNKGAPI 164 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR--------------TKMAALKNALLL 237 + LV+D SGSM ++ S K +LK A+ Sbjct: 165 SMYLVLDRSGSMSFKTDTLNTKKTSCQNYTVDNWGSYPNLKNTSPCYVNKATSLKTAVGY 224 Query: 238 FLDSIDLLSH------VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + +++ E V G YT +WGT V YV + + Sbjct: 225 LVATLNKADPTYTANGGSELVRTGASVYTHETYAAQPITWGTSSVATYVDKQIPEFPSGG 284 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS----- 346 TD+ ++ AY L + T ++ S F+++I+ +TDGE S Sbjct: 285 TDARSSLNAAYNAL-----KKANTVEAKEHKDKKSESFERYIVLMTDGEMTGNSSSWSSS 339 Query: 347 -NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + CD AK++ IKI +++ A G+ LL+ C SS + +Y N + ++ F I Sbjct: 340 IDQTVRNTCDTAKKDGIKIFSVAFMAPDKGKSLLQHCASSLDNYYAPENMEQIVTAFGEI 399 Query: 406 SQ 407 ++ Sbjct: 400 AR 401 >gi|116253849|ref|YP_769687.1| hypothetical protein RL4112 [Rhizobium leguminosarum bv. viciae 3841] gi|115258497|emb|CAK09601.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 398 Score = 214 bits (544), Expect = 3e-53, Method: Composition-based stats. Identities = 91/421 (21%), Positives = 151/421 (35%), Gaps = 59/421 (14%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + NF I+ A+ + L G I + K ++ A ++A LA A+ + S Sbjct: 10 LRRMLGDRGGNFGIMTAIMMPVLLGAAGLAIDYSNMALSKRELQEATDSAALAAATALAS 69 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + E+I AK F+ + + + I +I + + + Sbjct: 70 GAASTTADAEAI-----------AKDFVSGQMANYVDTDA-------ISSIKAGTSVDID 111 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 A S Y + V TSY +L + + + + Sbjct: 112 VSATATSKS------YKVTVATSYGIAATPFMSVLGYKTLN--IGASTSTSSGTSDTKTA 163 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR----TKMAALKNALLLFLDSIDLLS 246 +ELV+D SGSM E + K TK+ ALK A D++D Sbjct: 164 LSMELVLDQSGSMG-------EKTTTCATYNGKNCKTYVTKIDALKKAADALFDALDTAD 216 Query: 247 HVKEDVYMGLIGYT---------TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 V G Y T+++ +WGT YV+ S TD+T Sbjct: 217 PDHSLVRTGAYSYNNGLIYNSQKTQIKSMSGMAWGTATTATYVSGITAS---GGTDATEP 273 Query: 298 MKQAYQILTSDKKRSFFTNFFRQ-GVKIPSLPFQKFIIFLTDGENNN------FKSNVNT 350 M+QA S K S ++ Q + ++II +TDGE + N Sbjct: 274 MRQA---TLSIAKASDGSDVETQAHAVKGNTIVSRYIILMTDGEMTGNTGVWQSSFDQNV 330 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 CD K IKI T++ A G++LL+ C S +Y + L+ F +I++ Sbjct: 331 RNQCDATKTAGIKIFTVAFMAPDKGKQLLQYCASPGGNYYEAETMEKLVASFTSIAKEAT 390 Query: 411 H 411 Sbjct: 391 K 391 >gi|222149754|ref|YP_002550711.1| hypothetical protein Avi_3756 [Agrobacterium vitis S4] gi|221736736|gb|ACM37699.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 437 Score = 205 bits (521), Expect = 1e-50, Method: Composition-based stats. Identities = 81/416 (19%), Positives = 152/416 (36%), Gaps = 51/416 (12%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + NF ++ A+ + + + G + + +++++S+ +AA LA AS M + + Sbjct: 41 RLLRHSGGNFGMMTAVLLPVSIGVAGLAMDATEMVQSRSALQSSVDAAALAAASAMSNGM 100 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 S I AK F+ + + +++ +I + T + Sbjct: 101 SEADA-------------IALAKSFLSSQLANTMARDENTSSVDQITQAEPDISVKTTQV 147 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 ++++ Y++++ SY + + +L Sbjct: 148 -------NSSSTSYDVELTGSYTITMNPLSRVLGWETVTLKA--YGKAQAATTASESPLS 198 Query: 193 IELVVDLSGSMHCAMNSD---------------PEDVNSAPICQDKKRTKMAALKNALLL 237 + LV+D SGSM+ + + + K TK+ +LK A+ Sbjct: 199 MYLVLDRSGSMNDETATTYTGTCTKTTTSGYGWNKKTTTTSYSCTKNYTKIESLKLAVAD 258 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + E V G Y + SWGT V YV TD+ A Sbjct: 259 LAAQLKKADPNSEYVRTGADSYNASADTAQAMSWGTANVVTYVNALSA---TGGTDARGA 315 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT------I 351 + AY L + K + KI ++I+F+TDGE S+ ++ Sbjct: 316 LSAAYSALQTSNKTEITAHNVSSVSKIG-----RYIVFMTDGEMTGNSSSWSSSIDSAVR 370 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 C K + I+I T++ A NG+ LL C S ++Y +A SL+ F I + Sbjct: 371 SQCTSIKADGIQIYTVAFMAPANGKSLLSACASDASHYYEATDAASLVAAFGEIGK 426 >gi|254781108|ref|YP_003065521.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040785|gb|ACT57581.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 398 Score = 192 bits (488), Expect = 7e-47, Method: Composition-based stats. Identities = 90/426 (21%), Positives = 167/426 (39%), Gaps = 66/426 (15%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISN 84 + A+ + L I + I + Y +N M+SA +AA+L+G + +VS+ + + Sbjct: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK--- 57 Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 D K IK+ L + +I ++I++T NN L Sbjct: 58 -------DQTSTIFKKQIKKHLK--QGSYIRENAGDIAQKAQINITKDKNNPLQ------ 102 Query: 145 FYNMDVMTSYDYRL-QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM 203 Y + Y+ L + ++ E I +V+D+S SM Sbjct: 103 -YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII-ERSSENLAISICMVLDVSRSM 160 Query: 204 H--------------------------CAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + + AP R K+ L + Sbjct: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR-KIDVLIESAGN 219 Query: 238 FLDSIDLLSHVKE--DVYMGLIGYTTRVEKNI--EPSWGTEKVRQYVTRDMDSLILKPTD 293 ++SI K+ V +G I Y + N S +V+ + + + T+ Sbjct: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENTN 276 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNT 350 + PAM AY+ L ++K+ S T I S +KF+IF+TDGEN+ +++ +NT Sbjct: 277 TYPAMHHAYRELYNEKESSHNT--------IGSTRLKKFVIFITDGENSGASAYQNTLNT 328 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 ++IC+ + +KI +++++A P GQ LL+ C S + V ++ L+ F I+ + Sbjct: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 Query: 411 HRKYSV 416 + + Sbjct: 389 EQSVRI 394 >gi|254780388|ref|YP_003064801.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] gi|254040065|gb|ACT56861.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] Length = 458 Score = 181 bits (459), Expect = 2e-43, Method: Composition-based stats. Identities = 90/464 (19%), Positives = 181/464 (39%), Gaps = 64/464 (13%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 ++F FY KK I S +F II AL + L + G L+ V+ W Y +++++ A AI+ Sbjct: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ----- 119 + ++ +L + R ++ K+ + + R +N++K++ + + Sbjct: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 Query: 120 -----NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS-----------YDYRLQFIEH 163 +V SSR + + S + + + +Q++ Sbjct: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 Query: 164 L------------------LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 Q ++ + S+ ++ + + +V + S++ Sbjct: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEK-LSPYMV-SCNKSLYY 242 Query: 206 AMNSDPEDVNSAPICQDKKRT------KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 + P D + + + K +++AL + SI + +V + V MG + Sbjct: 243 MLYPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFF 302 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRD--MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 RV + SWG K+ + + + +D + T AM+ AY + S + Sbjct: 303 NDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV---- 358 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 +L +K+I+ LTDGEN + N I IC+KAK I+I+TI+ + + Q Sbjct: 359 ---HRMKNNLEAKKYIVLLTDGEN--TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQE 413 Query: 378 ----LLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMVHRKYSV 416 L C SP + + L +F++ I + R + Sbjct: 414 KARYFLSNCA-SPNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 >gi|315122199|ref|YP_004062688.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495601|gb|ADR52200.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 463 Score = 172 bits (435), Expect = 1e-40, Method: Composition-based stats. Identities = 97/465 (20%), Positives = 185/465 (39%), Gaps = 61/465 (13%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 F FK+ +F +I AL + ++IG LI ++ W Y NS+ A N A L+ Sbjct: 5 KSLFFNFKRLKKCYNGSFFVISALLLPVIFMVIGLLIDLVRWGYYHNSLVQAVNTAALSA 64 Query: 65 ASKMVSNLSRLGDRFESIS----NHAKRALIDDAK------------RFIKNHIKESLSG 108 + ++++++ S N+ K+ L+++ K + I H K ++ Sbjct: 65 SVQLLNSVEDKSKEKALSSVLGENNIKQYLLNNLKISLYNNFGEMDSQRIIQHTKVNIYN 124 Query: 109 YSA-------VFYNTEIQ-------NIVNSSRISMTHMANNRLDSS-------NNTIFYN 147 YN + N++N +T + + S ++ + Sbjct: 125 RKGTHIINVYSHYNLPLNPFSLFFMNLINIKSWPITTVGEAEVTSKKNYHKEEGVSVQWL 184 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV---VDLSGSMH 204 +D S + Q +Q V ++R E + +V V S++ Sbjct: 185 IDDSGSMGSIIDRACFGSKQLKSQYNVGSKIGIVRNENADTSDSFYPIVGELVSCDRSLY 244 Query: 205 CAMNS------DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 +N D + + K +++AL F+ + + ++K+ + M + Sbjct: 245 YVLNDKKILEDDDLEEKNLDNHSQYYIRKRYLVRDALATFIKRVRKIDNLKDKLRMSFMY 304 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 + R++ +WG ++ +Q V+ TD P +++AY L S + Sbjct: 305 FNERIDHYFPMTWGIKEFKQEVSSHYKRKHENTATDIHPILQEAYNKLHSKNEDD----- 359 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT-IKICDKAKENFIKIVTISINASPNGQ 376 + K S+ +KFI+ LTDG N +V++ +KICD AKE IKI TIS + + + Sbjct: 360 --EHKKKNSVEVKKFIVLLTDGAQNEGVHSVDSVLKICDAAKEEGIKIFTISYSVDSSER 417 Query: 377 ----RLLKTCVSSPEYHYNVVNADSLIHVFQ-NISQLMVHRKYSV 416 L C SP+ + +AD L +F+ +I + R + Sbjct: 418 KKANDFLSRCA-SPDKFFEAYDADKLNMIFKEHIGDAIFERLVKI 461 >gi|315122347|ref|YP_004062836.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495749|gb|ADR52348.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 362 Score = 159 bits (402), Expect = 7e-37, Method: Composition-based stats. Identities = 62/405 (15%), Positives = 160/405 (39%), Gaps = 52/405 (12%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + +++ +I A+ ++L+ + + + + +K +++ + A+L + + Sbjct: 5 KIRNFFQNKRGIITITSAIIFPLIIILMAIVFEMSNIYLEKERLQAVIDRALLDTVTMI- 63 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + ++I + K + ++K L +++QN+++ + + Sbjct: 64 --------KLKNIEDVVKN--VGPVNTIWTKNLKYELEHSDFS---SDVQNVIDDTSMK- 109 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 L+S +N ++ ++ Y + L N+ + + + ++I E Sbjct: 110 -------LESDSNFKTLSITAISQYKMPFKICNIHLLCPKNKYVTVPVLSSMKIGRNEGS 162 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + +V+D+S SM ++ +++ K ++ L+ + + Sbjct: 163 DIDLMIVLDVSSSMDDNF----------MKPEEAPCSRLEVAKKSIRKMLEDFRKVPNYA 212 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G +G+ V+ + G +++ + + T+S MK A++ L + Sbjct: 213 NVFRTGSVGFNDMVQFPMPLKRGLKRIYNDIKKYRA---FGSTNSYVGMKYAWEQLYGNP 269 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + +K +IFLTDGEN + TI++C+ K+ I +I++ Sbjct: 270 QDTKDR--------------KKIVIFLTDGENMIINATRKTIELCNDMKKKKAVIYSIAL 315 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + +L+ C SS Y +A SL+ + I + ++ +Y Sbjct: 316 AVDN--KEVLQGC-SSSGNVYAADDAQSLVQAYSLIGKDVMKDEY 357 >gi|218678237|ref|ZP_03526134.1| hypothetical protein RetlC8_04927 [Rhizobium etli CIAT 894] Length = 120 Score = 128 bits (321), Expect = 2e-27, Method: Composition-based stats. Identities = 34/109 (31%), Positives = 51/109 (46%), Gaps = 6/109 (5%) Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKA 357 LT+ +K +P +K+I+F+TDG+NNN S + T K CD A Sbjct: 1 TSLTAKNAAGNDAEDAAHKLKTGQIP-KKYIVFMTDGDNNNDSSGGRSYDTATKKTCDDA 59 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 K I+I TI+ A GQ LL C S +++ + L+ F+ I Sbjct: 60 KSKGIEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFEAIG 108 >gi|126730251|ref|ZP_01746062.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] gi|126708984|gb|EBA08039.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] Length = 614 Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats. Identities = 37/193 (19%), Positives = 78/193 (40%), Gaps = 9/193 (4%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGY-----TTRVEKNIEPSWGTEKVRQYVTRDMD 285 + ++ + ++ E Y E IE G + + D Sbjct: 422 VTRTAAQMVNGVAVVDGDSETTKAKCSSYKGAGHNAGQETLIENVLGMDYGTLDLDGDGI 481 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTN-FFRQGVKIPSLPFQKFIIFLTDGEN--N 342 + + P ++ +Q L + K +++ N ++ Q + + E + Sbjct: 482 AGANDDCSNYPPVRLTWQELFGNVKTTYYANAWYWQAYMDGRASYNDYYNAYYSWETTVD 541 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 ++N N IC KAK+ + I TI + A G ++ C SS ++YNV +++ L+ F Sbjct: 542 ASQANTNLATICAKAKQQDVTIFTIGVEAPQAGLNAMRNCASSASHYYNV-SSNQLVDTF 600 Query: 403 QNISQLMVHRKYS 415 ++IS ++V + + Sbjct: 601 RSISDVVVELRLT 613 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 47/356 (13%), Positives = 106/356 (29%), Gaps = 56/356 (15%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS--------K 67 A + S + + ++ G I ++ K++ +++ + A+LA A+ Sbjct: 28 ADTSGSMSYVALAGSLVMMVFGGIGIDMMHAELKRSQVQNTLDRAVLAAANLSNTRDPQT 87 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFI--KNHIKESLSGYSAVFYNTEIQNIVNSS 125 +V + R +++ + + + I G ++ + Sbjct: 88 VVEDYFRAMKLEDTLGDVQTGDSLGAKRVRAEGNGSINSHFLGL-IGVDQLDVYGAATAE 146 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ--RYNQKIVSFIPALLRI 183 + + LD S + + + F++ +L + ++ VS IP + Sbjct: 147 NATAPLEISLVLDVSGSMQGQKIRDLKEAAKA--FVDAVLGEGGDNSRVTVSLIPYNATV 204 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 +G+ +D + C + + +L L+ + Sbjct: 205 NLGDD--LSERFNLDR--------------WQNYSSCAIFESSDYNSLSIDPNAGLEQLA 248 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-------RQYVTRDMDSLIL-KPTDST 295 Y G + + G Y++ +DS T Sbjct: 249 HFDP---YDYSG----NSPDLTAPWCAEGNNLAIVPHSSDADYLSDVIDSFEAQGNTAID 301 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ---------KFIIFLTDGENN 342 MK L R + G+ S ++ KF++ +TDGEN Sbjct: 302 LGMKWGL-ALLDPAARPVIGDMQADGLVPSSARYRPSDYGTQTMKFVVVMTDGENT 356 >gi|110679843|ref|YP_682850.1| hypothetical protein RD1_2614 [Roseobacter denitrificans OCh 114] gi|109455959|gb|ABG32164.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 488 Score = 103 bits (256), Expect = 6e-20, Method: Composition-based stats. Identities = 76/489 (15%), Positives = 156/489 (31%), Gaps = 91/489 (18%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 L F E +I + V+ LL+ G + ++ + +++ + AILA Sbjct: 13 LRGFCRRLMGFRREEDGAMTIFATMMVLMMLLVCGIAVDLMQNEMMRTRVQNTLDRAILA 72 Query: 64 GASKMVSNLSRLGDRFESISNHAKRA----LIDDAKRFIKNHIKES------LSGYSAVF 113 S+L + E + ++ +A +DD + H+ + + Sbjct: 73 A-----SDLDQPLPADEVVDDYFAKAGMTEFLDDVQITPGAHLPTTNFRVVQAEARTRTP 127 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL--------QFIEHLL 165 V S + + A ++++ ++ ++ + ++ FI +L Sbjct: 128 SIYMAMTGVRSLPVYVAGTAEETIENTEISLVLDISGSMRNNGKIGNLRTAAKDFIGAVL 187 Query: 166 -NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM---NSDPEDVNSAPICQ 221 N ++ +P + G +V +G A +SD ++ Sbjct: 188 EGNAANTTSLNIVPYAGQTNPGP-------IVFQRAGGRPFATFIEDSDGNEILYGQTFV 240 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK----NIEPSWG----- 272 D + + N + LD + D+ G T WG Sbjct: 241 DDEGNSIDVPYNTMSSCLD-LTNGDFDNIDLPSGGYDQTPYFMNWPIDAPTMDWGWCPQN 299 Query: 273 -------TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 Q D + T + MK +L + +F V Sbjct: 300 KSSIRYAQNNAGQLQDFIDDMRLHDGTGTQYGMKYGVALLNPSSRDTFVALNAAGLVPDG 359 Query: 326 ---------SLPFQKFIIFLTDG---------ENNNFKSNVNTIK--------------- 352 + +KFI+ +TDG + N+ K++ + Sbjct: 360 FKDRPADFGTTDTRKFIVLMTDGQITDQFRPEDKNDPKNDEIALNQRIGDRDTYATQSTN 419 Query: 353 ------ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 IC+KAK I + TI+ A N ++TC +SP + Y V + F++I+ Sbjct: 420 VANFYSICNKAKAAGITVYTIAFEAPANAITQMRTCATSPAFFYKVEGV-EIKTAFKSIA 478 Query: 407 QLMVHRKYS 415 + + + + Sbjct: 479 RQINELRLT 487 >gi|163731887|ref|ZP_02139334.1| hypothetical protein RLO149_21324 [Roseobacter litoralis Och 149] gi|161395341|gb|EDQ19663.1| hypothetical protein RLO149_21324 [Roseobacter litoralis Och 149] Length = 468 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 73/473 (15%), Positives = 152/473 (32%), Gaps = 83/473 (17%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 E +I + V+ LL+ G + ++ + +++ + AILA S+L + Sbjct: 5 REEDGAMTIFATIMVLMMLLVCGIAVDLMQNEMMRTRVQNTLDRAILAA-----SDLDQP 59 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 E + ++ +A + + ++ L + E + S ++MT + Sbjct: 60 LPADEVVDDYFAKAGMTEFLNDVRITPGSDLPTTNFRIVQAEARTRTPSIYMAMTGVRTL 119 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS--FIPALLRIEMGERPIFLI 193 + S + S + + N + + FI A+L + I Sbjct: 120 PVYVSGTAEETIEKIEISLVLDISGSMRNNGKIGNLRTAAKDFIGAVLEGNAAKTTSLNI 179 Query: 194 ----------ELVVDLSG----------SMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 +V + +G S + V+ D M++ + Sbjct: 180 VPYAGQTNPGRIVFERAGGLPFATFIEDSNGDEILYGQTIVDDEGNSIDVPYNTMSSCLD 239 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG----TEKVRQYVTRDM----- 284 D+IDL S + ++ WG +Y D Sbjct: 240 LTNSDFDNIDLPSGGYDQT---PYFMNWPIDAP-TMDWGWCPQNNSSIRYAQNDAGRLQD 295 Query: 285 ---DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP---------SLPFQKF 332 D + T + MK +L + +F V + +KF Sbjct: 296 FIDDMRLHDGTGTQYGMKYGVALLNPSSRNTFLALNAAGLVPDGFKNRPADFGTTDTRKF 355 Query: 333 IIFLTDG---------ENNNFKSNVNTIK---------------------ICDKAKENFI 362 I+ +TDG + N+ K++ + +C+KAK I Sbjct: 356 IVLMTDGQITDQFRPEDKNDPKNDEIALNQRTGDRDTYSTQSTNVTNFYSVCNKAKAEGI 415 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + TI+ A + ++TC +SP + Y V + F++I++ + + + Sbjct: 416 TVYTIAFEAPADAVTQMRTCATSPAFFYKVEGV-QIKTAFKSIARQINELRLT 467 >gi|154250683|ref|YP_001411507.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] gi|154154633|gb|ABS61850.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] Length = 436 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 36/163 (22%), Positives = 67/163 (41%), Gaps = 20/163 (12%) Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 ++ + S + ++ + T++T + + +LT S Sbjct: 286 NCNLQTIMPLSTNWSALNSHIDAMASA---GNTNTTIGLAWGWNMLTQGGPLSSAAAPA- 341 Query: 320 QGVKIPSLPFQKFIIFLTDGEN--NNFKSNVNTIK-----ICDKAKENFIKIVTISINAS 372 K I+FLTDG+N N + +N NTI IC+ K IK+ ++ + Sbjct: 342 -------ANLDKVIVFLTDGDNTRNRWSNNSNTINARTTLICNNIKAAGIKVYSVRVI-- 392 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 L++ C + P +Y+V A L VF +I+Q + + + S Sbjct: 393 EGNATLIRNCATEPGMYYSVTTASELTSVFASIAQSLSNLRIS 435 >gi|218462234|ref|ZP_03502325.1| hypothetical protein RetlK5_23393 [Rhizobium etli Kim 5] Length = 66 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 18/54 (33%), Positives = 26/54 (48%) Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 CD AK I+I TI+ A GQ LL C S +++ + L+ F+ I Sbjct: 1 TCDTAKSKGIEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFKAIG 54 >gi|329850249|ref|ZP_08265094.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328840564|gb|EGF90135.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 412 Score = 90.3 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 72/466 (15%), Positives = 149/466 (31%), Gaps = 113/466 (24%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 H + FR + + N +IFALSV +G I Y + ++ A ++A+ Sbjct: 9 HRFAGFRRRLGESCRDQSGNVIMIFALSVFVIFGFVGAAIDFSRVDYARRRLQDAADSAV 68 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 L + + ES A +A ++ F + + ++ Sbjct: 69 LRAMALKSA-------TDESRGVAADKAFAEN-------------------FGHPGVYDL 102 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 + + + ++ + + T+ + + + + Sbjct: 103 NGALKREVNENIISQTYTVHATVS-------------SYFGAFFGKD-----SYPVTVVS 144 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + +F I V+D +GSM + N P + + MA L S Sbjct: 145 QAKTSLD-VFEIAFVLDTTGSMA--------EANKMPNLKSSVDSAMAGLLQNGKNLSGS 195 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG----------------------------- 272 + V + TT + + WG Sbjct: 196 KIAVVPFNTQVRLSDATVTTMSSQGLSSGWGNCVHDRDLATSHDVSASAAQKGKAQTLYP 255 Query: 273 -----------TEKVRQYVTRDMD----SLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 + + ++ + T+ T ++ ++L+ ++ S T F Sbjct: 256 LETCDEASLKPVQGLSDNISSARNFIKTLQPGGYTNVTMGVQWGMEVLSPNQPFSDATEF 315 Query: 318 FRQGVKIPSLPFQKFIIFLTDGEN-------NNFKSNVNTIKICDKAKENFIKIVTISIN 370 S +KF+I +TDG+N + + T C+ AK I + T+ I Sbjct: 316 -------GSTKARKFMIVVTDGDNTKSFTSWSASVIDKRTALACENAKAKGITVYTVKII 368 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 +L+ C S+PEY Y++ +A+ L I + + + S+ Sbjct: 369 --QGNSNMLRKCASAPEYFYDLTSANQLNAAMSGIFKSINKTRLSM 412 >gi|327538644|gb|EGF25299.1| protein containing von Willebrand factor, type A domains [Rhodopirellula baltica WH47] Length = 388 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 57/412 (13%), Positives = 133/412 (32%), Gaps = 55/412 (13%) Query: 6 RFRFYF--KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 RF K + S ++ + + L + + I V+ + ++ + + A A Sbjct: 18 RFTLNVVDKARLRSRSGTTVVMLVILLPVMLAVAAYCINVVYMEMARTELQISTDLATRA 77 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + GD+ E+I + + S+ +F + Sbjct: 78 AGRVL----AVTGDKAEAIEAAERLLEANPYLDR-----TLSIGDADIIFGKSNRTEENR 128 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + +++S F DV + IE ++ Sbjct: 129 RYEFT----PDKKVNSVGLRAFGADDVPMLFPTMGVPIEFRP---------------IKQ 169 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT------KMAALKNALLL 237 + + I +V+D SGSM + D N +P K A + + Sbjct: 170 AVATQVELDIAIVLDRSGSMA--FSHDEVAKNGSPSSAPPGWKMGHAVPKNARWLDTVAA 227 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 +D++ D + L Y+ + + +++ + ++R + + T+ Sbjct: 228 VNGFLDIMEDSSHDERVSLSTYSDKSKADVKLTGDYTEIRAAMNAHSTNFKGGATNIGSG 287 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + L + + +I ++DG +N + I + Sbjct: 288 ILEGGATLGDKNLARSWA--------------SRVLIVMSDGIHNT---GIEPIPAAQQV 330 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I I T++ + N Q + K VS H++ ++ L F+ I++ + Sbjct: 331 ANEKIMIFTVTFSNEANVQEMEKVAVSGGGQHFHAKDSQQLAEAFRKIAKSL 382 >gi|87311197|ref|ZP_01093320.1| hypothetical protein DSM3645_16250 [Blastopirellula marina DSM 3645] gi|87286105|gb|EAQ78016.1| hypothetical protein DSM3645_16250 [Blastopirellula marina DSM 3645] Length = 373 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 63/394 (15%), Positives = 130/394 (32%), Gaps = 44/394 (11%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I+ + I+ A+ + L + F + V + + A ++A AGA + Sbjct: 15 ISRRRGAVLILIAVLLPVILWMAAFCVDVAYMQLTRTELRIATDSAARAGARTLSLEQDA 74 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 +I AK + + + ++ LS + ++ + S + N Sbjct: 75 SLAHKSAIEYAAKNNVAGNTLTLADSDVQIGLSVRT--------DDVGRFTFSSGGKLLN 126 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + + T + Y + E + + I Sbjct: 127 SVNVTGRRTQQAPDGAVRLYLTPIFGHEFF--------------QPVADATASQIDRDIA 172 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LVVD SGSM +N + + R + AL +++ FL + + E V Sbjct: 173 LVVDRSGSMTFRINRNSYESGWRNNDPVPSRARWWALVDSVDGFLTELGSTPQL-ELV-- 229 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 L Y + + + + + ++ + T+ T M + L + K + Sbjct: 230 SLSTYNSSAKIDEQLTDKYSRIEDALDDYSRRYPDGSTNITAGMDRGISTLQNKKYARPY 289 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 K ++ +TDG +N S N A + I + TI+ + Sbjct: 290 A--------------SKTMVVMTDGNHNYGSSPTNAAY---DAASDDIVVHTITYS-DGA 331 Query: 375 GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQ 407 Q L++ H++ + D L +F+ I++ Sbjct: 332 NQSLMREVARIGGGQHWHAPDGDELEEIFREIAR 365 >gi|32477945|ref|NP_870939.1| hypothetical protein RB13237 [Rhodopirellula baltica SH 1] gi|32448502|emb|CAD78017.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 388 Score = 85.3 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 58/413 (14%), Positives = 134/413 (32%), Gaps = 57/413 (13%) Query: 6 RFRFYF--KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 RF K + S ++ + + L + + I V+ + ++ + + A A Sbjct: 18 RFTLNVVDKARLRSRSGTTVVMLVILLPVMLAVAAYCINVVYMEMARTELQISTDLATRA 77 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + GD+ E+I + + S+ +F + Sbjct: 78 AGRVL----AVTGDKAEAIEAAERLLEANPYLDR-----TLSIGDADIIFGKSNRTEENR 128 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + +++S + F DV + IE ++ Sbjct: 129 RYEFT----PDKKVNSVSLRAFGADDVPMLFPTMGVPIEFRP---------------IKQ 169 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK-------NALL 236 + + I +V+D SGSM + D N +P KM + + Sbjct: 170 AVATQVELDIAIVLDRSGSMA--FSHDEVAKNGSPSSAPPGW-KMGHAVPENARWLDTVA 226 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 +D++ D + L Y+ + + +++ + ++R + T+ Sbjct: 227 AVNGFLDIMEDSSHDERVSLSTYSDKSKADVKLTGDYTEIRAAMNAHSTKFKGGATNIGS 286 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 + + L K+ + +I ++DG +N + I + Sbjct: 287 GILEGGATLGDK--------------KLARSWASRVLIVMSDGIHNT---GIEPIPAAQQ 329 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I I T++ + N Q + K VS H++ ++ L F+ I++ + Sbjct: 330 VANEKIMIFTVTFSDEANVQEMEKVAVSGGGQHFHAKDSQQLTEAFRKIAKSL 382 >gi|114705525|ref|ZP_01438428.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] gi|114538371|gb|EAU41492.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] Length = 461 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 28/154 (18%), Positives = 57/154 (37%), Gaps = 19/154 (12%) Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V + +L P+ ++ S F Q + +K ++ +TDG Sbjct: 306 VKTSISNLSYGGGGYRPSTFIPAGLIWGLNVLSPPAPFEEQAYDPNNKLPRKALVLMTDG 365 Query: 340 ENNN------------------FKSNVNTIKICDKAKENFIKIVTISINASPNG-QRLLK 380 N +S+ +TI IC+ K + I+I T+ + + LLK Sbjct: 366 ANTMVFNSSDGRHRNARSGTEVAQSDRDTISICNNIKRSGIEIFTVGFMVNSSSALDLLK 425 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 C + E++++ + + L F I+ + + Sbjct: 426 ECATDGEHYFDATSPEELHSAFGRIADGLTQIRL 459 >gi|159044810|ref|YP_001533604.1| hypothetical protein Dshi_2267 [Dinoroseobacter shibae DFL 12] gi|157912570|gb|ABV94003.1| hypothetical protein Dshi_2267 [Dinoroseobacter shibae DFL 12] Length = 553 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 63/383 (16%), Positives = 129/383 (33%), Gaps = 35/383 (9%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + ++ +H N + + ++ + + + F+SIS + L D F +N Sbjct: 199 VSIIPYHAVVNVPDELLDEYAVSTQQTVSNCVRFTATDFQSISIDRTKTL-DRLAHFDRN 257 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF 160 + + Q + ++ + + D +N T+ D +++ Sbjct: 258 NSNLHTFNGDRLIGRPWCQV---GTYGAILPWSTSVTDLTNKVAELGASGNTATDIGMKW 314 Query: 161 IEHLLNQRYNQKIV-SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 LL+ + L ++ RP+ SDPE + + Sbjct: 315 AAALLDPGTQNIVDDMIDGGHLEADLAGRPVLY----------------SDPETIKVVVL 358 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 D + T LKN + + + + R + +P W + + Sbjct: 359 MTDGENTSQYDLKNEFKGTMSPVWWDEASDSY----FVYFQNRANNDKKPWWNVGRTPES 414 Query: 280 VTRDMDSLILK-----PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF-QKFI 333 T DS S + A L F + FF + +K+ Sbjct: 415 DTNGNDSDKGSWEWELGEPSARQLSHA--ELFGTFSVRFISEFFFRVKNDNQKTIREKYR 472 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 +T N+ ++ T +ICD+ K + I TI A GQ L++ C SS ++++V Sbjct: 473 NAVTRYTGNST-ADGYTEQICDQLKAQDVVIFTIGFEAPQRGQDLMRYCASSSGHYFDVE 531 Query: 394 NADSLIHVFQNISQLMVHRKYSV 416 + F +I+ + + S+ Sbjct: 532 GV-EISEAFSSIANTIQQLRLSL 553 >gi|114764812|ref|ZP_01443994.1| hypothetical protein 1100011001322_R2601_10469 [Pelagibaca bermudensis HTCC2601] gi|114542698|gb|EAU45721.1| hypothetical protein R2601_10469 [Roseovarius sp. HTCC2601] Length = 477 Score = 84.2 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 59/455 (12%), Positives = 130/455 (28%), Gaps = 64/455 (14%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 +E + +I+ + ++ G I ++ ++ +++ + A+LA A + + L Sbjct: 31 RAESGSMTIMAVALSLLMMIFGGIGIDMMYAELQRTKIQNTLDRAVLAAAD-LDNELDAQ 89 Query: 76 GDRFE-----SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI--------- 121 G + S+++ +D+ + + S I+N+ Sbjct: 90 GVVEDYMSKMSLADALVSVNVDEGLNYRTVTADGYRTMPSNFMQLIGIENMQAGGHSQAM 149 Query: 122 --VNSSRISMTHMANNRLDSSNNTIFYNMDVMTS-------------------YDYRLQF 160 +N +SM + +D + Y + Sbjct: 150 ERINKVEVSMVLDISGSMDDGDKMAELQTAASDFVDTLLDDGSEDLVSISLVPYSEHVNA 209 Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMG-ERPIFLIELVVDLSGSMHCAMNSDPEDVNS-AP 218 +L+ + L + L D + ++ P Sbjct: 210 GPEILSYLNVNYMHDDSYCLEMPNSAFNSAALDLSLTYDQMQHFQWNYSGSNSLTDTVCP 269 Query: 219 IC---QDKKRTK-MAALKNALLLFLDSIDL------------LSHVKEDVYMGLIGYTT- 261 Q + ++ ALK + L + G+I T Sbjct: 270 RYAYEQIRPWSQDAGALKTQISQLQPRAGTSIFMGMKWASALLDPSTRPIASGMIADGTV 329 Query: 262 -RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 V + ++ V + + D S AY + S + + Sbjct: 330 DAVFEGRPVAYSDTDVLKTIVLMTDGQHD---RSFRIQNWAYNDENEVEHWSQYNLWHYL 386 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 + S F D + +C AK I I +I S +G +++ Sbjct: 387 NYYVNSWNRSSFYYQKYD----AATGDTLLSSVCTAAKRQGILIWSIGFEVSDHGANVME 442 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +C SSP + + V + F I+Q + + + Sbjct: 443 SCASSPAHFFRVEGV-EISEAFSTIAQTLNQLRLT 476 >gi|260576512|ref|ZP_05844501.1| conserved hypothetical protein [Rhodobacter sp. SW2] gi|259021235|gb|EEW24542.1| conserved hypothetical protein [Rhodobacter sp. SW2] Length = 529 Score = 84.2 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 60/160 (37%), Gaps = 13/160 (8%) Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L Y+ R + W + + I T T +++ + + + + Sbjct: 382 LTLYSDRASTTSDYYW-------FSDSKWHTTIDGGT--TGSVQMTWPEVWAKWSVRYVA 432 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 S + TD E + + +V +ICD AK++ I I +I A NG Sbjct: 433 KDIYTKALGGSE--NSWFETFTD-EISYGQKDVRLQQICDAAKDSGIVIFSIGFEAPENG 489 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + L+ C S P ++N + F+ I+ + H + + Sbjct: 490 RNQLRDCASQPSNYFNATGV-QITTAFRAIATQLSHLRLT 528 >gi|170751925|ref|YP_001758185.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] gi|170658447|gb|ACB27502.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] Length = 568 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 59/200 (29%), Gaps = 54/200 (27%) Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + T ++ + + T+ ++ L+ + + + + Sbjct: 371 QRLTNNTNALKTLIN---NMAPSGSTNIHEGFMWGWRTLSPNSVFADGQPYASSANSSNA 427 Query: 327 LPFQKFIIFLTDGENNNFKSNVN------------------------------------- 349 K II +TDG N+ ++ Sbjct: 428 TNINKIIILMTDGTNSWGTNSSAPTGSLYFAAGYFRNANGTTPNPRLTTAYQNTNIADGN 487 Query: 350 ---------TIKICDKAKENFIKIVTISINAS-----PNGQRLLKTCVSSPEYHYNVVNA 395 T + C K I I TI + GQ LL+ C SSP+ Y ++ Sbjct: 488 TARKALDALTAEACANTKAVNISIYTIGFSVPTDPIDSAGQTLLRNCASSPDQFYLANSS 547 Query: 396 DSLIHVFQNISQLMVHRKYS 415 D LI F++I + + + Sbjct: 548 DDLIKAFKSIQASIGALRLT 567 >gi|56696619|ref|YP_166980.1| hypothetical protein SPO1742 [Ruegeria pomeroyi DSS-3] gi|56678356|gb|AAV95022.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 558 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 52/121 (42%), Gaps = 5/121 (4%) Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ Y L + ++ N + + + + +T E + T +C Sbjct: 442 GEAVRLTYAELFAKVSLAY--NAYYNFEFNSNAWAEWYTAAMTHKE--ASAKDQRTDHVC 497 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 D AK+ I + T+ A +G+R+LK C SS ++Y+ + + F +I+ + + Sbjct: 498 DAAKDEGIIVYTVGFEAPYSGRRVLKRCASSDSHYYDA-DGLEISDAFTSIASSIRKLRL 556 Query: 415 S 415 + Sbjct: 557 T 557 >gi|325106974|ref|YP_004268042.1| von Willebrand factor A [Planctomyces brasiliensis DSM 5305] gi|324967242|gb|ADY58020.1| von Willebrand factor type A [Planctomyces brasiliensis DSM 5305] Length = 396 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 60/419 (14%), Positives = 135/419 (32%), Gaps = 67/419 (15%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 + + ++ A + L+L+ F V + + + +AA AG + SR Sbjct: 14 LTRRGAMLVLIAALLSVMLILVVFTTDVAYMQLVRTQLHVSTDAAAKAGMEALARTESRG 73 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKE--SLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 R + +K + + I+ + + + + + RIS+ Sbjct: 74 QARVVAKDIFSKNLIGGRELKLHNKDIEFGRTDANPDGTWEFLPNERPFQAIRISVNLDD 133 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 N + + L F + L + S L+ I Sbjct: 134 NRQKGRNG-------------SVPLLFGKVLGQSSFATNHSSVAANLVHE---------I 171 Query: 194 ELVVDLSGSMHCA-----------MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 L +D S SM S P + P + K+ + LD + Sbjct: 172 VLCLDRSHSMCFDETGVDYAYPPGTPSYPAGYITPPNPVGSRWAKLQGAIQVFVDTLDDL 231 Query: 243 DLLSHVKEDVYMGLIGYTTR-----------VEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 ++ V + I + V ++ V + + +++ Sbjct: 232 QIVPDVGVVTWGSDITLSWSWYPFQGRSFPAVMVDVPLGQNLNLVSPAIAAKLGDIMMGG 291 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T+ + + ++ +LT++ S QK II ++DG+ + + N + Sbjct: 292 TNMSSGIDRSVSLLTANGTHSLA---------------QKTIILMSDGQ---WNAGRNPL 333 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 + A + I I TI+ Q +++ + +N + +SL F+ +++++ Sbjct: 334 DAANDAADKNITIHTIAFL--NGDQSVMRQIAERTGGKFFNAPDGESLEDTFKELAKML 390 >gi|114799275|ref|YP_759187.1| hypothetical protein HNE_0457 [Hyphomonas neptunium ATCC 15444] gi|114739449|gb|ABI77574.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 512 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 64/165 (38%), Gaps = 19/165 (11%) Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + + ++ +V + T + A+ + + + + Sbjct: 352 GPVPLTEDKTVLKDHVNALVAE---GGTAGHLGI--AWGWYLVSPEWAAIWPEASEPLPY 406 Query: 325 PSLPFQKFIIFLTDGENN------NFKSNVNTIKICD--KAKENFIKIVTISINASPN-- 374 K +I +TDG+ N + S ++ +CD KA I+I T+ + Sbjct: 407 RQPQTSKAVILMTDGDFNIEHPTASRDSFRQSMDLCDGMKASSRRIQIYTVGFQVPSSVQ 466 Query: 375 ----GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 G+ +L+ C +SP + ++ + + LI V+++I++ + + Sbjct: 467 RTGDGRTILEYCATSPSHAFSADSGEELIEVYRSIARSISDLRLK 511 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 81/264 (30%), Gaps = 63/264 (23%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M L R ++E N ++I A + L L G I + + +K+ +++A ++A Sbjct: 1 MMRLCRPVTRLPALRSAEGGNVAMITAFVIPCILALTGIAIDLQNTVRQKSKVQAALDSA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +LAGA N A +D ++ + + + Sbjct: 61 VLAGA------------LGRQAGNTAAETTLD-----VQTYALALFTDQGGGLDCDPVAV 103 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + + + + R + ++ L+ + V+ Sbjct: 104 TFDETNLDILGTVRCRQPT--------------------YLSSLIGHDELEFNVASTSTY 143 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + D+SGSM ++A LK A + +D Sbjct: 144 GV------GKLDVAFIFDVSGSM-------------------NSYNRLAQLKTAAVAAVD 178 Query: 241 SIDLLSHVKE-DVYMGLIGYTTRV 263 + S ++ V + + Y + Sbjct: 179 ELLPDSRERDGTVRLAIASYNHSL 202 >gi|259416688|ref|ZP_05740608.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259348127|gb|EEW59904.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 583 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 31/148 (20%), Positives = 62/148 (41%), Gaps = 23/148 (15%) Query: 270 SWGTEKVRQYVTRDMDSLILKPTD--STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 SW T + V ++L L D + +++ Y+ L D + + Sbjct: 456 SWKT----RTVDEPGEALQLSYADLFAETSLRYLYRDLFGDWMSNASWYW---------- 501 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + + ++ D + T+ +CD AKE I + TI A GQ++L+ C SS Sbjct: 502 -YNRLYSYVGD-----STKDSRTLAVCDAAKEKGIVVFTIGFEAPWRGQQVLQQCASSAS 555 Query: 388 YHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++Y+V + + F +I+ + + + Sbjct: 556 HYYDV-DGLEISDAFASIASAIRQLRLT 582 >gi|254460794|ref|ZP_05074210.1| conserved hypothetical protein [Rhodobacterales bacterium HTCC2083] gi|206677383|gb|EDZ41870.1| conserved hypothetical protein [Rhodobacteraceae bacterium HTCC2083] Length = 480 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 53/454 (11%), Positives = 132/454 (29%), Gaps = 64/454 (14%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA----SKMVSNL 72 E + I V+ L + G + ++ + ++ + AIL+ A ++ + Sbjct: 35 DESGSLVIFAVFMVLMILTIGGIGVDLMRSERDRTVLQHTLDRAILSAADLDQTQTPQAV 94 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + +D + K E+ S + F + +N++ + Sbjct: 95 VDDYFETAGLESFLSNVTVDQGINY-KTVGAEAQSITTTAFMKMAGVDTLNATAAGVAEE 153 Query: 133 ANNRLDSS-----NNTIFYNMDVMTSYDYRLQFIEHLLNQRY-NQKIVSFIPALLRIEMG 186 ++ S + ++ + F+ +L+ + VS +P + G Sbjct: 154 RIANVEISMVLDISGSMGIGSKMTQLRSAATSFVNTVLSPENEDLVSVSLVPYSQHVNAG 213 Query: 187 --------------------------ERPIFLIELVVDLSGSMHCAMNSDPEDVNS-API 219 + + D + + ++ P Sbjct: 214 PKIYNELNTNHRHNYSHCVEMADSAYSETELDLSVTYDQMQHFQWNYSGANQLTDTICPR 273 Query: 220 CQDKKRTKMAALKNALLLFLDSID----------------LLSHVKEDVYMGLIGYTT-- 261 ++ T + +AL + + +L V L+ Sbjct: 274 YSYERITAFSQDASALNAQIAQLQPRAGTQIFMGMKWAAAMLDPAFNPVVNALVTSNDID 333 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 V N ++ + + V D AY + S + ++ Sbjct: 334 SVFDNRPAAFDDTETLKTVVLMTDGKNSSSMRIK---SWAYDSSSDYYHWSRYNLWYYLR 390 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 + ++ F D + + IC+ +K+ I I +I +G ++ Sbjct: 391 RNVNRHYHSRYYWFTHD----AAQGDALLDDICNASKDAGIVIWSIGFEVDDHGADVMAN 446 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 C SSP + + V + F I++ + + + Sbjct: 447 CASSPSHFFRVEGI-EISEAFDAIARQINQLRLT 479 >gi|114798549|ref|YP_759188.1| hypothetical protein HNE_0458 [Hyphomonas neptunium ATCC 15444] gi|114738723|gb|ABI76848.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 460 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 30/158 (18%), Positives = 60/158 (37%), Gaps = 13/158 (8%) Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 I ++ K+ ++ ++ + QA+ + + + + Sbjct: 308 TPIPLTYNRNKLHDFIDDMTPRR-----NTAGHIGQAWGWYLVSPEWNSVWPAGSKALPY 362 Query: 325 PSLPFQKFIIFLTDGE-------NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 K +I ++DG+ N S ICDK KE + I T+ +A GQ Sbjct: 363 DEPDATKVVIMMSDGQYNETRHNNAYPSSVTQAEAICDKMKEKEVVIYTVGFDA-GYGQD 421 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +L C S+P + Y N L +++I++ + + S Sbjct: 422 VLNYCASNPAFAYKPTNGQELTEAYKSIARSISDLRIS 459 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 43/252 (17%), Positives = 84/252 (33%), Gaps = 58/252 (23%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + +E N ++I AL+++ + + GF I +K ++ A ++A+LA Sbjct: 6 RNFFRNESGNVAMIAALTIIPIVGIAGFAIDFQVTTTQKARVQQAVDSAVLAA------- 58 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 +S+ + RA + + ++ K L+ + N ++V I T Sbjct: 59 -------TKSMQDGKDRAY---SLKEANDYFKGILNQSNNSGLNCTNIDLV---YIDETE 105 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 ++ S NT + + D+ + Sbjct: 106 ELEGHVECSQNTTLSKVAGIRHLDFNVS----------------------SAATYGIGKL 143 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V D+SGSM V A + A+ L ++ + ED Sbjct: 144 EIAFVFDVSGSMANDNRMGNLKV---------------AAREAVNTLL-PVEGYAGDPED 187 Query: 252 VYMGLIGYTTRV 263 V + ++ Y T V Sbjct: 188 VRLAMVSYDTMV 199 >gi|171742038|ref|ZP_02917845.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283456833|ref|YP_003361397.1| hypothetical protein BDP_2000 [Bifidobacterium dentium Bd1] gi|171277652|gb|EDT45313.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283103467|gb|ADB10573.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium dentium Bd1] Length = 967 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 51/280 (18%), Positives = 96/280 (34%), Gaps = 57/280 (20%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 I + I LV+D+SGSM+ +K++ALK A+ Sbjct: 229 VIGKDASTSTTDTTPIDIALVLDVSGSMNDDFGGRG------------SPSKISALKTAV 276 Query: 236 LLFLDSI----DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 FLD D + + V + L+ Y ++ G + R +R D+ Sbjct: 277 NSFLDETAKTNDTIEDDNDKVKVALVKYANQIGTAT----GADGCRISNSRQSDTGNCTQ 332 Query: 292 -----TDSTPAMKQAYQILTSDKKRSFF--TNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T +K + L + +Q + +K++IF TDGE N++ Sbjct: 333 IVQELTTDAGLLKTSVNGLQAAGATYADAAMEVAQQALAGGRAGAKKYVIFFTDGEPNHW 392 Query: 345 K---SNVNTIKI--CDKAKENFIKIVTISI----------NASPNGQRLLKTCVS----- 384 +V I + K + +I I +++ N + + S Sbjct: 393 SGFDGDVANAAIKKSQELKNAGTTVYSIGIFDGANPSASVSSASNANKFMHGISSNYPNA 452 Query: 385 ----------SPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 S +Y+Y+ +A L +F +I + + + Sbjct: 453 TGYWNLGDRASGDYYYSASSATQLAQIFNDIQKTITEKHV 492 >gi|312878233|ref|ZP_07738157.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311794982|gb|EFR11387.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 1221 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 45/237 (18%), Positives = 79/237 (33%), Gaps = 52/237 (21%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 I R + + V+D SGSM DP K A Sbjct: 517 PINKGEREVNQQIKYIDLVFVLDSSGSMSWN---DPNGYR----------------KIAA 557 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 F+D++ ++ + + + V+ + R T+ Sbjct: 558 KSFVDALIQGD------RAAVVDFDDYGYLLQPLTTDFQTVKNAIDRI---DSWGGTNIA 608 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 ++ A L S K II LTDGE + +N+ T Sbjct: 609 EGIRIANHQLISQSSDDRI----------------KVIILLTDGE-GYYDNNLTT----- 646 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 +AK N I I TI + + + LL+ + + ++ V +A L VF+ I++++ Sbjct: 647 EAKNNGITIYTIGL-GTSVDENLLRNIATQTGGMYFPVSSASQLPQVFKRITEIVTE 702 >gi|300023811|ref|YP_003756422.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299525632|gb|ADJ24101.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 466 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 56/441 (12%), Positives = 132/441 (29%), Gaps = 40/441 (9%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 +K + + +I+F L + +IG + + ++ +A +AA+LAGA + + Sbjct: 29 IEKFSRDTRGDVAILFGLMALVLFAMIGLAVDYGRFVNARSQTIAATDAAVLAGARALQT 88 Query: 71 NLSRLGDRFESISNHAKRA------LIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 N ++ +A L +D F ++ T + + Sbjct: 89 NGGDQAAALRVAQSYYAQATKNRLSLSNDTINFAIADNATAMVTTGNAVITTPFMGLAGT 148 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + + + + N ++ L + Q+ + L + Sbjct: 149 GSLPILRKDGSDYSKAVLAVGGNAELNLEIAMMLDITGSMRGQKLTDMKAAASDLLNIVV 208 Query: 185 MGERPIFLIELVV-----DLSGSMHCAMNSDPEDVNSAPICQDKKRT-KMAALKNALLLF 238 ++ F ++ + D+ + + P ++ T K A + Sbjct: 209 WTDQSKFTSKVAIVPFAYDVRLPAAAFKKATGTTSTNYPCVVERTGTEKYTDAAPATGKY 268 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + S K + + + + V + T Sbjct: 269 VMVHNTSSTKKNKTTYSPTCDVASSAEVLPLTSDKSTLLAKVNGLSTA---GSTAGHIGT 325 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE------NNNFKSNVNTIK 352 A+ + + S +T+ + +K + +TDGE N + +++ Sbjct: 326 AWAW-YMLAPNWSSLWTSASSTPAAYNADNLRKIAVLMTDGEYNTQYTTNGVPDDSSSLT 384 Query: 353 ICDKA-----------------KENFIKIVTISINAS-PNGQRLLKTCVSSPEYHYNVVN 394 C A K I++ T+ L C + + YN Sbjct: 385 RCPNAANGVCSSAQAVSQCTAMKAKGIEVYTVGFQLDNQTAIDTLSQCATDSSHFYNSTT 444 Query: 395 ADSLIHVFQNISQLMVHRKYS 415 D+L F++I+ + S Sbjct: 445 GDALKAAFRDIALKISTLYLS 465 >gi|255261929|ref|ZP_05341271.1| conserved hypothetical protein [Thalassiobium sp. R2A62] gi|255104264|gb|EET46938.1| conserved hypothetical protein [Thalassiobium sp. R2A62] Length = 478 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 79/466 (16%), Positives = 158/466 (33%), Gaps = 73/466 (15%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLL-IGFLIYVLDWHYKKNSMESANNAAILAGAS-- 66 + K+ E F IIF+L + +LL G + ++ + + ++ + A+LA A Sbjct: 23 FVKEFARDEDGAF-IIFSLFMFVLMLLTAGMALDLMRYETHRARLQGTLDRAVLAAADLD 81 Query: 67 KMVSNLSRLGDRFE--SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + +S + + D F +S+ +D + +S + T + Sbjct: 82 QTLSPAAVVTDYFAKAGLSSFLTSTTVDQGLNY------RIISAQGNMTMPTTFMRLSGQ 135 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSY----------DYRLQFIEHLLNQRYNQKIV 174 + +++ A SN I +D+ S FI+ ++ I Sbjct: 136 TELAIRGDATAEERVSNVEISLVVDISGSMGRNNKLSTLRTASHTFIDTVIRPETEDLIS 195 Query: 175 -SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA---- 229 + IP ++ G P +L VD + ++ +P D N+A + T+ Sbjct: 196 LNIIPYTAQVNAG--PDIFDQLTVDQKHNFSHCIDFEPADFNTAALDVPPVSTRTYKQMQ 253 Query: 230 --------ALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNI--EPSWGTE-- 274 + N + S + + +D + + R I WG Sbjct: 254 HFQYGWSSSYVNNPGCPMQSYERIVPFSQDATSLKSTVTSLRARANTAIHLGMKWGVSML 313 Query: 275 --KVRQYVTRDMDSLILKPTDSTPAMKQAYQI--------LTSDKKRSFFTNFFRQGVKI 324 R VT + + + P + + AY L +D + + Sbjct: 314 DPTFRPIVTAMIANNKVDPEFAGRPV--AYNDPETLKTIVLMTDGQNVDTYRISDEFYST 371 Query: 325 PS--LPFQKFIIFLT-------DGENNNF-------KSNVNTIKICDKAKENFIKIVTIS 368 PS + ++ +F D + N + +++ ICD AK I + TI Sbjct: 372 PSQIAHWDRYQLFFFTNNYIDRDIDQNYYYKKFTATQADTMLQSICDAAKAEGILVWTIG 431 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 S + + C SSP + + V L F +I++ + + Sbjct: 432 FEVSNHAAGEMLDCASSPSHFFRVEGV-ELSEAFASIARQINQLRL 476 >gi|306824220|ref|ZP_07457590.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309801684|ref|ZP_07695804.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] gi|304552423|gb|EFM40340.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221626|gb|EFO77918.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] Length = 967 Score = 80.3 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 49/276 (17%), Positives = 94/276 (34%), Gaps = 49/276 (17%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + I LV+D+SGSM+ +K++ALK A+ Sbjct: 229 VTGKDASTSTTDTTPIDIALVLDVSGSMNDDFGGRG------------SPSKISALKTAV 276 Query: 236 LLFLDSI----DLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILK 290 FLD D + V + L+ Y ++ RQ T + ++ + Sbjct: 277 NSFLDETAKTNDTIEDDNNKVKVALVKYANQIGTATGADGCRISNSRQSDTGNCTQIVQE 336 Query: 291 PTDSTPAMKQAYQILTSDKKRSFF--TNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--- 345 T +K + L + +Q + +K++IF TDGE N++ Sbjct: 337 LTTDAGLLKTSVNGLQAAGATYADAAMEVAQQALAGGRAGAKKYVIFFTDGEPNHWSGFD 396 Query: 346 SNVNTIKI--CDKAKENFIKIVTISI----------NASPNGQRLLKTCVS--------- 384 +V I + K + +I I +++ N + + S Sbjct: 397 DDVANAAIKKSQELKNAGTTVYSIGIFDGANPSASVSSASNANKFMHGISSNYPNATGYR 456 Query: 385 ------SPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 S +Y+Y+ +A L +F +I + + + Sbjct: 457 SLGDRASGDYYYSASSATQLAQIFNDIQKTITEKHV 492 >gi|218458490|ref|ZP_03498581.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 220 Score = 79.9 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 76/247 (30%), Gaps = 51/247 (20%) Query: 31 MSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRAL 90 L G ++V D K ++ A ++A LA A+ + + I A Sbjct: 3 PVLLGAAGMAVHVGDMLLSKQQLQEAADSAALATATALANG---------KIQTSEAEAY 53 Query: 91 IDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV 150 A+ F+ + L +I + +++ + +S + Y + V Sbjct: 54 ---ARNFVAGQMANYLQS---------GVDIKGGTSVNV------QTSTSGKSTSYQVTV 95 Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 SYD + + + + S + + + L +D SGSM + + Sbjct: 96 SPSYDLSVNPLMQAVGFKTQHLSTS--GTTVGGHSQTQGSISMFLALDKSGSMGESTATV 153 Query: 211 PED----------------------VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 ED + + TK+ ALK A ++ Sbjct: 154 NEDDPTETFTYDCNLHYNSKNNKWVYDKCTGSRTNYYTKIEALKIAAGNLFSQLNSADPN 213 Query: 249 KEDVYMG 255 + V G Sbjct: 214 AQYVRTG 220 >gi|254440702|ref|ZP_05054195.1| hypothetical protein OA307_117 [Octadecabacter antarcticus 307] gi|198250780|gb|EDY75095.1| hypothetical protein OA307_117 [Octadecabacter antarcticus 307] Length = 590 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 53/162 (32%), Gaps = 25/162 (15%) Query: 271 WGTEKVRQYVTRDM--DSLILKPT----DSTPAMKQAYQILTSDK-----------KRSF 313 G YV S + T + ++Q L +D+ + Sbjct: 436 NGFNSRLAYVNASPVDASGPGEGTRYVDNGDELYHASWQQLFADRSYFLINNHYFLDAYY 495 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 + I+ N ++N IC A+ I I T++ A Sbjct: 496 AGAWSWNEYWGTDNSIDHLIV-------NGSEANTRLSNICAAARAQGIVIYTVAFEAPS 548 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 GQ L+ C SS ++++V + + F I+ + + K + Sbjct: 549 GGQTALQDCASSSSHYFDV-DGTDISGAFSAIASDIRNLKLT 589 >gi|218528586|ref|YP_002419402.1| hypothetical protein Mchl_0543 [Methylobacterium chloromethanicum CM4] gi|218520889|gb|ACK81474.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 518 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 64/182 (35%), Gaps = 40/182 (21%) Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + T Q ++ T+ + + +L+ + F+ GV Sbjct: 343 QPLTRLTTSQTQLTNAIAAMTVIGDTNIPIGLAWGWHLLSPNG-------PFKDGVAYGE 395 Query: 327 LPFQKFIIFLTDGENNNFKS---------------------------------NVNTIKI 353 + +KFI+ +TDG+N + S + + Sbjct: 396 IKTKKFIVLMTDGQNQSAVSSSDNRSYYSGLGFIWQNRIGTTSNDNAVRTKAIDTRLTLL 455 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 CD ++ I++ + + + +LK C +SP ++V N+ L VF+ I+ + + Sbjct: 456 CDNIRKARIQVFAVRVEVNDGDSAVLKACATSPNMFFDVKNSSGLPAVFRAIADQISELR 515 Query: 414 YS 415 S Sbjct: 516 IS 517 >gi|89055932|ref|YP_511383.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] gi|88865481|gb|ABD56358.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] Length = 612 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 35/77 (45%), Gaps = 1/77 (1%) Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG +++ N + ICD A I + I A GQR+++ C S +++V Sbjct: 535 DGVVGQSQADTNLMAICDVANAAGIIVYAIGFEAPDRGQRVMEHCASVDANYFDVEG-RE 593 Query: 398 LIHVFQNISQLMVHRKY 414 + F +I++ + + Sbjct: 594 ISEAFASIARSINQLRL 610 >gi|83955719|ref|ZP_00964299.1| hypothetical protein NAS141_07930 [Sulfitobacter sp. NAS-14.1] gi|83840013|gb|EAP79189.1| hypothetical protein NAS141_07930 [Sulfitobacter sp. NAS-14.1] Length = 480 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 66/467 (14%), Positives = 146/467 (31%), Gaps = 69/467 (14%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + +I+ +M + + G + + +++ +++ ++ A+LA A Sbjct: 24 LRKRLTRFAREDDGLVTILALFMIMMMIAVGGIQLDFMRHEMERSRLQAVSDRAVLAAAD 83 Query: 67 --KMVSNLSRLGDRF--ESISNHAKRALIDDAKRFIKNHIKESLSGYSA-----VFYNTE 117 +M + + D F ++ ++DD F + S + + F E Sbjct: 84 LDQMRDPKTVVEDYFAKSGMTEFLSNVVVDDGLNFRTVTVDASKNMDTQFIGRFGFPTLE 143 Query: 118 IQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + + + LD S + N + L ++ + VS + Sbjct: 144 VPAHSQAEERVAKVEISLVLDISGSMATNNRLGEVQNAADIFLDTVLKDENQDLISVSLV 203 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK-NALL 236 P ++ G P+ + + V+ ++ D D +S + + +M + N Sbjct: 204 PYSEQVNAG--PLIMDRMNVNRKHDYSHCIDFDNGDFDSIAMNSSTRYNQMQHFQWNYDG 261 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 D + + + R KN + V R S+ L + Sbjct: 262 RNNYRDDTVCPRYDYERITPFSQNKRTLKNQ--------IDDLVPRAGTSIFLGMKWAAA 313 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ--KFIIFLTDGEN--------NNFKS 346 + A++ + + + + + + K +I +TDG N N + S Sbjct: 314 MLDPAFRDINNSLVNAGYVDREFYNRPASYTDSETLKTVILMTDGANDNSYRIRSNYYDS 373 Query: 347 NVN--------------------------------------TIKICDKAKENFIKIVTIS 368 + ICD AK I I +I Sbjct: 374 DSEYVHWNKYNLWWYLRREVDSRYWGYFYYHKYNKTLGNTLLSNICDAAKAKRIVIWSIG 433 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++ C SSP + + V L F+ I++ + + + Sbjct: 434 FEVDDEDVPAMQDCASSPSHFFRVEGV-ELSEAFRAIARQINQLRLT 479 >gi|83941160|ref|ZP_00953622.1| hypothetical protein EE36_02988 [Sulfitobacter sp. EE-36] gi|83846980|gb|EAP84855.1| hypothetical protein EE36_02988 [Sulfitobacter sp. EE-36] Length = 480 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 68/474 (14%), Positives = 139/474 (29%), Gaps = 83/474 (17%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + +I+ +M + + G + + +++ +++ ++ A+LA A Sbjct: 24 LRKRLTRFAREDDGLVTILALFMIMMMIAVGGIQLDFMRHEMERSRLQAVSDRAVLAAAD 83 Query: 67 --KMVSNLSRLGDRF--ESISNHAKRALIDDAKRFIKNHIKESLSGYSA-----VFYNTE 117 +M + + D F ++ ++DD F + S + F E Sbjct: 84 LDQMRDPKTVVEDYFAKSGMTEFLSNVVVDDGLNFRTVTVDASKDMDTQFIGRFGFPTLE 143 Query: 118 IQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + + + LD S + N + L ++ + VS + Sbjct: 144 VPAHSQAEERVAKVEISLVLDISGSMATNNRLGEVQDAADIFLDTVLKDENEDLISVSLV 203 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK-NALL 236 P ++ G P+ + + V+ ++ D D +S + + +M + N Sbjct: 204 PYSEQVNAG--PLIMDRMNVNRKHDYSHCIDFDNGDFDSIAMNSSTRYNQMQHFQWNYDG 261 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 D + + + R K + D + T Sbjct: 262 RNNYRDDTVCPRYDYERITPFSQNKRTLK---------------NQIDDLVPRAGTSIFL 306 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIP---------SLPFQKFIIFLTDGENNNFKSN 347 MK A +L + + V K +I +TDG N+N Sbjct: 307 GMKWAAAMLDPAFRDINNSLVNAGHVDREFYNRPASYTDSETLKTVILMTDGANDNSFRI 366 Query: 348 VNT----------------------------------------------IKICDKAKENF 361 NT ICD AK Sbjct: 367 SNTYYNEDSEYVHWNRYNLWWYLRREVNSRYWGYFYYQKYNKSLGNTLLSNICDAAKAKR 426 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I I +I ++ C SSP + + V L F+ I++ + + + Sbjct: 427 IVIWSIGFEVDDEDVPAMQDCASSPSHFFRVEGV-ELSEAFRAIARQINQLRLT 479 >gi|260425757|ref|ZP_05779737.1| conserved hypothetical protein [Citreicella sp. SE45] gi|260423697|gb|EEX16947.1| conserved hypothetical protein [Citreicella sp. SE45] Length = 479 Score = 78.8 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 59/466 (12%), Positives = 148/466 (31%), Gaps = 77/466 (16%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ E + + + + M ++ G I ++ ++ +++ + A+LA A + + Sbjct: 28 RRFAGDESGSMTYMAVVLSMMMMIFGGLGIDMIYAELQRTKVQNTLDRAVLAAAD-LDNE 86 Query: 72 LSRLGDRFE-----SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 L G + ++++ +D+ + + + E + F + + + Sbjct: 87 LEAQGVVEDYMDKMALADALISVDVDEGLNY-RTVVAEGYKTMPSNFMQILGVDNLQAYG 145 Query: 127 ISMTHMANNRLDSS-----NNTIFYNMDVMTSYDYRLQFIEHLLNQ-RYNQKIVSFIPAL 180 ++ N+++ S + ++ N + D FI+ LL + + +S +P Sbjct: 146 LAEATERINKVEVSLVLDISGSMDDNDKLANMQDAAGTFIDTLLAEGNEDLVSISLVPYS 205 Query: 181 LRIEMGER--------------------PIFLIELVVDLSGS----MHCAMNSDP-EDVN 215 ++ G +D S + H N D + Sbjct: 206 EQVNAGPEILSYLSANWKHGYSHCIEMPNSVFGSAALDFSRTYEQMQHYQWNYDGYNNTL 265 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 S +C ++ A + ++ L G + + WGT Sbjct: 266 SDTVCPRYGYERIQAWSHDASALKAQVNQLQP-----RAGTSIF-------MGMKWGTAL 313 Query: 276 VRQYVTRDMDSLILKPT------------DSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + +I + + D T +K + RS+ + + Sbjct: 314 LDPSTRPIASGMIARGSVDQVFEGRPVAYDDTDVLKTVVLMTDGQHDRSYRIQDWAYNSE 373 Query: 324 IPSLPFQKFIIFLT------DGEN--------NNFKSNVNTIKICDKAKENFIKIVTISI 369 + ++ ++ E N + IC AK I I ++ Sbjct: 374 SEYAHWNRYNLWYYLSRYVSSYERSSFYYQKYNADLGDALLGSICAAAKAQGIIIWSVGF 433 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +G ++++C SSP + + V + F I+ + + + Sbjct: 434 EVGDHGADVMESCASSPAHFFRVEGV-EITEAFSTIAHTLNQLRLT 478 >gi|254477542|ref|ZP_05090928.1| conserved hypothetical protein [Ruegeria sp. R11] gi|214031785|gb|EEB72620.1| conserved hypothetical protein [Ruegeria sp. R11] Length = 523 Score = 78.8 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 24/117 (20%), Positives = 43/117 (36%), Gaps = 2/117 (1%) Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + + L + + S + + D + K N T ICD AK Sbjct: 408 QLTHAELFARTSLRYVYQRLFADWMGNSAAKNSWYYGVYDSWGTSTK-NARTKAICDAAK 466 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I + TI A G +LK C SS ++++V + F +I+ + + + Sbjct: 467 ARGIVVYTIGFEAPSGGVSVLKDCASSDAHYFDVQGL-EISDAFASIATSIRQLRLT 522 >gi|312793553|ref|YP_004026476.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312180693|gb|ADQ40863.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 726 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 75/217 (34%), Gaps = 52/217 (23%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM DP K A F+D++ Sbjct: 42 VLDSSGSMSWN---DPNGYR----------------KIAAKSFVDALIQGD------RAA 76 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 ++ + + + V+ + R T+ ++ A L S Sbjct: 77 VVDFDDYGYLLQPLTTDFQTVKNAIDRI---DSWGGTNIAEGIRIANHQLISQSSDDRI- 132 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 K II LTDGE + +N+ T +AK N I I TI + + Sbjct: 133 ---------------KVIILLTDGE-GYYDNNLTT-----EAKNNGITIYTIGL-GTSVD 170 Query: 376 QRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 + LL+ + + ++ V +A L VF+ I++++ Sbjct: 171 ENLLRNIATQTGGMYFPVSSASQLPQVFKRITEIVTE 207 >gi|163759224|ref|ZP_02166310.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] gi|162283628|gb|EDQ33913.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] Length = 541 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 71/193 (36%), Gaps = 45/193 (23%) Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 G+ +E + + K+R V + T+ + +++L+ Sbjct: 358 GFGCEMEPLVPLTTDFSKIRTTVKALEAN---GSTNMLEGVMWGWRVLS-------DREP 407 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENN----------------------------------N 343 F QG +K +IFLTDG+N+ N Sbjct: 408 FAQGAPKSDASVEKIMIFLTDGQNSFGNLNNDLGSAYTSMGYLVDGRLDGMTAANIGQTN 467 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQ-RLLKTCVSSPEYHYNVVNADSLIHVF 402 + T C+ AKE+ + I TI + + G ++L+ C +S ++++ + L +F Sbjct: 468 NALDKKTKAACENAKEDGVTIYTIRLEEADVGTGKMLEECATSSAHYFDAPSRQQLTPIF 527 Query: 403 QNISQLMVHRKYS 415 I + +V + + Sbjct: 528 DAIKKGVVKLRLT 540 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 38/245 (15%), Positives = 80/245 (32%), Gaps = 72/245 (29%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + NF+++F + + ++ G + + +K+ +++A ++A L ++R Sbjct: 8 LKDTSGNFALVFGILAVPVMVAGGLAVDYVGLSVEKSKLQNAVDSAAL--------LIAR 59 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 GD E + K A + N K ++S Sbjct: 60 AGDMSE--TQAMKLAKTTITTNYGINVAKVAVSMVDG----------------------- 94 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + V S D L F + ++ + + I Sbjct: 95 ------------DATVKASMDQALVF-GGFMGRKNAAVSAEATATYAYTK------YEIA 135 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D +GSM K+ +L+NA++ +D ++ L KE + Sbjct: 136 LVLDTTGSMLGG--------------------KLTSLQNAVIGLVDGMEALGLNKEQLKF 175 Query: 255 GLIGY 259 ++ Y Sbjct: 176 AVVPY 180 >gi|90418244|ref|ZP_01226156.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90337916|gb|EAS51567.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 489 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 68/485 (14%), Positives = 147/485 (30%), Gaps = 98/485 (20%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + ++ + ++ AL ++ +++ G I ++ ++ ++ + +LA AS + R Sbjct: 16 LRAKAGSIPVMTALMLVPMIVISGGAIDLIAHERLRSVLQDGLDRGVLAAASLTQTRPPR 75 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + K + D K + + + ++ T+ + +T A+ Sbjct: 76 ETIESFLKAAVTKGSYALDVK---ADELSNAKRVEASATAVTDTAFLRLIGIDKLTVEAH 132 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR------------------------YN 170 + I ++ + S R + Sbjct: 133 AEAEEKRKNIEISLLLDMSGSMRFDKSGSYPGPSGAMRINYLRPAAKSFMDMVLADGAED 192 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVV-----DLSGSMHCAMNSDPEDVNSAPICQDKKR 225 VS +P ++ +G P+ L D S C + P + + Sbjct: 193 YTTVSIVPYAGQVSIG--PVLFDALARNRRQHDRS---SCFQFGRNDFTLGVPDFANLPQ 247 Query: 226 TKM-------AALKNALLLFLDSIDLLSHVKEDVYMGLI-------GYTTRVEKNIEPSW 271 T+ ALK A + D G G T S Sbjct: 248 TQHFTQANHHDALKKAGEAQITEPWWCPDDPHDPRPGTTPDFVAGEGKDTDRTSVSFLSN 307 Query: 272 GTEKVRQYVTRDMDSLIL-KPT---------------DSTPAMKQAYQILTSDKKRSFFT 315 R+Y+ R +D+ L T P +++A + ++ Sbjct: 308 D----REYLKRQIDNYKLYDGTGTPIALKWGLLLLDPAIQPMLREAARYRALSEELDIDA 363 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGE------------------NNNFKSNVNTI------ 351 F + KF++ +TDG N + +++ ++ Sbjct: 364 RFSNRPASFTDPDTMKFLVLMTDGAISSQRIPKDASKPVQYYNNGSLNTDLYSVGDAERF 423 Query: 352 --KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 +C AK+ + + TI + + + + C S E Y V NA + F++I+ + Sbjct: 424 AAALCTAAKQKNVIVFTIGFDVNDTAAKQMSNCASGAERFYRV-NALDIQDAFKSIATAI 482 Query: 410 VHRKY 414 K Sbjct: 483 QKIKL 487 >gi|99081991|ref|YP_614145.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] gi|99038271|gb|ABF64883.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] Length = 582 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 24/119 (20%), Positives = 52/119 (43%), Gaps = 4/119 (3%) Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ +Y L +D + + + + +F + G + + T+ IC+ Sbjct: 467 AVQLSYPDLFADTSLRYLRDRLFGDWMSNANYYWFSGLFSSVG---STTKDARTLDICEA 523 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 AK + + TI A GQ +L+ C SS ++Y+V + + F +I+ + + + Sbjct: 524 AKAKGVVVFTIGFEAPSRGQEVLQACASSASHYYDV-DGLEISDAFASIASAIRQLRLT 581 Score = 37.5 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 67/202 (33%), Gaps = 37/202 (18%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 ++ E + + V+ ++ G + ++ + ++ + A+LA Sbjct: 18 RAIVDRMRQFRRDESGVLAKPMIMIVVLMFMIGGLGMDMVRLERDRTKLQYTLDRAVLAA 77 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 A +L + D + +++ +S G E+ Sbjct: 78 A-----DLDQPLDPEA----------------VVLDYMSKSGLGDYTTVVVPEVSPTAKR 116 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + S+ S N +FY+ + Y+L+ I L ++ A+ I Sbjct: 117 VKASVDTNF---TASWMNNVFYDDYIRNPDTYQLEPITLPL--------LASSTAVESIG 165 Query: 185 MGERPIFLIELVVDLSGSMHCA 206 E I LV+D+SGSM Sbjct: 166 NVE-----ISLVLDVSGSMRSN 182 >gi|296108502|ref|YP_003620203.1| hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] gi|295650404|gb|ADG26251.1| Hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] Length = 352 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 50/258 (19%), Positives = 94/258 (36%), Gaps = 55/258 (21%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + R + I +V+DLSGSM + + + +++ +K A F+ Sbjct: 87 VGEPKPVAREGYNIMMVLDLSGSME----------ITDMLLHGRPVSRLLVVKRAAEQFV 136 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + D + +GLI + TR ++ V + L K T A+ Sbjct: 137 E--DRVGD-----RIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVG 189 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + L + + II LTDG NN+ + +K + AK+ Sbjct: 190 LAVKRLQDVPSKG------------------RVIILLTDGANNSGV--LAPLKAAELAKQ 229 Query: 360 NFIKIVTIS-------------INASP-----NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + IKI TI A + + L K + ++ + +SL + Sbjct: 230 DGIKIYTIGLGSEADPRALTGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSI 289 Query: 402 FQNISQLMVHRKYSVILK 419 +Q I+QL ++ ++ Sbjct: 290 YQTINQLETVKQEQATVR 307 >gi|148361167|ref|YP_001252374.1| Von Willebrand factor type A (vWA) domain-containing protein [Legionella pneumophila str. Corby] gi|148282940|gb|ABQ57028.1| conserved hypothetical protein [Legionella pneumophila str. Corby] Length = 344 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 50/258 (19%), Positives = 94/258 (36%), Gaps = 55/258 (21%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + R + I +V+DLSGSM + + + +++ +K A F+ Sbjct: 79 VGEPKPVAREGYNIMMVLDLSGSME----------ITDMLLHGRPVSRLLVVKRAAEQFV 128 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + D + +GLI + TR ++ V + L K T A+ Sbjct: 129 E--DRVGD-----RIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVG 181 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + L + + II LTDG NN+ + +K + AK+ Sbjct: 182 LAVKRLQDVPSKG------------------RVIILLTDGANNSGV--LAPLKAAELAKQ 221 Query: 360 NFIKIVTIS-------------INASP-----NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + IKI TI A + + L K + ++ + +SL + Sbjct: 222 DGIKIYTIGLGSEADPRALTGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSI 281 Query: 402 FQNISQLMVHRKYSVILK 419 +Q I+QL ++ ++ Sbjct: 282 YQTINQLETVKQEQATVR 299 >gi|52843052|ref|YP_096851.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52630163|gb|AAU28904.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 352 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 50/258 (19%), Positives = 94/258 (36%), Gaps = 55/258 (21%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + R + I +V+DLSGSM + + + +++ +K A F+ Sbjct: 87 VGEPKPVAREGYNIMMVLDLSGSME----------ITDMLLHGRPVSRLLVVKRAAEQFV 136 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + D + +GLI + TR ++ V + L K T A+ Sbjct: 137 E--DRVGD-----RIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVG 189 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + L + + II LTDG NN+ + +K + AK+ Sbjct: 190 LAVKRLQDVPSKG------------------RVIILLTDGANNSGV--LAPLKAAELAKQ 229 Query: 360 NFIKIVTIS-------------INASP-----NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + IKI TI A + + L K + ++ + +SL + Sbjct: 230 DGIKIYTIGLGSEADPRALTGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSI 289 Query: 402 FQNISQLMVHRKYSVILK 419 +Q I+QL ++ ++ Sbjct: 290 YQTINQLETVKQEQATVR 307 >gi|54295680|ref|YP_128095.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|53755512|emb|CAH17011.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|307611729|emb|CBX01432.1| hypothetical protein LPW_31221 [Legionella pneumophila 130b] Length = 344 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 50/258 (19%), Positives = 94/258 (36%), Gaps = 55/258 (21%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + R + I +V+DLSGSM + + + +++ +K A F+ Sbjct: 79 VGEPKPVAREGYNIMMVLDLSGSME----------ITDMLLHGRPVSRLLVVKRAAEQFV 128 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + D + +GLI + TR ++ V + L K T A+ Sbjct: 129 E--DRVGD-----RIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVG 181 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + L + + II LTDG NN+ + +K + AK+ Sbjct: 182 LAVKRLQDVPSKG------------------RVIILLTDGANNSGV--LAPLKAAELAKQ 221 Query: 360 NFIKIVTIS-------------INASP-----NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + IKI TI A + + L K + ++ + +SL + Sbjct: 222 DGIKIYTIGLGSEADPRALTGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSI 281 Query: 402 FQNISQLMVHRKYSVILK 419 +Q I+QL ++ ++ Sbjct: 282 YQTINQLETVKQEQATVR 299 >gi|257062895|ref|YP_003142567.1| hypothetical protein Shel_01450 [Slackia heliotrinireducens DSM 20476] gi|256790548|gb|ACV21218.1| uncharacterized protein [Slackia heliotrinireducens DSM 20476] Length = 744 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 50/369 (13%), Positives = 113/369 (30%), Gaps = 62/369 (16%) Query: 61 ILAGASKMVSNLSR---LGDRFESISNHAKRALIDDAKRFI----KNHIKESLSGYSAVF 113 I+AG +++S + + + +A D+ + + + + F Sbjct: 239 IVAGYPELISREGSGFLFSAEDAKLIDDSVQAFNDEIQNLVEECQQEDMNIYFVSVQDEF 298 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY---- 169 E + + A + Y++ + Sbjct: 299 AGHEAY-TSDEYINRVKINAQPQDLKPEGPSAYSVHPNDKGAAAYARCVQRMIDDIQECE 357 Query: 170 -NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 N + +++ + + L +D SGSM + Sbjct: 358 DNGVDPRTLMGDSKVDPNDASSRHVVLALDTSGSMDGE--------------------PL 397 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K A F +I + L+ Y + I+ + ++ V D Sbjct: 398 NETKTATREFASTIFKSDAD-----VCLVSYDSSARNVIDSTDNEYALKAAV---RDLSA 449 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T+ A++ +Y+ L S ++ I+ ++DGE N Sbjct: 450 GGGTNIEDALRVSYERL------------------EGSGSDKRIIVLMSDGEANEGLVGD 491 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTC---VSSPEYHYNVVNADSLIHVFQNI 405 + I ++ K++ + I T+ S + + + ++SP HY V +A L + F +I Sbjct: 492 DLIAYANEIKDDGVTIYTLGFFQSVSDKAECQRVMEGIASPGCHYEVDDASQLRYFFGDI 551 Query: 406 SQLMVHRKY 414 + ++ Sbjct: 552 GDDINGTRF 560 >gi|254501086|ref|ZP_05113237.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] gi|222437157|gb|EEE43836.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] Length = 465 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 71/475 (14%), Positives = 146/475 (30%), Gaps = 90/475 (18%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + K ++ + IFA V+ +++ G I + + + A +AA L+ A+ Sbjct: 2 IKALISKFNRNQDGSILPIFAGMVLVLVVIGGAAIDISRAVNAREKLAYAIDAAALSVAT 61 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI-VNSS 125 + + + R I N RA + DA+ + S T + +N+ Sbjct: 62 DLSTTVLRDNQIKTRIENSF-RANLSDAEFLDQAIDNLDFDVDSNAGTVTVSSSAGLNNY 120 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 +++ + L + +V + +D L + + S + + Sbjct: 121 FLNIPGFGKDGLGPDVFNFGTSAEVNYSRFDVELALVVDVTG--------SMAGDMGALR 172 Query: 185 MGERPIFLIELVVDLSGS----------------MHCAMNSDPEDVNSAPICQDKKRTKM 228 + I + D S S + ++ S C +++ + Sbjct: 173 DAAEEVVDILIEDDASNSASKVRISLVPYSQGVNLGSYASTVTNGSTSWRNCVNEREGQ- 231 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG--------------TE 274 + D++ + + GL Y + E +W T Sbjct: 232 -------QKYTDAVYNYDGTNSEYFHGLQSYFIWDYGSSE-NWSSARDDCPSSSLQPLTS 283 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF--FTNFFRQGVKIPSLPFQKF 332 ++ + T + + L+ + + ++ G P +KF Sbjct: 284 DKNTLISDIRNLSSGGGTGGQTGVAWGWYTLSPNWTSLWPTDSDPEPYGNGTPDDDVKKF 343 Query: 333 IIFLTDGENN------------------------------------NFKSNVNTIKICDK 356 + +TDG+ N N +CD Sbjct: 344 ALIMTDGDFNAQYGKEERTTCTGRGRNRVCTTNEYWVERYHRYSDYNDPPATRARTLCDA 403 Query: 357 AKENFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 K I+I T+ + S G L+ C S +Y+Y N D LI F NI++ + Sbjct: 404 MKAENIEIFTVFFDTGGSAFGDDLMSYCASGSDYYYEADNKDELITAFSNIAKRI 458 >gi|126738776|ref|ZP_01754472.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] gi|126719957|gb|EBA16664.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] Length = 530 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 51/130 (39%), Gaps = 5/130 (3%) Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 S + A + +Y L + + + ++ + D N+ K Sbjct: 405 SSPYGGNN---AQQLSYPDLFAYTSLKYLYKYIYADWMGSYSARSEWYYGVYDYHGNSTK 461 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 N T +C AK I + TI A NG +L+ C SS ++++V + + F++I Sbjct: 462 -NTRTSNVCSAAKAQGIIVYTIGFEAPSNGVAVLQDCASSDSHYFDV-DGLEIRDAFESI 519 Query: 406 SQLMVHRKYS 415 + + + + Sbjct: 520 ATSIRKLRLT 529 >gi|312622403|ref|YP_004024016.1| von willebrand factor type a [Caldicellulosiruptor kronotskyensis 2002] gi|312202870|gb|ADQ46197.1| von Willebrand factor type A [Caldicellulosiruptor kronotskyensis 2002] Length = 1166 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 47/237 (19%), Positives = 82/237 (34%), Gaps = 52/237 (21%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 I R + + V+D SGSM DP K A Sbjct: 483 PINKGEREINQQINYIDLVFVLDSSGSMSWN---DPNGYR----------------KIAA 523 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 F+D++ ++ + + + V+ + R T+ Sbjct: 524 KSFVDALIQGD------RAAVVDFDDFGYLLQPLTTDFQAVKNAIDRI---DSWGGTNIA 574 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 ++ A Q L S + S K II LTDGE + +N+ T Sbjct: 575 EGIRIANQQLIS----------------LSSEDRIKVIILLTDGE-GYYDNNLTT----- 612 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 +AK N I I TI + + + LL+ + + ++ V +A L VF+ I++++ Sbjct: 613 EAKNNGITIYTIGL-GTSVDENLLRDIATQTGGMYFPVSSASQLPQVFKRITEIVTE 668 >gi|222529355|ref|YP_002573237.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222456202|gb|ACM60464.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 1188 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 46/237 (19%), Positives = 80/237 (33%), Gaps = 52/237 (21%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 I R + + V+D SGSM DP K A Sbjct: 483 PINKGEREINQQVNYIDLVFVLDSSGSMSWN---DPNGYR----------------KIAA 523 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 F+D++ ++ + + + V+ + R T+ Sbjct: 524 KSFVDALIQGD------RAAVVDFDNFGYLLQPLTTDFQAVKNAIDRI---DSWGGTNIA 574 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 ++ A Q L S K II LTDGE + +N+ T Sbjct: 575 EGIRIANQQLISRSSEDRI----------------KVIILLTDGE-GYYDNNLTT----- 612 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 +AK N I I TI + + + LL+ + + ++ V +A L VF+ I++++ Sbjct: 613 EAKNNGITIYTIGL-GTSVDENLLRDIATQTGGMYFPVSSASQLPQVFKRITEIVTE 668 >gi|87310694|ref|ZP_01092822.1| BatA [Blastopirellula marina DSM 3645] gi|87286675|gb|EAQ78581.1| BatA [Blastopirellula marina DSM 3645] Length = 355 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 59/275 (21%), Positives = 95/275 (34%), Gaps = 55/275 (20%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIF------LIELVVDLSGSMHCAMNSDPEDVNSA 217 L +V I AL R G IE+VVD SGSM Sbjct: 52 LPGALTLTAMVLLIIALARPREGREQAIVENDGIAIEMVVDRSGSM----------QAMD 101 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 D+ ++ A+K F+ D L D+ +GLI + + P+ + Sbjct: 102 FQLGDEHVDRLTAIKKVAGDFVTGGDNLDGRLSDL-VGLITFAGYADGVTPPTLDHAFLV 160 Query: 278 QYVT--RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + + + + T A+ A + L R+ KI S K II Sbjct: 161 SQLNHSQIVTNRSEDGTAIGDAISLAVEKL--------NALDARRKEKIQS----KIIIL 208 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISI---------------------NASPN 374 LTDGENN ++ I+ + A+ IK+ TI + + Sbjct: 209 LTDGENNA--GDLEPIQAAELAQTMGIKVYTIGVGTKGRAPMPVTDMFGRQSMQWMSVNI 266 Query: 375 GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + L+ S + ++ + DSL ++ I QL Sbjct: 267 DEETLQKVASITGGKYFRATDTDSLAKIYGEIDQL 301 >gi|54298847|ref|YP_125216.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] gi|53752632|emb|CAH14067.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] Length = 344 Score = 76.1 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 49/258 (18%), Positives = 94/258 (36%), Gaps = 55/258 (21%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + R + I +V+DLSGSM + + + +++ +K A F+ Sbjct: 79 VGEPKPVAREGYNIMMVLDLSGSME----------ITDMLLHGRPVSRLLVVKRAAEQFV 128 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + D + +GLI + TR ++ V + L K T A+ Sbjct: 129 E--DRVGD-----RIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVG 181 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + L + + II LTDG NN+ + +K + AK+ Sbjct: 182 LAVKRLQDVPSKG------------------RVIILLTDGANNSGV--LAPLKAAELAKQ 221 Query: 360 NFIKIVTIS-------------INASP-----NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + IKI TI A + + L + + ++ + +SL + Sbjct: 222 DGIKIYTIGLGSEADPRALTGDFFAPTLSAELDEKTLEEMAKMTGGRYFRATDPESLQSI 281 Query: 402 FQNISQLMVHRKYSVILK 419 +Q I+QL ++ ++ Sbjct: 282 YQTINQLETVKQEQATVR 299 >gi|86137906|ref|ZP_01056482.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] gi|85825498|gb|EAQ45697.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] Length = 543 Score = 76.1 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 24/126 (19%), Positives = 48/126 (38%), Gaps = 2/126 (1%) Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 S A++ Y L + + + + + N+ K N Sbjct: 419 GSGQSGTAVELNYAELFARTSLKYIYRYIFYEWMNFYDARDDWYYGIYSSHGNSTK-NAR 477 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 T +C+ AK I + TI A NG +L+ C SS ++++V + + F +I+ + Sbjct: 478 TRSVCEAAKAKGIVVYTIGFEAPSNGVAVLRDCASSDAHYFDV-DGLEIKDAFASIATSI 536 Query: 410 VHRKYS 415 + + Sbjct: 537 RQLRLT 542 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 41/372 (11%), Positives = 115/372 (30%), Gaps = 45/372 (12%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 ++ L R + + + ++ L + G + ++ + ++ + A Sbjct: 13 LYALRRGAQSLRAFRRDDSGVMAYPTVAFFLAMLAVGGIGVDLMRMERDRTVLQYTLDRA 72 Query: 61 ILAGA--------SKMVSNLSRLGDRFE---------SISNHAKRALIDDAKRFIKNHIK 103 +LA A + +V + E + +A ID F + ++ Sbjct: 73 VLAAADLDQTQPPAVVVQDYLNKAGLGEYYQEPIVESGLGYKRVQATIDAT--FEAHLLR 130 Query: 104 ESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH 163 S VF ++ + ++ IS+ + ++S++ + D Sbjct: 131 FSNGNDLPVFATSKAEESIDGLEISLVLDVSGSMNSNSRLSNLKVAAKDFID------TM 184 Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + N + +S +P ++ + + ++D ++ S+ + A Sbjct: 185 VANTTDGKMSISVVPYATQVSLPDD-------LIDQYTTVGENPYSNCIN-FEAAEYNSA 236 Query: 224 KRTKMAALKNAL-----LLFLDSIDLLSHVKEDVYMGLIGYTTR------VEKNIEPSWG 272 + + L+ ++ + V + + Sbjct: 237 SLSTLDTLERSMHFTPWGYSNRDMRTYYSSPRLVRSPVCDERASREVLPLQKDATTLKNF 296 Query: 273 TEKVRQYVTRDMDSLILKPTDS-TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + +D + T P+ + A +++ S +F + + K Sbjct: 297 IQNLSAGGNTSIDVGMKWGTALLDPSARPAISAISTGIGASVPGDFSDRPAEYSDSDTIK 356 Query: 332 FIIFLTDGENNN 343 I+ +TDG+N + Sbjct: 357 IIVLMTDGQNTS 368 >gi|332558842|ref|ZP_08413164.1| hypothetical protein RSWS8N_07295 [Rhodobacter sphaeroides WS8N] gi|332276554|gb|EGJ21869.1| hypothetical protein RSWS8N_07295 [Rhodobacter sphaeroides WS8N] Length = 566 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 19/69 (27%), Positives = 35/69 (50%), Gaps = 1/69 (1%) Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 N T +ICD A+ I + +++ A GQ LL+ C S+ ++Y V + VF +I+ Sbjct: 498 NERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYYATVGP-QIRTVFHSIA 556 Query: 407 QLMVHRKYS 415 + + + Sbjct: 557 SHITQLRLT 565 >gi|329848522|ref|ZP_08263550.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] gi|328843585|gb|EGF93154.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] Length = 486 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 66/201 (32%), Gaps = 25/201 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK---P 291 + + S ++ Y GL+ + I P V + + I Sbjct: 290 VKNQVASSKVVMPDPTTPYGGLVQTSQTCLNPILPLSNDATVVTNTIKGLVVNIGGYKPE 349 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T M LT + + + +K I+ +TDG N + + I Sbjct: 350 TYIPGGMIWGVNALTPPAPFT-----EGKPYDANNKEPRKTIVLMTDGANTLYANTSGGI 404 Query: 352 KI----------------CDKAKENFIKIVTISINASP-NGQRLLKTCVSSPEYHYNVVN 394 + CD AK I+I TI + + LK C + +++++ + Sbjct: 405 AVANATQVAVTYSDQIRVCDYAKSKKIEIYTIGFDVTDSKALSTLKACATDAQHYFDAKS 464 Query: 395 ADSLIHVFQNISQLMVHRKYS 415 + LI F+ I + + + Sbjct: 465 SADLIKAFETIGGKLSKVRLT 485 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 51/291 (17%), Positives = 93/291 (31%), Gaps = 66/291 (22%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R++ + S N ++IF L++ + + +G I K S + A ++A+LA A Sbjct: 8 MRYFVNAFLRSRGGNTTMIFGLAIFAIMAALGTAIDFAVLQRAKRSTQDALDSAVLAAA- 66 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 +V+N + D K+ + KE+L T + + Sbjct: 67 -IVNNSNE-----------------GDLKKLAADVFKENLGAADLDAKVTAFKYDAKART 108 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + T SYD + + N Y + ++ G Sbjct: 109 VKATAQ-------------------GSYDPVIMQLFGFKNLPY-----AVTSDAIKAADG 144 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + LV+D + SM +N P K+ LK A + +I Sbjct: 145 T---LEVALVLDNTWSMSATVNGTP---------------KIDILKTAAQGLVSTILTKD 186 Query: 247 HVKEDVYMGLIGYTTRV---EKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + K+ V + ++ Y V N W Y T + T + Sbjct: 187 N-KDYVKIAVVPYADYVNVGMANRNMPW-VSVAADYSTTSTKTCKTVSTAT 235 >gi|77463970|ref|YP_353474.1| hypothetical protein RSP_0399 [Rhodobacter sphaeroides 2.4.1] gi|77388388|gb|ABA79573.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] Length = 566 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 19/69 (27%), Positives = 35/69 (50%), Gaps = 1/69 (1%) Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 N T +ICD A+ I + +++ A GQ LL+ C S+ ++Y V + VF +I+ Sbjct: 498 NERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYYATVGP-QIRTVFHSIA 556 Query: 407 QLMVHRKYS 415 + + + Sbjct: 557 SHITQLRLT 565 >gi|126462813|ref|YP_001043927.1| hypothetical protein Rsph17029_2052 [Rhodobacter sphaeroides ATCC 17029] gi|126104477|gb|ABN77155.1| conserved hypothetical protein [Rhodobacter sphaeroides ATCC 17029] Length = 566 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 19/69 (27%), Positives = 35/69 (50%), Gaps = 1/69 (1%) Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 N T +ICD A+ I + +++ A GQ LL+ C S+ ++Y V + VF +I+ Sbjct: 498 NERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYYATVGP-QIRTVFHSIA 556 Query: 407 QLMVHRKYS 415 + + + Sbjct: 557 SHITQLRLT 565 >gi|221639828|ref|YP_002526090.1| hypothetical protein RSKD131_1729 [Rhodobacter sphaeroides KD131] gi|221160609|gb|ACM01589.1| Hypothetical Protein RSKD131_1729 [Rhodobacter sphaeroides KD131] Length = 566 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 19/69 (27%), Positives = 35/69 (50%), Gaps = 1/69 (1%) Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 N T +ICD A+ I + +++ A GQ LL+ C S+ ++Y V + VF +I+ Sbjct: 498 NERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYYATVGP-QIRTVFHSIA 556 Query: 407 QLMVHRKYS 415 + + + Sbjct: 557 SHITQLRLT 565 >gi|91977525|ref|YP_570184.1| hypothetical protein RPD_3057 [Rhodopseudomonas palustris BisB5] gi|91683981|gb|ABE40283.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 464 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 67/469 (14%), Positives = 134/469 (28%), Gaps = 67/469 (14%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 +++R R + + + N ++IFAL+++ L IG I + +M++A ++ L Sbjct: 6 IITRSRKVASRFVGDDGGNIAVIFALTLLPILGFIGAAIDYSRASRARTAMQAALDSTAL 65 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + + ++ + + E + R + + S + Sbjct: 66 MVSKDLGADKIKTSEVSEKAQTYFNSLYTGTEARGVTLTTNYTAKDDSGSSTVVVNGDGA 125 Query: 123 NSSRISMTHMANNRLDSSNNTIFY-------NMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 S+ + S T + M + + L L +K S Sbjct: 126 VSTHFMKMFGFPSLAIGSAATATWGGTRLRVAMALDVTGSMVLNGSTKL---AEMKKAAS 182 Query: 176 FIPALLRIEMGERPIFLIELV-------VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + LR + I +V V S + D D + K +TK Sbjct: 183 ALVDTLRASAQSKDDLYISVVPFAQMVNVGSSNIDASWIKWDVWDETEGSCSKSKFKTKT 242 Query: 229 A-----------ALKNALLLFLDSIDLLS-----HVKEDVYMGLI----GYTTRVEKNIE 268 D +D + G T+ + Sbjct: 243 DCEDNGRTWTVTDRSKWKGCVTDRDQPADTTKDAPTSDDTRFPALRTLLGTTSCPAQIFP 302 Query: 269 P--SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 ++ ++ D + T+ M A+ L + K P+ Sbjct: 303 MTSAYAATDAQKIKDVIDDLVADGGTNQPIGMAWAWMSLQQGNPLNTPA-------KDPN 355 Query: 327 LPFQKFIIFLTDGENN--------------NFKSNVNTIKICDKAK---ENFIK--IVTI 367 + II L+DG N + K + +CD K N + + TI Sbjct: 356 YKYTDAIILLSDGLNTMDRWPDYGDGQRQFDGKIDARQKLLCDNIKLPDSNGKRPVVYTI 415 Query: 368 SINAS-PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +N + +LK C + A + F I + + + Sbjct: 416 QVNTTGDPESTILKYCA-DGGNFFATTTASGIGTAFAQIGSSLSKLRIA 463 >gi|163738634|ref|ZP_02146048.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] gi|161387962|gb|EDQ12317.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] Length = 558 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 23/117 (19%), Positives = 47/117 (40%), Gaps = 2/117 (1%) Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + ++ L + + + S + + D N + K + T +C+ AK Sbjct: 443 RLSWGELFGETSLKYLYKYLYGDWMGQSSARSVWYYGVYDYWNTSTK-DARTRAVCNAAK 501 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I + TI A +G +LK C SS +H++V + F +I+ + + + Sbjct: 502 NQGIVVYTIGFEAPSSGTAVLKDCASSDAHHFDVRGL-EIRDAFASIATSIRQLRLT 557 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 45/359 (12%), Positives = 102/359 (28%), Gaps = 80/359 (22%) Query: 29 SVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA--------SKMVSNL-------S 73 ++S L + G + ++ + ++ + A+LA A + +V + Sbjct: 43 FLLSMLAVGGIGVDLMRMERDRTILQYTLDRAVLAAADLDQPLPPAAVVQDYLSKAGLNK 102 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 K+ F + +K S ++ + + ++ IS+ Sbjct: 103 YYTPPVAETGLGFKKVQSTIDTTFETHMLKFSSGQDMPLYATSRAEESIDGLEISLVLDV 162 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER----- 188 + + S++ + D + N N+ +S IP ++ + Sbjct: 163 SGSMGSNSRLANLKVAAKDFVD------TMIANTIDNKMSISIIPYATQVSLPTELMDQY 216 Query: 189 ---------------------PIFLIELVVDLSGSMHCAMNSDPEDVNS---APICQDKK 224 +D + SD + +P C+D Sbjct: 217 NTTDEHAYSNCVNFVGSHFQTTALSTTQELDRTMHFSVWSGSDYRASANPLDSPTCEDSA 276 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 ++ + ID L T ++ + WGT + + Sbjct: 277 NREILPFQKDANTLKGFIDGLQAEG----------NTSID--VGMKWGTALLDPSARPAI 324 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 +L PA +++ +F + K I+ +TDG+N N Sbjct: 325 SALASGGGAMVPA-------TFNNRPAAFNDHETV-----------KVIVLMTDGKNTN 365 >gi|163742980|ref|ZP_02150363.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] gi|161383663|gb|EDQ08049.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] Length = 560 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 23/117 (19%), Positives = 47/117 (40%), Gaps = 2/117 (1%) Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + ++ L + + + S + + D N + K + T +C+ AK Sbjct: 445 RLSWGELFGETSLKYLYKYLYGDWMGQSSARSVWYYGVYDYWNTSTK-DARTRAVCNAAK 503 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I + TI A +G +LK C SS +H++V + F +I+ + + + Sbjct: 504 NQGIVVYTIGFEAPSSGTAVLKDCASSDAHHFDVRGL-EIRDAFASIATSIRQLRLT 559 >gi|254496635|ref|ZP_05109500.1| conserved hypothetical protein [Legionella drancourtii LLAP12] gi|254354157|gb|EET12827.1| conserved hypothetical protein [Legionella drancourtii LLAP12] Length = 342 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 51/247 (20%), Positives = 94/247 (38%), Gaps = 55/247 (22%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 L + ER + I + +DLSGSM I D+ +++ +KNA F+ Sbjct: 79 LGEPKPIEREGYNIMMALDLSGSME----------IPDMILHDRPASRLTVVKNAAEQFV 128 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 D L +GLI + +R ++ + V + L K T A+ Sbjct: 129 R--DRLGD-----KIGLILFGSRAYLQTPLTYDRQTVLLRIEDATVGLAGKTTSIGDAVG 181 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + L + ++ + II LTDG NN+ + +K + AK+ Sbjct: 182 LAVKRLDAVPQKG------------------RVIILLTDGANNSGI--LEPLKAAELAKD 221 Query: 360 NFIKIVTISINASPNGQR-----LLKTCVSS-------------PEYHYNVVNADSLIHV 401 IKI TI + A+ + + L++ + ++ + +L + Sbjct: 222 EGIKIYTIGLGAATDPRALTNGFLMQAAAADLDEETLKEMSAMTGGRYFRATDTATLNSI 281 Query: 402 FQNISQL 408 ++ I+QL Sbjct: 282 YKTINQL 288 >gi|84502751|ref|ZP_01000870.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] gi|84389146|gb|EAQ01944.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] Length = 470 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 68/470 (14%), Positives = 142/470 (30%), Gaps = 66/470 (14%) Query: 2 HLLS-RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 ++L R R + +E + +++ + + G + V+ ++ ++ +A+ Sbjct: 12 NVLRDRVRRHVVDFARAEDGVMLALVMFMLLTMMTVAGIGVDVMRTEMERTRIQQVIDAS 71 Query: 61 ILAGASK---------------------MVSNLSRLGDRFESISNHAKRALIDDAKRFIK 99 LA A K +S L E+ + FI+ Sbjct: 72 TLAAAHKDNALDPKQVVLDYFDKAALASYISADDILVGGGETSTAVEVNLTAQVKTPFIR 131 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + ES V + +S +S+ + +D + + D L Sbjct: 132 HLGNESF----NVPARGRAEQAYGNSEVSLVLDISGSMDDNRRMSRLHRAANEFVDTVLT 187 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN---- 215 ++ VS IP + +G + + S D Sbjct: 188 P------DSVDRVSVSLIPYTGDVNVGWDIFSRMNVRQLHDYSYCVQFTPDDFSTTAIDP 241 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE-----DVYMGLIGYTTRVEKNIEPS 270 Q + + + A N + S + ++ + + + + R +I Sbjct: 242 EDAYIQGQHFSHVDARFNYISCPTQSYETVTPFSQNNAALEAQINRLTGRERTSIHIGIK 301 Query: 271 WGTE----KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK--- 323 WG R V +D+ I+ L + N + +K Sbjct: 302 WGAAMLDEAFRPLVNDLVDNSIVDEAFRDRPAPFTSNTLKVIVVMTDGMNTETKRIKEFA 361 Query: 324 --IPSLPFQKFIIFLTDGENNNFKS---------------NVNTIKICDKAKENFIKIVT 366 P + + D +N+ S N IC+ AK N I I + Sbjct: 362 YDTPDMRAHWARHAMDDWDNDVDGSVEDHLFDTYYDTAIGNALLQNICNAAKANGIIIYS 421 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + ++ C SSP + Y V + F +I+Q + + ++ Sbjct: 422 IGFEINNDAAQEMEDCASSPSHFYRVEGV-QISEAFSSIAQQLKQLRLTL 470 >gi|31789431|gb|AAP58546.1| hypothetical protein [uncultured Acidobacteria bacterium] Length = 327 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 47/258 (18%), Positives = 92/258 (35%), Gaps = 40/258 (15%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 L L+ Q ++ + A LR I L++DLS SM M S Sbjct: 55 VPLSAAAVLIGLAIMQPVIPYSQADLRSRG-----LDIVLLLDLSSSMQEEMGSGQSLKT 109 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 RT+M A+K+A+ F+ + D +GL+ ++ ++ + Sbjct: 110 ---GTTAAGRTRMDAVKDAVRTFVR-------GRRDDRIGLVVFSDNAYVISPLTFDHQY 159 Query: 276 VRQYV--TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + Y+ L T + A +L R + + + Sbjct: 160 LLDYLGFVDGEILLGEGQTAIGDGLALASAVLARQAGRDARGH--------------QVV 205 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-----GQRLLKT-CVSSPE 387 + TDGE+N + I++ +AK I++ I ++ G +LL+ V++ Sbjct: 206 VLFTDGESNRGRD---PIEVVGEAKSAGIRVHVIGVDLDAEVKTRPGVQLLRRGVVAAGG 262 Query: 388 YHYNVVNADSLIHVFQNI 405 ++ + L+ + I Sbjct: 263 RYFAADSERDLLTASRTI 280 >gi|89069885|ref|ZP_01157219.1| hypothetical protein OG2516_06272 [Oceanicola granulosus HTCC2516] gi|89044561|gb|EAR50680.1| hypothetical protein OG2516_06272 [Oceanicola granulosus HTCC2516] Length = 536 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 18/135 (13%), Positives = 43/135 (31%), Gaps = 1/135 (0%) Query: 282 RDMDSLILKPTDSTPAMK-QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 R + ++ Y+ L S + + + D Sbjct: 401 RRDNRFPHGGGRIEDNVRRLTYEELFSRVSVYYNAYYHHYLQNFDRTELDTWYWDFLDMS 460 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + N IC AK +++ T+ + +++ C SS + + V L Sbjct: 461 LSTSAKNARLEAICTAAKNQGVQVFTVGFEVEDDEAIIMEDCASSRAHFFRVSGGGDLTT 520 Query: 401 VFQNISQLMVHRKYS 415 F++I++ + + + Sbjct: 521 AFESIARQITELRLT 535 >gi|83312851|ref|YP_423115.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] gi|82947692|dbj|BAE52556.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] Length = 464 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 59/473 (12%), Positives = 132/473 (27%), Gaps = 75/473 (15%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME------------ 54 R + + K N +II A+ ++ + IG + V + K+ M Sbjct: 2 IRNSLSRLMTCRKGNMAIILAIGLLPIITTIGLGVDVARAYAVKSRMSAALDAAALAVGS 61 Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL-----SGY 109 S+ A L+ ++ + + + + A + D + Sbjct: 62 SSGTDAQLSAVAQKFFDANYPTGALGAHPSVAVKVTGDVISASAVAEVDTVFMKVVGLND 121 Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 V ++ + + ++M + ++NN D + Sbjct: 122 VPVHADSTVNRQIAGLELAMVLDNTGSMTTNNNIQAVRDAANQLTDILFGTATVHPYLKI 181 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 S + + DL G C + + T+ Sbjct: 182 ALVPYSAAVNVGSVAPSLITTGDTYAPNDLLGWKGCVVERAGANGVGDTSAATAPWTRYK 241 Query: 230 ALKNALLLFLDSIDL----LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 L A+ D+ + + G I P + M+ Sbjct: 242 WL-PAVDNNYDATKSSTVLANPSNGNASTGP---NLGCPTAITPLTNVKATLTPAINAME 297 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T S M ++L+ + F +G+ + + K +I +TDG+N +K Sbjct: 298 AWSRGGTLSDVGMAWGLRVLSPEP-------PFTEGLPWGTPKWSKAVILMTDGDNQFYK 350 Query: 346 S-----------------------------------------NVNTIKICDKAKENFIKI 364 N +C+ K I + Sbjct: 351 LTSTTGGNKVNSAVNSDYGAYGRLDELGRIGTTNATTAKTTINTRLTSVCNAMKAKNIIV 410 Query: 365 VTISINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 T++ + + + + K C + +++ + D L F+ I+ + + + S Sbjct: 411 YTVTFTSGINQATKDIYKACATDASKYFDSPSQDELKSAFRAIATSLSNLRVS 463 >gi|13473479|ref|NP_105046.1| hypothetical protein mll4092 [Mesorhizobium loti MAFF303099] gi|14024228|dbj|BAB50832.1| mll4092 [Mesorhizobium loti MAFF303099] Length = 477 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 62/467 (13%), Positives = 143/467 (30%), Gaps = 73/467 (15%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 NF+I+F + L GF + + + K+ ++ +AA+ + A + + + + Sbjct: 16 RHSGGNFAILFGFAASVLALAAGFSVDISQLYNAKSGLQGVVDAAVTSTARDLTTGVIKE 75 Query: 76 GDRFESISN----------------HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 D +++ N R ++D ++ ++ + VF Q Sbjct: 76 ADASKAVQNFLVANSMAGILQPDQIVLDRLVVDRTANTVQADAHVDVALFFPVFGMGNTQ 135 Query: 120 NIVNS----SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL-------QFIEHLLNQR 168 + S +A + + + D + +++ ++ Sbjct: 136 RVTASTTSLYSDKTIEVAMMLDVTGSMAANWWAKTDKIGDLQAAASTAVENLLDNNIDPN 195 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSG---------SMHCAMNSDPEDVNSAPI 219 + V+ +P + G + V+ +G S + Sbjct: 196 NPRVRVAIVPYAEAVNTG---GLADSVFVEQAGGSNLPPPVPSAGAPIPVGSSVTLRPDK 252 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVR 277 C +++ K + + + +E + + R+ +P T + Sbjct: 253 CATERKDK-DGYADYSSDGPSELRRNNQNQEYL--AKVNRDDRMGTCPKPELIPLTADKQ 309 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG-VKIPSLPFQKFIIFL 336 + + D T A++ Y +L+ + + G + K I + Sbjct: 310 KLLDTIADFKAAGVTAGGIAVQWGYYMLSPSWRSTIVNARLGSGPANFDNRKVGKVAILM 369 Query: 337 TDGE---------------NNNFKSNVNTIKICDKAKENFIKIVTISINAS--------- 372 TDG+ N S N ICD K + I+I TI + Sbjct: 370 TDGQFNTAFAAGRGAPRSQNAGQMSRSNAESICDNMKRDGIEIFTIGFDLDDPSMTSTER 429 Query: 373 PNGQRLLKTCVSSP----EYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + +L+ C ++ +++Y L F I Q + + Sbjct: 430 DQAKSVLQDCSTADTSTLKHYYEAATGPELDEAFNAIVQNIERLTIA 476 >gi|302382135|ref|YP_003817958.1| von Willebrand factor A [Brevundimonas subvibrioides ATCC 15264] gi|302192763|gb|ADL00335.1| von Willebrand factor type A [Brevundimonas subvibrioides ATCC 15264] Length = 560 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 58/176 (32%), Gaps = 52/176 (29%) Query: 292 TDSTPAMKQAYQILTSDKKRS--FFTNFFRQ----------GVKIPSLPFQKFIIFLTDG 339 TD+ A++ A + + + T + G + +K II +TDG Sbjct: 384 TDNYTALRTAVNNMIASGNTNVPLGTMWGWHTLSPNAPFGDGRPYGTERLKKIIIIMTDG 443 Query: 340 ----------------------------------------ENNNFKSNVNTIKICDKAKE 359 +N + T +C K+ Sbjct: 444 ANVMSDTTSPNDSTYNGLGYIWQNRLGIVSGNDTTRRTRMDNRFDHATAATEDMCGNMKD 503 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I++ T+++ Q LL+ C + ++++ V +A + F I+ + + + S Sbjct: 504 KDIEVYTVAVQVDSTAQTLLRRCATDTDHYFPVDSAAGIGAAFDRIAGAIENLRIS 559 >gi|144898053|emb|CAM74917.1| conserved hypothetical protein, secreted [Magnetospirillum gryphiswaldense MSR-1] Length = 460 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 69/467 (14%), Positives = 149/467 (31%), Gaps = 72/467 (15%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + + +IIFAL+++ L +G + + K+ + A +AA LA S + Sbjct: 3 LRRLMHDRRGTVAIIFALALIPLSLSVGLAVDTARAYAVKSKLSQALDAAALAVGSSTGT 62 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + K + +D A F + + +S + T + +V I+++ Sbjct: 63 AAELQQIGQKFFDANFKDSGLDAAGSFSVSVTGDVVSANGSAQVQTTLMQLVGIDTIAVS 122 Query: 131 HMANNRLDSSN----------NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI--VSFIP 178 A ++ + ++ D + ++ L R + V+ +P Sbjct: 123 ESAQVIRSIKGLELALVLDNTGSMTTSDNIGALRDAAQELVDILFGGRADHPTLRVAVVP 182 Query: 179 ALLRIEMGERPIFLIE-----LVVDLSGSMHCAMNSDPEDVNSAPICQDK--KRTKMAAL 231 + G LI +L G C + + +P + + A+ Sbjct: 183 YSASVNPGPIAPTLISGNDAYAPTNLLGWKGCVIERVGRAMEDSPASTAPWLRYQWLPAI 242 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 N S + G G I P G + + + + Sbjct: 243 DNYYDATKASTVRADPSQG---NGGTGPNLGCPTPITPLTGVKATVDSAIQALRAWSRGG 299 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS----- 346 T M ++L+ + F +G+ + + K +I +TDG+N +K Sbjct: 300 TMGDIGMAWGLRVLSPEP-------PFTEGLAWNTPKWAKAVILMTDGDNQFYKLTSTTG 352 Query: 347 ------------------------------------NVNTIKICDKAKENFIKIVTISIN 370 N ++C K+ I + TI+ Sbjct: 353 PNKVNSAVNSDYSGYGRLDQYGALGTTSTTTAKSVINTRLTQVCQAMKDKGITVYTITFT 412 Query: 371 A--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + + + K C SS ++ + L F+ I+ + + S Sbjct: 413 SGINQATKDIYKACASSTAKWFDSPSQADLRASFRAIATELSQLRVS 459 >gi|294678572|ref|YP_003579187.1| hypothetical protein RCAP_rcc03056 [Rhodobacter capsulatus SB 1003] gi|294477392|gb|ADE86780.1| conserved hypothetical protein [Rhodobacter capsulatus SB 1003] Length = 647 Score = 73.4 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 49/121 (40%), Gaps = 2/121 (1%) Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 P +Y L K + T G ++ + + + + T K+C Sbjct: 528 APLYDVSYDHLYKTKNWNLNTVAGLLGKPYGRSAGTQY-ELMANAVYDTSVKDARTKKLC 586 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 D AK I I +++ +A G+ LLK C S Y+Y V ++ L F +I+ + + Sbjct: 587 DLAKSKGIYIFSVAADAPSGGKTLLKYCSSGTSYYYEVQGSN-LSTAFASIAASISSLRL 645 Query: 415 S 415 + Sbjct: 646 T 646 >gi|284040938|ref|YP_003390868.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283820231|gb|ADB42069.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 359 Score = 73.4 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 55/297 (18%), Positives = 96/297 (32%), Gaps = 41/297 (13%) Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 ++S+ ++ ++ + Y L I P ++R Sbjct: 48 QRLKMSLGQISAEPGSWMGRQSVQSLLSLGRYLLPLCMFLG----TACLLIALARPQIIR 103 Query: 183 IEMGERP-IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 E+ I L +D+S SM D T++AA + F+ Sbjct: 104 ELREEQSEGIDIMLAMDVSVSMS---------------ESDILPTRLAAARRVAQAFVR- 147 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQ 300 + + +GL+ + + + QY+ D +I T A+ + Sbjct: 148 ------GRRNDRIGLVIFAGEAFSLCPLTTDYNLLNQYLNDLNDGMIRTSGTAIGDALAR 201 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + S T + K II L+DG+N + T AK Sbjct: 202 CINRMRDRPAASSDTTQAKTEQWKS--ERSKVIILLSDGDNTAGNLDPITAA--SLAKAF 257 Query: 361 FIKIVTISI--------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 IKI TI++ AS + +LK + + V++ L VF ISQL Sbjct: 258 NIKIYTIAVGQPVASASEASTVDEGILKKIATIGKGSFFRAVDSGRLKTVFAQISQL 314 >gi|323138519|ref|ZP_08073587.1| hypothetical protein Met49242DRAFT_2975 [Methylocystis sp. ATCC 49242] gi|322396153|gb|EFX98686.1| hypothetical protein Met49242DRAFT_2975 [Methylocystis sp. ATCC 49242] Length = 458 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 63/461 (13%), Positives = 133/461 (28%), Gaps = 89/461 (19%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 SR R + A+E+ + ++IF L+++ +++G + ++ + + A LA Sbjct: 13 SRVRKRLRNFRANERGSIAMIFGLALIPMFMMMGAAVDYTQAVTVRSRLNHLADRAALAA 72 Query: 65 ASKMVSNLSR--LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 S ++SN D K + ++ + Sbjct: 73 VKAAAQKESDCVANPAGNNVSNFQGCGQKDIIKAGVAAGVQY-----------------M 115 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 N + ++ S++ ++ V S D I L+ + Sbjct: 116 NGDPLMRGADRKPTIELSSSEGSWSATVNYSADIPTN-IARLMGVQTIPVNGKVTS---N 171 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA--------------PICQDKKRT-- 226 I +G L++D S SM SD A D+ + Sbjct: 172 IALGTHMYLNFHLLLDRSMSMGIGATSDDISRLQALTGCAFACHSEGYEAQYYDQPKAQG 231 Query: 227 ---KMAALKNALLLFLDSIDLLS--HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 ++ L++A + +++ + +E + MG+ + V +E + V V Sbjct: 232 IRFRIDDLRDATGALVAQAKMVASANAREHIQMGVYAFNHHVSPLVEMTSDLTNVANAVK 291 Query: 282 RDMDSLILKPTDSTPAMKQAYQI--------------------LTSDKKRSFFTNFFRQG 321 T + A+ +T + +T + + Sbjct: 292 NLDLPTHDDGTQAADAVTWLVANKIKGNGTGLTSAAPLEIVFLVTDGVEDGIYTGWNKMV 351 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK---IVT------------ 366 LP + ++T + F + CD K + T Sbjct: 352 GPTG-LPLPWWPSWMTKAPTSAFP-----VTACDALKSKGAIVAVVYTTYVPFPGTVQYD 405 Query: 367 --ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I A L+ C S Y + + Q++ Sbjct: 406 RLIGPFAPNISPN-LQGCASQ-GYFFTASEPGDITRGMQSL 444 >gi|254486311|ref|ZP_05099516.1| conserved hypothetical protein [Roseobacter sp. GAI101] gi|214043180|gb|EEB83818.1| conserved hypothetical protein [Roseobacter sp. GAI101] Length = 476 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 58/388 (14%), Positives = 113/388 (29%), Gaps = 118/388 (30%) Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + + N F + V + + + QF+ + +PAL + E + I Sbjct: 99 SSVTIENGLNFRTVTVKANNEMKTQFLGRFGFPTLD------VPALSKAEERVEKV-EIS 151 Query: 195 LVVDLSGSM--HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS------ 246 LV+D+SGSM + + + + + + TK + +L+ + + +++ Sbjct: 152 LVLDVSGSMKNNSKLTTMKDAAKTFIDTVLRPETK-NNVSLSLIPYSEQVNVGPDIFNAL 210 Query: 247 --------------HVKEDVYMGLI-GYTTRVEKNIEPSWGTEKVRQ------------- 278 V + G+ ++ W T + Sbjct: 211 WVDTRHDFSYCIDVPDGHFVQTQMTPGFPW--DQTQHFQWNTYSIESGYQQNTLHDTVCP 268 Query: 279 --------YVTRDMDSL--------ILKPTDSTPAMKQ-------AYQILTSDKKRSFFT 315 +++D SL T MK +++ T+ Sbjct: 269 RAVYERVRPISQDGPSLKAQIDLFQPRAGTAIYMGMKWGTALLDPSFRETTASLVSDSVV 328 Query: 316 N--FFRQGVKIPSLPFQKFIIFLTDGENNNFK---------------------------- 345 F + K I+ +TDG+N+N + Sbjct: 329 ESTFADRPADYSDRETLKTIVLMTDGQNSNSQRISTAYYNSSSEVVHWSKWNFNYYLSQY 388 Query: 346 ------------------SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 N IC AK+ I I TI + G ++K C SSP Sbjct: 389 IKEKDWHRYYYTRYTAEKGNTLMDNICSAAKDEGIVIWTIGFEVNDTGADVMKKCASSPS 448 Query: 388 YHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + V L F I+ + + + Sbjct: 449 HFFRVEGV-ELTDAFSAIASQINQLRLT 475 >gi|291514853|emb|CBK64063.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 328 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 44/240 (18%), Positives = 74/240 (30%), Gaps = 64/240 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+SGSM D K ++ A K F+ + Sbjct: 88 IDIMLAIDVSGSMLAR---------------DFKPDRITAAKEVAGSFIA-------DRY 125 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + ++ + R LI T + A L Sbjct: 126 GDRIGLVAFAGEAFTQSPLTTDQSTLQTLLARIRSGLIEDGTAIGNGLATAINRLRESDA 185 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 +S K II LTDG NN + T + AK I++ TI + Sbjct: 186 KS------------------KVIILLTDGVNNQGQIAPMTAA--EIAKAQGIRVYTIGVG 225 Query: 370 ---NAS-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 A +++LK + ++ + + L V+ I+QL Sbjct: 226 TEGMAPYPAIDMFGNLTFVNQKVEIDEKVLKAISDMTGGRYFRATDKEKLKAVYDEINQL 285 >gi|84515372|ref|ZP_01002734.1| hypothetical protein SKA53_01901 [Loktanella vestfoldensis SKA53] gi|84510655|gb|EAQ07110.1| hypothetical protein SKA53_01901 [Loktanella vestfoldensis SKA53] Length = 485 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 67/473 (14%), Positives = 150/473 (31%), Gaps = 86/473 (18%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 E + I+ L +++ L++ G + + + ++ +++S ++ A+LA AS L++ Sbjct: 27 RDEDGSVIIMTILLLVTMLIMGGMAVDFMRYEARRATLQSVSDRAVLAAAS-----LNQT 81 Query: 76 GDRFESISNHAKRALIDDAKR----FIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 D + + ++ +A +A + N +++ SA+ NT + R+ T Sbjct: 82 LDSRDVVEDYFAKAGFPNALVGAPIVVDNGNSRTVTVRSALDVNTFYLRLAGMDRL--TA 139 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYR------------LQFIEHLLNQRYN-QKIVSFIP 178 A + + ++ + S R + F E +L+ ++ IP Sbjct: 140 PARSSATEGVGKVEISLVLDISGSMRFSNRFVNMQAAAIAFAEEVLDPANGGTVSLTIIP 199 Query: 179 ALLRIEMGERPIFLI-------ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 G + L+ G + + V+S + A L Sbjct: 200 YAGATNPGPEMFAFMGGVRYPDTLLAGDDGILGTEDDYFFPQVSSCVEMVGSDWSS-AGL 258 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 A + + + + G + + + R ++ + Sbjct: 259 PGAGRAQVPHFQVWDIARSVMDWGWCPQDRSSIQYAMAT--PAQARSFINGLR---MHDG 313 Query: 292 TDSTPAMKQAYQILTSDKKRSFFT-----------NFFRQGVKIPSLPFQKFIIFLTDGE 340 T + AMK A L + +F F + +K I+ +TDG+ Sbjct: 314 TGTHYAMKYALATLDPSSQPAFMHLSHPGRGLVPPQFANRPAAWDDPETKKIIVLMTDGD 373 Query: 341 ------------------------NNNFKSN---------VNTIKICDKAKE--NFIKIV 365 N IC A + + + Sbjct: 374 ITQQERPRIAQQERDIDYIISRSINGRDNRGQFVDAATNVGRFEAICTLANQPARSVDVY 433 Query: 366 TISINA-SPNGQRL-LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 T++ + L ++ C S P + + LI VF I++ + + ++ Sbjct: 434 TVAFEVQPNSAADLQMRNCASDPSMFFR-TSGAELIDVFSGIAERITDLRLNL 485 >gi|270158235|ref|ZP_06186892.1| von Willebrand factor type A domain protein [Legionella longbeachae D-4968] gi|289163509|ref|YP_003453647.1| hypothetical protein LLO_0165 [Legionella longbeachae NSW150] gi|269990260|gb|EEZ96514.1| von Willebrand factor type A domain protein [Legionella longbeachae D-4968] gi|288856682|emb|CBJ10493.1| putative unknown protein [Legionella longbeachae NSW150] Length = 342 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 48/247 (19%), Positives = 90/247 (36%), Gaps = 55/247 (22%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + R + I + +DLSGSM I + +++ +K+A F+ Sbjct: 79 VGAPKPVSREGYNIMMALDLSGSME----------IPDMILHGRPTSRLNIVKSAAEQFV 128 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 +GLI + TR ++ + + L K T A+ Sbjct: 129 RE-RSGD------KIGLILFGTRAYLQTPLTYDRHSILLRLEDATAGLAGKTTSIGDAVG 181 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + L S K+ + II LTDG NN+ + +K + AKE Sbjct: 182 LAVKRLDSAPKKG------------------RVIILLTDGANNSGV--LAPLKAAELAKE 221 Query: 360 NFIKIVTISINASPNGQR-----LLKTCVSS-------------PEYHYNVVNADSLIHV 401 IKI TI + + + + L+++ + ++ + +SL + Sbjct: 222 EGIKIYTIGLGSEGDSRALVGDFLMQSPAADLDEETLKKMSDMTGGRYFRATDTESLHLI 281 Query: 402 FQNISQL 408 ++ I+QL Sbjct: 282 YKTINQL 288 >gi|313159758|gb|EFR59115.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 330 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 41/242 (16%), Positives = 72/242 (29%), Gaps = 66/242 (27%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+SGSM D + ++ A K F+ + Sbjct: 88 IDIMLAIDVSGSMLAR---------------DFRPDRITAAKEVAGSFIA-------DRY 125 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + ++ + R LI T + A L + Sbjct: 126 GDRIGLVAFAGEAFTQSPLTTDQGTLQTLLARIRSGLIEDGTAIGNGLATAINRLRESEA 185 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 +S K II LTDG NN + T + AK I++ TI + Sbjct: 186 KS------------------KVIILLTDGVNNRGEIAPQTAA--EIAKAQGIRVYTIGVG 225 Query: 370 ---NAS--------------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 A + + L + ++ + L ++ I+ Sbjct: 226 TEGMAPYPAVDIYGTPTGGTVMAKVEIDEKTLRSIAEQTGGQYFRATDKAKLKAIYDQIN 285 Query: 407 QL 408 QL Sbjct: 286 QL 287 >gi|307943467|ref|ZP_07658811.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] gi|307773097|gb|EFO32314.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] Length = 466 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 78/496 (15%), Positives = 148/496 (29%), Gaps = 139/496 (28%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 YF K + E+ + + A + L++ G + ++ + +A +AA L A ++ Sbjct: 8 SYFHKFGSDERGSLLPLVAGVCLILLVVAGSAVDYGRALGYRHKIANAVDAAALTVAKQL 67 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 + + + N + N + Sbjct: 68 STTVLTENQIRTGLKNAFR----------------------------------ANLNAAG 93 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 + + LD + +DV +S D + FI K+ P L + + Sbjct: 94 INSQGIDNLDFKVDPGEGTLDVWSSVDIQTNFI----------KLGGIGPEKLEVGAASQ 143 Query: 189 ---PIFLIEL--VVDLSGSMHCAMNSDPEDVNSAPI--CQDKKRTKMAALKNALLLFLDS 241 F +EL V+D++GSM MN+ E S D ++ + ++ +L+ + Sbjct: 144 VNYSRFDVELALVLDVTGSMRPDMNALKEASKSIVNILLPDDSNSRESKVRISLVPYSQG 203 Query: 242 IDLLSHVKEDV------------YMGLIGYTTRVE------------KNIEPSWGTEKVR 277 ++L S+ G +T K + W Sbjct: 204 VNLGSYATRVTNGGSTWRNCVNERSGPQKFTDAPYNYAGSRSDFFHGKPKQFVWDYGWTE 263 Query: 278 QYVTR-------DMDSLILKPTDSTPAMK---------------QAYQILTSDKKRSFFT 315 Q+ TR ++ L T A+ + L+ K + Sbjct: 264 QWQTRPEACPKTAVEPLTADRTKLLRAISGLKDGGGTGGQTGIAWGWYTLSPKWKNLWPR 323 Query: 316 NFFRQGVKIPS--LPFQKFIIFLTDGENNNFKSN-------------------------- 347 + S +KF + +TDG+ N Sbjct: 324 DSAPATYGTGSHTDDTKKFALIMTDGDFNAAYGWDCGCRKIRDKPLYCRKKSNKKSWIER 383 Query: 348 ------------VNTIKICDKAKENFIKIVTISINASPN--GQRLLKTCVSSPEYHYNVV 393 K+CD+ K I+I T+ + G L+ C S +Y Sbjct: 384 YFSPSKISHAPAQRAKKLCDEMKSKNIEIFTVYFDTGGATFGDDLMSYCASGSRNYYRAD 443 Query: 394 NADSLIHVFQNISQLM 409 N++ LI F NI+ + Sbjct: 444 NSNELIQAFSNIANEI 459 >gi|254450361|ref|ZP_05063798.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|254450938|ref|ZP_05064375.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198264767|gb|EDY89037.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198265344|gb|EDY89614.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 75 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 1/74 (1%) Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 N +++ IC A+ + I T++ A GQ L+ C SSP +H++V N + Sbjct: 2 NGTEADARLSDICAAARAQGVVIYTVAFEAPSGGQSALQDCASSPSHHFDV-NGTDISSA 60 Query: 402 FQNISQLMVHRKYS 415 F I+ + K + Sbjct: 61 FSAIASDIRALKLT 74 >gi|323138635|ref|ZP_08073702.1| hypothetical protein Met49242DRAFT_3090 [Methylocystis sp. ATCC 49242] gi|322396123|gb|EFX98657.1| hypothetical protein Met49242DRAFT_3090 [Methylocystis sp. ATCC 49242] Length = 547 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 81/237 (34%), Gaps = 26/237 (10%) Query: 200 SGSMHCAMNSD--PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 +GS+ + + P ++ ++ K+ + ++ + +++ +++ G + Sbjct: 317 TGSLGGSFGPSDFKGSSSFCPDPGTQRILQLTQKKSDVQNKINQL--VANGATNLHEGFM 374 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS-----TPAMKQAYQILTSDKKRS 312 + R P G + R + + +S A Y L Sbjct: 375 -WGWRTLSPNAPFSGGRAYQAPKNRKIMVFMTDGFNSWNSRVNTATGSTYDTLGYYSYNG 433 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--------SNVNTIKICDKAKENFIKI 364 F G + + ++ L NN+ + T + C AK I++ Sbjct: 434 AENERFPDGSQGNGVNYRS---LLAAAANNSSSYQTISRAMQDELTRQACTNAKTAGIEV 490 Query: 365 VTISINAS-----PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 TI + S G L+K C ++ ++++ +A L F I + + S+ Sbjct: 491 FTIGFSVSGDPIDAQGLALMKECATNEDHYFKAEDASQLNAAFSQIGIGLGKLRLSL 547 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 46/304 (15%), Positives = 110/304 (36%), Gaps = 57/304 (18%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESIS 83 +IF LS+M +L++G + + +++ A ++A+L ASKM ES + Sbjct: 1 MIFGLSLMPVMLMLGATADYTRFTTTRAALQQAADSAVLTVASKMT----------ESTT 50 Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 N + D A+ + + + + + + + + + ++++++ Sbjct: 51 NAQAK---DQAQVVLNAQPRMTTAIVTGATVSEDKRTVCATAKVTI------------QN 95 Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM 203 F M + + ++ +L + G + I LV+D SGSM Sbjct: 96 SFMQMAQLATLTPTVKSCANLAG---------------GADPGT--TYEIALVLDNSGSM 138 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + + + ++K++ LK+A F+D++ S+ + + V Sbjct: 139 NSSSDG---------------QSKISILKSAANSFVDTMFSKSNNVKFSVVPFSSGVAAV 183 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + + S V + + + A + DK ++ + +G Sbjct: 184 DPSEPSSRNAAWVDKNGANSQHWIAFGGKTAANAAGFTSRFDIFDKLKARNSALDWRGCF 243 Query: 324 IPSL 327 P + Sbjct: 244 EPQV 247 >gi|163747459|ref|ZP_02154811.1| hypothetical protein OIHEL45_00415 [Oceanibulbus indolifex HEL-45] gi|161379312|gb|EDQ03729.1| hypothetical protein OIHEL45_00415 [Oceanibulbus indolifex HEL-45] Length = 476 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 16/74 (21%), Positives = 34/74 (45%), Gaps = 1/74 (1%) Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +++ ICD AK I I +I + +G ++K C SS + + V ++ Sbjct: 403 TAGQADNLLDNICDAAKAKGIVIWSIGFEVTDHGAAVMKNCASSDSHFFRVEGV-EIVDA 461 Query: 402 FQNISQLMVHRKYS 415 F+ I++ + + + Sbjct: 462 FEAIARQINQLRLT 475 >gi|331006778|ref|ZP_08330044.1| BatA [gamma proteobacterium IMCC1989] gi|330419396|gb|EGG93796.1| BatA [gamma proteobacterium IMCC1989] Length = 364 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 46/235 (19%), Positives = 86/235 (36%), Gaps = 48/235 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM ++ T++AA+K + F+D ++ Sbjct: 95 DLLLAVDISGSM----------QQEDMQINNRPATRLAAVKKVVSDFIDQ-------RQG 137 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ T+ V Q++ K T A+ + + L + Sbjct: 138 DRIGLILFGTQAYLQTPLTFDTQSVNQFLQEAQLGFAGKDTAIGDAIGLSVKRLKNQSSA 197 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 S K II LTDGEN V ++ A++ KI T+ I Sbjct: 198 S-----------SAKPSNSKVIILLTDGENTA--GEVEPLQAAKLAEKIGAKIYTVGIGA 244 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +AS + + L ++ ++ N L +++ + +L Sbjct: 245 DEMIVRGFFGNRRVNPSASLDEETLTAIANTTGGLYFRARNTQELNNIYSELDKL 299 >gi|254466920|ref|ZP_05080331.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] gi|206687828|gb|EDZ48310.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] Length = 550 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 16/75 (21%), Positives = 35/75 (46%), Gaps = 1/75 (1%) Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 N + +T ICD K+ + + ++ A G ++L+ C SSP + ++V + Sbjct: 476 NYRVAKDQHTKTICDITKDQGVIVYSVGFEAPSAGIKVLEDCASSPAHFFDVEGL-EISD 534 Query: 401 VFQNISQLMVHRKYS 415 F +I+ + + + Sbjct: 535 AFSSIATSIRQLRLT 549 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 53/365 (14%), Positives = 115/365 (31%), Gaps = 56/365 (15%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA-- 65 + + E + + ++ L + G + ++ + ++ + A+LA A Sbjct: 20 LSKVRSFLREEDGVLAKPMIGTFLAMLAVGGIGVDLMRMERDRTELQYTLDRAVLAAADL 79 Query: 66 -------SKMVSNLSRLG--------DRFESISNHAKRALIDDAKRFIKNHIKESLSGYS 110 + ++ L++ G D + + + A ID F +K + Sbjct: 80 DQSLDADAVVLDYLTKAGLEQYYSDPDDQKGLGYKSVEATID--TDFEAYLLKFAGGDNM 137 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 +++ N+ + I+ S ISM + ++S N + + N + Sbjct: 138 SLYANSRAEEIIGSVEISMVLDISGSMNSGNRLVNLQAAAKSFVTQITS------NTDVS 191 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 +S IP ++ GE+ + V +N + + + Q++ Sbjct: 192 NLSISIIPYATQVNAGEK-LLSKYTKVSQEHDYSYCVNFIKDQFSKHTLNQNE-----DL 245 Query: 231 LKNA-LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ A F S++++ G + + K+ Y+ S Sbjct: 246 IRTAHFDTFTYSMNMIDRPVCPTRPGS--------AILPFTNDAAKLHAYIDSLTAS--- 294 Query: 290 KPTDSTPAMKQA-----------YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 T MK L DK S + K II ++D Sbjct: 295 GNTSIDIGMKWGSALLDPTAQPVVNALVDDKVISENFRGRPKAYGSGDTL--KIIILMSD 352 Query: 339 GENNN 343 G+N N Sbjct: 353 GQNTN 357 >gi|323136144|ref|ZP_08071226.1| hypothetical protein Met49242DRAFT_0613 [Methylocystis sp. ATCC 49242] gi|322398218|gb|EFY00738.1| hypothetical protein Met49242DRAFT_0613 [Methylocystis sp. ATCC 49242] Length = 652 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 54/186 (29%), Gaps = 54/186 (29%) Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + V + M+ + T + + A++ L+ K + + + Sbjct: 465 LTNNLSTVTAAIDS-MNYWLNGGTVISEGLMWAWRTLSPQKPYADGAAYTDKKT------ 517 Query: 329 FQKFIIFLTDGENNNFKS--------------------------------------NVNT 350 +K I+ +TDG N + + Sbjct: 518 -KKVIVLMTDGVNGLADNGNAASANISDYSAYGYMGASRLSVADGVTTYAGLQTFLDDRL 576 Query: 351 IKICDKAKENFIKIVTISIN--------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 K CD AK I I T+ N LL C S PEY + ++ +L F Sbjct: 577 KKACDNAKAKGISIYTVMFNHNGFLSATEQARSATLLSYCASKPEYAFLATDSAALNSAF 636 Query: 403 QNISQL 408 I+ Sbjct: 637 GQIASS 642 >gi|319780897|ref|YP_004140373.1| hypothetical protein Mesci_1159 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317166785|gb|ADV10323.1| hypothetical protein Mesci_1159 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 492 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 72/467 (15%), Positives = 145/467 (31%), Gaps = 78/467 (16%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 NF+I+F LS L +GF + V + ++S++ +AA+ + A + + + Sbjct: 16 RDRGGNFAILFGLSASVLALAVGFSVNVSQLYNARSSLQGVVDAAVTSTARDLTTGAIKE 75 Query: 76 GDRFESISN----------------HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 D +S+ R +++ + ++ + Y +F +++ Sbjct: 76 ADANKSVQAFLDANSQAGILQADQIVLDRLIVNRTAKTVQADAHVDVGLYFPIFGTGDMK 135 Query: 120 NIVNSSRI---SMTHMANNRLDSSN-----NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 + S+ T LD + + D+ T+ +Q + + + + Sbjct: 136 RVAASTTALYSDKTVEVAMMLDITGSMAKRGKVDKIGDLKTAAKNAVQTMLQKQDPQNPR 195 Query: 172 KIVSFIPALLRIEMGE-------RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 V+ +P + G+ EL ++GS + + S Sbjct: 196 IRVAIVPYASGVNAGKLAENVYAEKQASTELP-PVAGSPLLVAKTGKNLLPSFSDYISIV 254 Query: 225 RTKMAALKNALLLFLDS---IDLLSHVKEDVYMGLIG--YTTRVEKNIEPSWG------- 272 M N D D+ + + V G + V ++ G Sbjct: 255 GAAMPRPDNCATERKDKNGNADMSADGPDTVRTDGNGKKFYALVNRDDHLGDGDMNRCPD 314 Query: 273 ------TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 T + D T A++ Y +L+ + + +G Sbjct: 315 AKVIPLTADSDALLESIEDFRANGFTAGAIAIQWTYYMLSPQWRTAIRNAGLGKGASDAD 374 Query: 327 -LPFQKFIIFLTDGENNNFKSN-------------VNTIKICDKAKENFIKIVTISINAS 372 K I +TDG+ N + N +CD K + I+I TI + Sbjct: 375 PKKIAKVAILMTDGQFNTAFAGAGDSYNRQGTLARGNAETLCDNMKNDGIEIFTIGFDLD 434 Query: 373 PNG---------QRLLKTCVSSPE-----YHYNVVNADSLIHVFQNI 405 + +LK C S + ++V L FQ I Sbjct: 435 DKDMSTTERDQAKAVLKDCSSKDTSGAKRHFFDVSTGAELDDAFQEI 481 >gi|296445280|ref|ZP_06887239.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] gi|296257235|gb|EFH04303.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] Length = 575 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 35/81 (43%), Gaps = 5/81 (6%) Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINAS-----PNGQRLLKTCVSSPEYHYNVVN 394 + + + T++ C AK ++I TI + G ++++ C + +++ + Sbjct: 494 DTSRNMQDELTLEACTNAKTAGVEIYTIGFSVPVDPIDAQGLKMMQDCATDANHYFAATD 553 Query: 395 ADSLIHVFQNISQLMVHRKYS 415 DSL F +I + + S Sbjct: 554 VDSLNAAFASIGSGVGKLRLS 574 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 43/261 (16%), Positives = 90/261 (34%), Gaps = 69/261 (26%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 ++ N ++IF LS + +L++G + K++++ A ++A LA A +V+ + Sbjct: 20 DRGNVAVIFGLSFIPLVLMLGAGVDYGRAVSTKSNLQQATDSAALAVAKTIVATTT---- 75 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 +A + +AV T+ + + + + A Sbjct: 76 ----NQQAQSQAQVYLLTNV-----------RNAVAVVTKAEISADRLTLCLDSTAQIPT 120 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 ++ + + I + + G + I LV+ Sbjct: 121 T-------------------------IMKIAHIETITTKATTCAQTPGGMNGTYEIALVL 155 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D SGSM + ++K+AAL++A F+++I DV M ++ Sbjct: 156 DNSGSMSKSAGG---------------KSKIAALRDAATSFVNNIYS---KTTDVKMSIV 197 Query: 258 GYTT-------RVEKNIEPSW 271 ++ V N SW Sbjct: 198 PFSAGVRVLDPSVSSNRTLSW 218 >gi|260434111|ref|ZP_05788082.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] gi|260417939|gb|EEX11198.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] Length = 600 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 22/75 (29%), Positives = 37/75 (49%), Gaps = 1/75 (1%) Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD-SLIH 400 N + + IC KAK+ I I +I+ +A + LLK CVS +Y + D +I Sbjct: 525 NQVQKDPRLTSICQKAKDEKIIIFSIAFDAPDGVKPLLKGCVSDDGAYYEAKDNDKDIIS 584 Query: 401 VFQNISQLMVHRKYS 415 VF +I + + + + Sbjct: 585 VFSSIGSTIQNLRLT 599 >gi|114571147|ref|YP_757827.1| hypothetical protein Mmar10_2603 [Maricaulis maris MCS10] gi|114341609|gb|ABI66889.1| conserved hypothetical protein [Maricaulis maris MCS10] Length = 520 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 41/177 (23%) Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 T R + D T+ + ++++ + + + + K Sbjct: 350 TSTERTVLNAIGDMGASGTTNIPNGVGWGIRLISPGAPFTEGSAWD-------DDEYIKA 402 Query: 333 IIFLTDGEN---------------------------NNFKS------NVNTIKICDKAKE 359 ++ LTDG+N ++ + + T C A+ Sbjct: 403 MVILTDGDNVMRGRNTDQMSDYEAYGFVADGRLGRRSSSSNVLSNELDDRTEAACAYARS 462 Query: 360 NFIKIVTISINASPNG-QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I++ TI+ + + + L++ C S+P +++ ++++L F+ I+ + + + S Sbjct: 463 LGIRVYTITFQVNSSSTRSLMQNCASNPSLYFDSPSSEALEDAFEMIAGDLTNLRLS 519 >gi|323136279|ref|ZP_08071361.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322398353|gb|EFY00873.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 577 Score = 70.7 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 60/160 (37%), Gaps = 18/160 (11%) Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 ++ T VR+ + D +++ Y+ L G + + + Sbjct: 421 AYNTAGVRKILVLMTDGYNNWTSNTNTVGGSYYEALGYYSYNGAKNRRLPDGTQGNGVDY 480 Query: 330 QKFIIFLTDGENNNFKS---------NVNTIKICDKAKENFIKIVTISINAS-----PNG 375 Q + DG N++ + T + C+ AK I+I +I+ + S G Sbjct: 481 QSQL----DGAANSWTDYKSVSRQAQDELTRQSCENAKAKGIEIYSIAFSVSTNPIDAAG 536 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 LLK+C ++ +++ ++ + F I+ + + S Sbjct: 537 INLLKSCATNADHYLLATDSTQIDRAFSQIAMNLSKLRLS 576 >gi|296120496|ref|YP_003628274.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296012836|gb|ADG66075.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 396 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 61/411 (14%), Positives = 130/411 (31%), Gaps = 57/411 (13%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 + +I+ A +++ L L GF + + + + +A +AA + ++V S Sbjct: 18 RRGAIAILAAFVMVALLALAGFFLSLSYVELTRAELRAATDAAARSAVIRLVETQS---- 73 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 + + A D A RF SL+ F + Q+ S ++ N Sbjct: 74 -----TTSGRAAARDIASRFEVGGKALSLNDNDIQFGRSTRQS-NGSYSFAINGTPTNAA 127 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 + V+ R + I +V+ Sbjct: 128 RVFGRKT--KTSAAGPVELPFGGFVGAPEYSTELNAVAM-----------RLDYDIVIVL 174 Query: 198 DLSGSMHCAMNSD-------------PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 D SGSM ++ E+ S P + ++A N L L+ + Sbjct: 175 DRSGSMGWDLSGVEFEYPEAVRQRPLVENYFSPPDPTGSRWAILSASVNDFLTILNQRQV 234 Query: 245 LSHVKEDVYMGLIGY----TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + V Y G + + ++ + + + +T ++ TD + Sbjct: 235 AARVGLVTYAGDYTFGKYSSVKLTVESDLTSTFSTITSKLTAIGQVPLIGGTDIGAGITA 294 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A +LT+ + +K II +DG N V+ ++ Sbjct: 295 AQTMLTTSSQA---------RLKTGQP----IIIVFSDGMFNQGTEPVSLAA--SAYSQS 339 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYN--VVNADSLIHVFQNISQLM 409 I +++ A+ G+ + + ++ + A L F++I+ + Sbjct: 340 STIIHSVTFGATAQGRATMNSVTATAGKGLSLHANTAAELAESFRSIANAI 390 >gi|84688081|ref|ZP_01015939.1| hypothetical protein 1099457000215_RB2654_05415 [Maritimibacter alkaliphilus HTCC2654] gi|84663909|gb|EAQ10415.1| hypothetical protein RB2654_05415 [Rhodobacterales bacterium HTCC2654] Length = 595 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 40/83 (48%), Gaps = 3/83 (3%) Query: 334 IFLTDGEN--NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 FL D + N + N N +IC AK + + TI S + ++++C S+P Y+++ Sbjct: 512 NFLADAHDYFNYSEKNDNLDEICTAAKNAGMVVFTIGFEVSGSQHDIMRSCASAPAYYFD 571 Query: 392 VVNADSLIHVFQNISQLMVHRKY 414 V + + F I++ + + Sbjct: 572 V-DGLDISAAFAAIAREISKLRL 593 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 40/117 (34%), Gaps = 8/117 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + E+ +F I M L G + + + + ++ + AILA AS L + Sbjct: 22 LRDERGSFIIFGIAVFMLMCLAGGIAVDTMRYETHRVHVQGTLDRAILAAAS-----LDQ 76 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 D E + ++ +A + I + + ++ S++ Sbjct: 77 DLDPEEVVLDYFTKAGLGHV---ISQDDIDVFENQTNGEVADDVAVTTRRVEASVSA 130 >gi|323135758|ref|ZP_08070841.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322398849|gb|EFY01368.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 588 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 37/80 (46%), Gaps = 5/80 (6%) Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINAS-----PNGQRLLKTCVSSPEYHYNVVNA 395 + T++ C AK +++ TI + S G LLK+C ++ ++++ V NA Sbjct: 508 TARDAQDELTLQACTNAKNAGVEVFTIGFSTSTDPIDAQGLELLKSCATNVDHYFAVENA 567 Query: 396 DSLIHVFQNISQLMVHRKYS 415 + L F +I + + S Sbjct: 568 NQLNAAFSSIGIGLGKLRLS 587 Score = 43.3 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 80/238 (33%), Gaps = 57/238 (23%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K A E N ++F L ++ + ++G + ++++ A + A+L ASK+ + Sbjct: 14 KSFGADESGNVGMVFGLGLVPVMFMLGATADYTRYATTRSALRQATDVAVLTVASKLTAT 73 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 ++ +A A+ + + S + + T Q +S +++ Sbjct: 74 T----------TDAQAKA---QAQVILNAQPRMSTASITTASIATTKQTFCATSEVTI-- 118 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 F M +TS + L + Sbjct: 119 ----------QNSFMQMARVTSLTPSVTSCADLAW-----------------GANPNATY 151 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + LVVD SGSM D TK++ALK A F+D++ + + Sbjct: 152 EVALVVDNSGSMLS---------------SDGSVTKISALKTAAKSFVDTMFAKAPDR 194 >gi|254786433|ref|YP_003073862.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687231|gb|ACR14495.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 347 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 42/238 (17%), Positives = 81/238 (34%), Gaps = 58/238 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM + QDK+ ++ +K + F++ +E Sbjct: 92 DLLLAVDISGSM----------KTPDMVVQDKQIARILVVKYVVNEFIER-------RES 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ + V + + T A+ A + L Sbjct: 135 DRLGLILFGSQAYLQAPLTFDRKTVSTLLDEAQLGFAGEQTAIGDAVGLAIKRLRERPAS 194 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ +I LTDG N + D AK+ IKI T+ + A Sbjct: 195 ------------------QRVLILLTDGANTAGEVAPRQAA--DLAKQAGIKIYTVGVGA 234 Query: 372 SPNGQRL--------------------LKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 QR+ L+ ++ ++ N L +++ + +L Sbjct: 235 DQMEQRMGLFGGFSRTVNPSSDLDEDTLRYMAETTGGLYFRARNPQELQAIYEELDKL 292 >gi|260466792|ref|ZP_05812977.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259029404|gb|EEW30695.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 492 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 73/472 (15%), Positives = 148/472 (31%), Gaps = 88/472 (18%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 NF+++F + L GF + + + K+S++ +AA+ + A + + + + Sbjct: 16 RDRGGNFAVLFGFAASVLALAAGFSVNISQLYNAKSSLQGVVDAAVTSTARDLTTGVIKE 75 Query: 76 GDRFESISN----------------HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 D S+ + ++D + ++ ++ ++ Y +F ++Q Sbjct: 76 ADADNSVKAFLVANSAAGILQPDQVVLDKLIVDKTAKTVQANVHVDVALYFPLFGIGDMQ 135 Query: 120 NIVNSSRI---SMTHMANNRLDSSN-----NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 + S+ T LD + + D+ + +Q + + + + Sbjct: 136 RVAASTTALYSDKTVEVAMMLDITGSMAKRGNVDKIGDLRAAARNAVQTMLQNQDPKRPR 195 Query: 172 KIVSFIPALLRIEMGE---------------RPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 V+ +P + G+ P+ L+V +G SD + Sbjct: 196 IRVAIVPYASGVNAGKLAENVYAETQGSSELPPVAGSSLLVAKTGKALLPSFSDYISIVG 255 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG--YTTRVEKNIEPSWG-- 272 A + T KN DL + + V G Y V ++ G Sbjct: 256 AAMPHPDNCTTERKNKN------GDADLSADGPDTVRTDRNGKKYYALVNRDDHLDGGGM 309 Query: 273 -----------TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 T + D T A++ Y +L+ + + G Sbjct: 310 NRCPDAEVIPLTADSDALLDSIDDFRAAGYTAGAIAIQWTYYMLSPQWRAAIKNVGLGNG 369 Query: 322 VKIPSLP-FQKFIIFLTDGENNNFKSN-------------VNTIKICDKAKENFIKIVTI 367 + K I +TDG+ N + N +C K + I+I TI Sbjct: 370 ASDANAKKIAKVAILMTDGQFNTAFAGAGGSYNGQGDLARGNAEALCGNMKNDGIEIFTI 429 Query: 368 SI-------NASPNGQ--RLLKTCVSSPE-----YHYNVVNADSLIHVFQNI 405 +A+ Q +LK C S +++ L FQ I Sbjct: 430 GFDLNDKDMSATERDQAKAVLKGCSSKDASAAERHYFEASTGAELDAAFQEI 481 >gi|85859126|ref|YP_461328.1| von Willebrand factor type A domain-containing protein [Syntrophus aciditrophicus SB] gi|85722217|gb|ABC77160.1| von Willebrand factor type A domain protein [Syntrophus aciditrophicus SB] Length = 447 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 57/359 (15%), Positives = 113/359 (31%), Gaps = 81/359 (22%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I ++K IIFAL ++ L + V W+ ++ + + +A +AGA + + Sbjct: 5 IKNQKGAVLIIFALLLIVLLGFTALAVDVGRWYTTRSELSKSVDAGAIAGAKNISNPY-- 62 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 L +D + + +SA + T Sbjct: 63 ---------------LGEDGHLRLAEEVAR--ENFSAGYLMTP----------------- 88 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 DS + + + R++ V ++ + + ++ I Sbjct: 89 ---DSGERSATFTAYADEDHRIRVEGTVSSPGNLAGLFGVDWVATS-AMGVAKKNEVEIM 144 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM T M LK A F+ + ++ M Sbjct: 145 LVLDRSGSMDG--------------------TPMNDLKKAARSFVSFFEE---TQDQDKM 181 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 GL+ + T V+ ++ G V ++ + T++ ++ QA Sbjct: 182 GLVSFATSVKVDVPL--GNNYVSSMTSKINAMDAVGATNAEDSLSQAGNP--------AK 231 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-------NVNTIKICDKAKENFIKIVT 366 Q + Q+F+IF +DG F+ + +C + + T Sbjct: 232 GGLTDQSGVPGNKRVQQFVIFFSDGNPTAFRGKFKYNGTDNIDAVVCGTGNDCG-TVYT 289 >gi|307941972|ref|ZP_07657325.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307945282|ref|ZP_07660618.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307771155|gb|EFO30380.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307774878|gb|EFO34086.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 412 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 63/445 (14%), Positives = 154/445 (34%), Gaps = 69/445 (15%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 + + + + + K + +I FAL L I I + + K++ + + L Sbjct: 1 MYTGIKNRIQALKGNIKGSIAIPFALLATLILAAISVGIDMSFAYNKRDQSQLVADEVSL 60 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALID-DAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 + ++ +S + R + DA++F+ K SL G + F NI Sbjct: 61 FAVTTFRKYVAD------GMSKNQARKRAETDARKFLTARTK-SLDGTTEKFSI--KINI 111 Query: 122 VNS----SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 V+ + ++ + ++ F N+D + + F + +Y + + Sbjct: 112 VDREAKVVKANVNISGKHESYMTHAMGFDNIDYTADSESTISFGQG----KYEFIFLVDV 167 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK-KRTKMAALKNALL 236 + I R +++ ++ C +S + R ++ +K+AL Sbjct: 168 SPSMGIGASNRDRQIMQ------RAIGCQFACHEPWYSSVSRAKSAGARLRIDVVKDALK 221 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM---DSLILKPTD 293 + ++ + V D+ GL ++ + + G K ++ + + L T+ Sbjct: 222 SLVTQLEEATEV--DLRTGLYSFSNYLHIQTGLNKGISKFKREANKIAIHREYLRGGGTN 279 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN---------- 343 N + +K P ++ II ++DG N+ Sbjct: 280 FHGVFS--------------DFNGVLRSLK-PKADVKQHIIIISDGVNHLNLRSGTNRHL 324 Query: 344 FKSNVNTI--------KICDKAKENFIKIVTISINASPNGQ------RLLKTCVSSPEYH 389 + N + CD+ K+ ++ V + ++ C +S ++ Sbjct: 325 WNQTPNWRPYNYSFNPRWCDEFKKGEVRTVHTMLVEPDRAHYVRASTSSMRACATSADFF 384 Query: 390 YNVVNADSLIHVFQNISQLMVHRKY 414 Y+ +A + F+++ + ++ Y Sbjct: 385 YSANSAAEIDKAFKDLFEALLKSVY 409 >gi|315499132|ref|YP_004087936.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315417144|gb|ADU13785.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 519 Score = 69.9 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 49/345 (14%), Positives = 111/345 (32%), Gaps = 37/345 (10%) Query: 81 SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 + + +D + +++ + Y + ++ + T A R+ +S Sbjct: 199 RLPASTSYSFVDYTQCYLRTSVNYYACRTVWRAYEALCDQVGSTICATATSKAFYRVYTS 258 Query: 141 NNTIFYNMDVM----TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 +Y +M + Y L + ++S + + L G + Sbjct: 259 GGVTYYAASLMGYAKSGSKYTLYTYNLTTTVNASTLVMSTVSSGLTTSTGTSVAYSPSGY 318 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 S S ++ + D+ ++ + + V + Y+ Sbjct: 319 TAFSSSYFSSLATSSWG----GCLTDRNQS----------FDVSVAPYATDVVDSNYIAA 364 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 TT + K ++ + V Y++ T+ T ++ ++L+ Sbjct: 365 SCSTTALAKVLDLTSDFTSVNTYLSSL---SPGGNTNITLGVQFGMEMLSP-------AE 414 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGEN-------NNFKSNVNTIKICDKAKENFIKIVTISI 369 + + +K++I +TDG N +N N T C AK I + + + Sbjct: 415 PYTKATAFGDTDVKKYMIIVTDGANTQNRWSTSNSAINARTALACTAAKAQGITLFVVRV 474 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 LL+ C S Y+Y++ A L Q+I + + Sbjct: 475 E--DGDSSLLEACASQSSYYYDLSQASDLTKTMQDIFATINKLRL 517 >gi|86749514|ref|YP_486010.1| hypothetical protein RPB_2394 [Rhodopseudomonas palustris HaA2] gi|86572542|gb|ABD07099.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 456 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 58/455 (12%), Positives = 115/455 (25%), Gaps = 63/455 (13%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + + ++ N +IIFA++++ + IG I + + SM++A ++A L + + S Sbjct: 14 SSRFVKTDGGNVAIIFAIALLPMIGFIGAAIDYSRANKARTSMQAALDSAALMVSKDLAS 73 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + G ++ + I + + S+ Sbjct: 74 GVITAGQVSAKAQSYFASLYNNTEAPNITVTATYTAKDSTGSSTVLLKGTGDISTEFMNM 133 Query: 131 HMANNRLDSSNNTIFY---------------------NMDVMTSYDYRLQFIEHLLNQRY 169 S T + M M S L Q Sbjct: 134 FGFPTLGIGSAATATWGGTRLRVAIALDVTGSMASAGKMPAMQSAAKTLVDNLRANAQTA 193 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVD----LSGSMHCAMNSDPEDVNSAPICQDKKR 225 + +S IP + +G+ + D +GS + + SA Sbjct: 194 DDLYISIIPFAQMVNVGKSNKNASWIKWDYWEDTTGSCNWWWLTTKSSCESAGRTWSSTN 253 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE-------PSWGTEKVRQ 278 D K+ + ++ + Sbjct: 254 QS-----QWGGCVTDRDQPADTTKDAPTTAATRFPAANYSACPEQILPMTSAYSSSNATT 308 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ M A+ L + + II L+D Sbjct: 309 IKDKIDALSPNGGTNQPIGMHWAWMSLQDGAPLNTPAKDADYK-------YTDAIILLSD 361 Query: 339 GENN-----------NFKSNVNTIKICDKAKENFIK------IVTISINAS-PNGQRLLK 380 G N + + +CD + I TI +N +LK Sbjct: 362 GMNTIDRWYGNGSSWSKDVDARQKLLCDNIRAASAASTTKTVIYTIQVNTDGDPESEVLK 421 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 C S + A + F I + + + Sbjct: 422 YCADS-GNFFATTTASGISTAFAQIGASLSKLRIA 455 >gi|32471725|ref|NP_864718.1| hypothetical protein RB2055 [Rhodopirellula baltica SH 1] gi|32397096|emb|CAD72400.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 402 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 47/411 (11%), Positives = 123/411 (29%), Gaps = 55/411 (13%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K ++ + + L+L ++I V + +AA+ A + Sbjct: 29 LKNQPKQRSGAVIVLLVIMLPVLLILAAYVINVAYVEAVTADSQVVTDAAVCAAGRVYIQ 88 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + + + ++ F + +++ Sbjct: 89 TGDKNAALAAARDAAERNPVAGKVV---------PINMSDLEFGISLRESLDEGYSFQ-- 137 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 ++ + N + + S + + + R+ + + Sbjct: 138 -PLSDDDEFGNAVRLTTLSLSNSPQPVFSPLFPTMGTNLEIR-------PQRVAVSTQST 189 Query: 191 FLIELVVDLSGSMHCAMN-----------SDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + LV+D SGSM A + + P P+ + + + A NA FL Sbjct: 190 MDVALVIDRSGSMAYANDEAPDPYVNPAAAPPGWTYGDPVPPNSRWLDLVASVNAFNGFL 249 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + + L Y+ ++ + + ++ + T ++ Sbjct: 250 A------DSPQYEKLCLATYSDNASRDCDLTHTYAEISNQLDAISYQFNGGGTSVGYGLE 303 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 +LT F + ++ +TDG +N KS + + Sbjct: 304 HGLAVLTDATHARKFA--------------VRVMVLMTDGHHNTGKSPESMTY---HLQN 346 Query: 360 NFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLIHVFQNISQLM 409 + + + TI+ + Q + ++ +++ +A L + FQ I++ + Sbjct: 347 HGVTLFTITFS-DDADQSRMSNLANACGGENFHATDASQLQNAFQKIAKKL 396 >gi|288956977|ref|YP_003447318.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] gi|288909285|dbj|BAI70774.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] Length = 456 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 74/466 (15%), Positives = 145/466 (31%), Gaps = 81/466 (17%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R A + + +I+ ALS + L ++G I + + + A +AA LA Sbjct: 13 RVFAPLVAVAADRRGSVAIMVALSFLVLLGMLGVAIDFARAQFVSSRIYYAADAATLA-- 70 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHI--KESLSGYSAVFYNTEIQNIVN 123 +SR + + A F + SLS ++ T V Sbjct: 71 ------VSRENFQVSTNDQLKALAQSYFDANFPPGTMGATTSLSVATSGTPPTVQGFTVT 124 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + A TI V S F NQ + ++ + Sbjct: 125 VTATLPLVFAPLVETLGGPTIG---SVGISKASGAVFTTQTSNQGGMELVIVLDNSASMK 181 Query: 184 EMGERPIFLIELVVDL------------------SGSMHCAMNS----------DPEDVN 215 E ++ ++D+ SG+++ ++ + Sbjct: 182 GSQEDLRGGVKALLDMLYGNADTRKNLYVGIVHYSGAVNVLQSALKNKADIVAPVVGGMA 241 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + P+ + + L NA D + E Y G T + + Sbjct: 242 NCPMATVNGKLNGSRLSNAPPKTFK-FDSTTDGVEIQYCGASTLGTSSALSPNRGDADKA 300 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF------ 329 ++ YV T + +++LT S+ + + SLP Sbjct: 301 IKSYV-------AGGDTLIGEGLVWGWRMLTP----SWRGLWNTKDQPGASLPLDYDLPY 349 Query: 330 -QKFIIFLTDGEN-------------------NNFKSNVNTIKICDKAKEN-FIKIVTIS 368 +K ++ +TDG N + K++ + + IC+ AK++ + + TI+ Sbjct: 350 MKKVLVLMTDGVNHIAGRNYTAYYSDPYQTVADASKADADLMTICNAAKKDHNVVLYTIT 409 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + + Q+ + C S P HY+ L F + + K Sbjct: 410 YGSDTDEQQ-MSDCASDPSKHYHAALPQDLAKAFTQVGTDLTTMKL 454 >gi|192360615|ref|YP_001982630.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686780|gb|ACE84458.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 318 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 73/210 (34%), Gaps = 40/210 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM + +++ T++ A+K + F+ ++ Sbjct: 90 DLLLAVDISGSM----------REPDMVYNNRRITRLMAVKKVVGDFVAR-------RQS 132 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + T+ ++ + V++ + + T A+ + + L Sbjct: 133 DRLGLVLFGTQAFLQAPLTFDVKTVQEMLIEAESGYAGEATAIGDAIALSIKRLREQPNA 192 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ II LTDGEN + + T D A + KI TI+ + Sbjct: 193 ------------------KRVIILLTDGENTAGELGIATAT--DLAVKANTKIYTIAFSP 232 Query: 372 SPN---GQRLLKTCVSSPEYHYNVVNADSL 398 + + + + N L Sbjct: 233 YDREVDSHSMQQIAEQTGGEFFRARNTRDL 262 >gi|39936212|ref|NP_948488.1| hypothetical protein RPA3149 [Rhodopseudomonas palustris CGA009] gi|39650067|emb|CAE28590.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 455 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 58/460 (12%), Positives = 131/460 (28%), Gaps = 54/460 (11%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 + L ++ ++ + N ++IFAL+++ L IG + + + S+++A ++A Sbjct: 4 ISLAAQLTRAARRFPQANGGNIAVIFALALVPLLGFIGVAVDYSRANNARTSLQNALDSA 63 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA--------- 111 L + + + + + + S Sbjct: 64 ALMLSRDLGVGTITPDQVSSKAQTYFNSLYTNKETGAVTVTATYTAKDGSGSSTIAMSGQ 123 Query: 112 -----------VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFY-NMDVMTSYDYRLQ 159 F I + ++ LD + + M M + L Sbjct: 124 GAVQTQFMKILGFQTMAIGSSTTTTWGGTRLRVAMALDVTGSMASAGKMSAMKTAAKNLV 183 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 Q + +S +P + +G + DL + + +S S+ Sbjct: 184 DSLRASAQTVDDVYISVVPFAQMVNVGSSNRNASWVRWDLWDESNGSCSSWWYSTKSSCE 243 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKED-----VYMGLIGYTTRVEKNIEP--SWG 272 + T + + A D K+ + Y ++ + ++ Sbjct: 244 YAGRTWTATSHNQWA-GCVTDRDQPADTTKDVPTSYATRFPAVDYDACPQQLLGMTSAYS 302 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + T+ M A+ L + + K + + Sbjct: 303 LSNATTIKNKIDALSPNGGTNQAIGMHWAWMSLRTGDPLNTPA-------KDSNYKYTDA 355 Query: 333 IIFLTDGENN-----------NFKSNVNTIKICDKAKE-----NFIKIVTISINAS-PNG 375 II L+DG N + + + +CD + N + I TI +N Sbjct: 356 IILLSDGLNTVDRWYGNGRDWSPQVDARQRILCDNIRASATNTNPVVIYTIQVNTDGDPE 415 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +LK C S + + + F I + + + Sbjct: 416 SAVLKYCADS-GNFFATTTSSGIGTAFAQIGSSLSKLRVA 454 >gi|283782262|ref|YP_003373017.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283440715|gb|ADB19157.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 395 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 62/407 (15%), Positives = 134/407 (32%), Gaps = 40/407 (9%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 +K + + ++ A ++ + + F I V ++ + +A +AA AG + Sbjct: 9 SRKPRLARRGAMLVLIAFLLVVVVCMAAFAIDVSYMQLVRSELRAATDAAAKAGTLALAK 68 Query: 71 NLSRLGDRFESISNHAKRALI-DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + A R + A + ++ S A + N + + + Sbjct: 69 TDGDAASARTAAIQAAARNKVAGRALVLTTDQVQVGRSAAQANGTWSFTANQTPYTSVKI 128 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 ++ + + + F+ + + SF PA + + Sbjct: 129 LSSMSDSTAAGSVPL---------------FLGTFMGRG------SFQPA--QSATASQM 165 Query: 190 IFLIELVVDLSGSMHCAMN----SDPEDVNSAPICQDKKR----TKMAALKNALLLFLDS 241 I LV+D S SM M+ S P + P ++ AAL++++ LF+ Sbjct: 166 EQEICLVIDRSHSMCFNMSGVEWSYPPGTKTTPHTICYPPHATLSRWAALQSSVNLFM-- 223 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNI-EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 D + + LI + + + N E S+ + S S A + Sbjct: 224 -DTILETNNTPRVALITWGSTIGTNTAEYSYTKKTEVAVANELGLSTDYAAVKSKIAART 282 Query: 301 AYQILTSDK-KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 +L +K +I +TDG+ + + I + A + Sbjct: 283 TKVMLGGTNMSAGIDAGRTLLNGNTVRALAKKTMILMTDGQ---WNQGRDPIDAAEDAAD 339 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 I+I TI+ + + + + +Y N L F++++ Sbjct: 340 EGIQIHTITFLSGSAQNTMRQVAEITGGKYYVSSNQAELEEAFRDLA 386 >gi|126727880|ref|ZP_01743708.1| hypothetical protein RB2150_00467 [Rhodobacterales bacterium HTCC2150] gi|126702821|gb|EBA01926.1| hypothetical protein RB2150_00467 [Rhodobacterales bacterium HTCC2150] Length = 576 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 46/124 (37%), Gaps = 5/124 (4%) Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN----FKSNVNTIK 352 A + Y +T + + I TD + + +S+ Sbjct: 454 ATETWYNSMTGMTFENLSWEEAWGLMSIEYYESVMGSGAATDWGSTSARTGSQSDTLMSA 513 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 C AK+ I I TI+ A N + L C +S ++Y+ S+ VF +I+ + Sbjct: 514 NCTAAKDRGITIFTIAFEAPSNAETQLNNCATSDNHYYDAQGT-SITSVFSSIATTIQKL 572 Query: 413 KYSV 416 K ++ Sbjct: 573 KLTL 576 >gi|126731914|ref|ZP_01747718.1| BatB protein, putative [Sagittula stellata E-37] gi|126707741|gb|EBA06803.1| BatB protein, putative [Sagittula stellata E-37] Length = 323 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 38/222 (17%), Positives = 81/222 (36%), Gaps = 43/222 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I + +DLSGSM + T+++ +K F+ ++ Sbjct: 92 DIMMAIDLSGSME----------ERDFAVGGRPATRLSIVKETADDFISR-------RDG 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ ++ R ++ E VR+ + + L + T A+ + + L + Sbjct: 135 DRLGLVLFSDRAYLQAPLTFDREAVRKLLDQAQVGLTGQKTAIGDAIAVSVKRLKDRPED 194 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ LTDG NN + + D A + I+I TI + Sbjct: 195 G------------------RVLVLLTDGANNEGVMSPDKAA--DLAAKLGIRIYTIGV-G 233 Query: 372 SPNGQRL----LKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 S + L L+ ++ ++ + L +++ I +L Sbjct: 234 SARSRDLDERTLRQIADATGGAYFRATDVQGLAQIYRAIDRL 275 >gi|118496821|ref|YP_897871.1| von Willebrand factor type A domain-containing protein [Francisella tularensis subsp. novicida U112] gi|118422727|gb|ABK89117.1| von Willebrand factor type A domain protein [Francisella novicida U112] Length = 333 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 52/269 (19%), Positives = 96/269 (35%), Gaps = 55/269 (20%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 L+++ + S I L + + + + +DLSGSM + Sbjct: 60 LKYLLGFIWILLI-ISGSGIQWLGKPVSLAQSGRDLIMAIDLSGSMA---------IQDM 109 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + ++ + FLD+ ++ +GLI + TR ++ V+ Sbjct: 110 KKANGQMESRFDLVMRVANQFLDT-------RKGDRVGLILFGTRAYLQTPLTFDIATVK 162 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + +L T A+ A + L S K +I LT Sbjct: 163 KMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS------------------KALILLT 204 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTIS------INASPNGQRL-----------LK 380 DGENN + ++ + AK+ IKI TI I + GQRL L+ Sbjct: 205 DGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLE 262 Query: 381 TCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + + ++ N+ L V+++I +L Sbjct: 263 KIATMTGGKYFRAQNSSDLKKVYESIDKL 291 >gi|114704798|ref|ZP_01437706.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] gi|114539583|gb|EAU42703.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] Length = 545 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 36/176 (20%), Positives = 73/176 (41%), Gaps = 23/176 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 K+ +Y T +E ++ + V V + T+ T ++ + LT+ Sbjct: 383 KDTIYPEAHCATGSLEPITGLTFDLQSVETAVNKL---TPSGNTNVTIGVQWGMEALTAA 439 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN---------NNFKSNVNTIKICDKAKE 359 + GV+ S +K +I LTDG N + K + T+ C+ AK Sbjct: 440 APLT--------GVRTGS-EVRKVMIVLTDGLNTQNRWWGSRDRNKIDARTLAACNNAKA 490 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I++ T+ + + LLKTC + + ++ V +A L F ++++ + + + Sbjct: 491 MGIELYTVRLV--EGNEDLLKTCAETEDKYHYVTSASQLKTTFADLARQVKGVRLA 544 >gi|328676285|gb|AEB27155.1| BatA in aerotolerance operon [Francisella cf. novicida Fx1] Length = 333 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 54/272 (19%), Positives = 100/272 (36%), Gaps = 56/272 (20%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPA--LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 R ++++LL + I+S L + + + + +DLSGSM + Sbjct: 56 RANYLKYLLGVIWILLIISGSGIQWLGKPVSLPQSGRDLIMAIDLSGSMA---------I 106 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + ++ + FLD+ ++ +GLI + TR ++ Sbjct: 107 QDMKKANGQMESRFDLVMRVANQFLDT-------RKGDRVGLILFGTRAYLQTPLTFDIA 159 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 V++ + +L T A+ A + L S K +I Sbjct: 160 TVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS------------------KALI 201 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS------INASPNGQRL---------- 378 LTDGENN + ++ + AK+ IKI TI I + GQRL Sbjct: 202 LLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTT 259 Query: 379 -LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 L+ + + ++ N+ L V+++I +L Sbjct: 260 VLEKIATMTGGKYFRAQNSSDLKKVYESIDKL 291 >gi|254372185|ref|ZP_04987677.1| conserved hypothetical protein [Francisella tularensis subsp. novicida GA99-3549] gi|151569915|gb|EDN35569.1| conserved hypothetical protein [Francisella novicida GA99-3549] Length = 339 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 52/269 (19%), Positives = 96/269 (35%), Gaps = 55/269 (20%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 L+++ + S I L + + + + +DLSGSM + Sbjct: 66 LKYLLGFIWILLI-ISGSGIQWLGKPVSLPQSGRDLIMAIDLSGSMA---------IQDM 115 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + ++ + FLD+ ++ +GLI + TR ++ V+ Sbjct: 116 KKANGQMESRFDLVMRVANQFLDT-------RKGDRVGLILFGTRAYLQTPLTFDIATVK 168 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + +L T A+ A + L S K +I LT Sbjct: 169 KMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS------------------KALILLT 210 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTIS------INASPNGQRL-----------LK 380 DGENN + ++ + AK+ IKI TI I + GQRL L+ Sbjct: 211 DGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLE 268 Query: 381 TCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + + ++ N+ L V+++I +L Sbjct: 269 KIATMTGGKYFRAQNSSDLKKVYESIDKL 297 >gi|167759260|ref|ZP_02431387.1| hypothetical protein CLOSCI_01607 [Clostridium scindens ATCC 35704] gi|167663134|gb|EDS07264.1| hypothetical protein CLOSCI_01607 [Clostridium scindens ATCC 35704] Length = 800 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 48/329 (14%), Positives = 107/329 (32%), Gaps = 61/329 (18%) Query: 108 GYSAVFYNTEIQNIVNSSRISMTH--MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 + + E+ ++ +T + + N + ++ + + + +++ Sbjct: 94 ADTGAVSDAEVTPEKEAATEGITEQDVPEAENVGAGNPVAVHLSRGAASEAAPEHQKYIK 153 Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 N ++ + +P+ + L+VD SGSM+ M++D K Sbjct: 154 KNAENDYTLTLNVKGMYDSETTKPMIDVLLIVDKSGSMNWKMDTDKVG----------KP 203 Query: 226 TKMAALKNALLLFLDSIDLLSHVKE-DVYMGLIGYT--------TRVEKNIEPSWGTEKV 276 ++M LK + D + + D M ++ Y+ + I W K Sbjct: 204 SRMDVLKQVVTGTGGLTDSIFGNTQIDAQMAVVTYSGSNDFLDQRYDDAEIIQEW--TKQ 261 Query: 277 RQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + V ++++ T+ ++ L ++ + +KF+IF Sbjct: 262 KDTVNNAVNNIQAKGGTNCEAGLRTGATALEGSRENA-----------------KKFVIF 304 Query: 336 LTDGE--------------NNNFKSNVNTIKICDKAKENFIK-IVTISINASPNGQRLLK 380 L+DG+ + I K ++ TI S + L Sbjct: 305 LSDGDATFYYGDDGYTKGPGSGSSPTAREKAIAQVQKITGLEGFYTIG-MTSSSSSEFLT 363 Query: 381 TCV----SSPEYHYNVVNADSLIHVFQNI 405 +S + Y N ++L FQ I Sbjct: 364 NLANNSKASEKRFYPANNTEALEKAFQEI 392 >gi|194324498|ref|ZP_03058270.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] gi|194321333|gb|EDX18819.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] Length = 339 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 52/269 (19%), Positives = 96/269 (35%), Gaps = 55/269 (20%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 L+++ + S I L + + + + +DLSGSM + Sbjct: 66 LKYLLGFIWILLI-ISGSGIQWLGKPVSLAQSGRDLIMAIDLSGSMA---------IQDM 115 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + ++ + FLD+ ++ +GLI + TR ++ V+ Sbjct: 116 KKANGQMESRFDLVMRVANQFLDT-------RKGDRVGLILFGTRAYLQTPLTFDIATVK 168 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + +L T A+ A + L S K +I LT Sbjct: 169 KMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS------------------KALILLT 210 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTIS------INASPNGQRL-----------LK 380 DGENN + ++ + AK+ IKI TI I + GQRL L+ Sbjct: 211 DGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLE 268 Query: 381 TCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + + ++ N+ L V+++I +L Sbjct: 269 KIATMTGGKYFRAQNSSDLKKVYESIDKL 297 >gi|118591415|ref|ZP_01548813.1| hypothetical protein SIAM614_27248 [Stappia aggregata IAM 12614] gi|118436087|gb|EAV42730.1| hypothetical protein SIAM614_27248 [Stappia aggregata IAM 12614] Length = 474 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 80/512 (15%), Positives = 159/512 (31%), Gaps = 136/512 (26%) Query: 1 MHLLSRF--RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 M R R K KA+ +F L V+ +++ G I V + + A + Sbjct: 1 MQYFRRLIGRPLLKGFTGDRKASILPVFGLMVVLIVVIAGITIDVSRTVNAREKLSFAID 60 Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 AA L+ A+ + +++ +S+ +A + D+ + + F + I Sbjct: 61 AAALSVAADLSTSV---------MSDEQIKAALADSFK---------ANLADVEFLDEAI 102 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 +N+ + + + + D ++ L + +N S + Sbjct: 103 KNLS---------FVVDAENGTIKVSSFATLDNYFIDMGGYGMQALGPETFNFGTSSQVT 153 Query: 179 ALLRIEMGERPIFLIEL--VVDLSGSMHCAMNSDPEDVNSA-----PICQDKKRTKM--- 228 F +EL VVD++GSM M++ + P ++ +K+ Sbjct: 154 YS---------RFDVELALVVDVTGSMRNDMDTLRDASKGLVNILIPETTEEADSKVRIS 204 Query: 229 --------------AALKNALLLFLDSIDLLSHVKED--------VYMGLIGYTTRVEKN 266 A +K + + DS ++ ++ V + Y V+ + Sbjct: 205 LVPYSQGVNLGTYAAKVKGGVYGYADSSVCVTERQDYDDGEDIYKVRYTDMPYNYYVKTD 264 Query: 267 IEPS-----------WGTEKVR-------QYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 P GT K+ + D T + + + Sbjct: 265 PPPKDVFYGGGSNRCSGTSKMIPLTADRDTLLDAIADLDDNGGTAGQTGVVWGWNSI--S 322 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF------------------------ 344 S + + KF I +TDG+NN F Sbjct: 323 PNYSDVWPLASKPEPYDNDDVLKFAIIMTDGDNNRFYEFVKEREECDWVYSRRYGWQWTC 382 Query: 345 -------------------KSNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKTCV 383 S+ +C K+ I I + N S G + +++C Sbjct: 383 EMVSVNQWQERSESESYNNNSSKAQRALCQAMKDEGISIFGVYFGTNDSSAGSKNMQSCA 442 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 S+ +Y ++D LI+ F NI++ + S Sbjct: 443 ST-GNYYKATSSDELINAFANIAKKIQQIYVS 473 >gi|240137370|ref|YP_002961839.1| hypothetical protein MexAM1_META1p0632 [Methylobacterium extorquens AM1] gi|240007336|gb|ACS38562.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain; putative membrane protein [Methylobacterium extorquens AM1] Length = 339 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 44/228 (19%), Positives = 82/228 (35%), Gaps = 38/228 (16%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +DLSGSM S + +++AA+K F+ + Sbjct: 100 IVLALDLSGSMERKDFSLDGETV----------SRLAAVKRVGAEFIRR-------RAGD 142 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + P++ T V + + L+ + T + A + L Sbjct: 143 RIGLVEFADQAYVAAAPTFDTATVARTLEEATIGLVGRSTGIGDGLGLALKRL------- 195 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--- 369 P K ++ L+DG NN ++ + AK+ +++ TI++ Sbjct: 196 APAQVAAADGAGPPPSRDKVVVLLSDGANNAGQTAPKDVA--ALAKDLGVRVYTIALGPI 253 Query: 370 -NA--SPNGQRL-----LKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 A N Q + L+ +S + V D L +V I +L Sbjct: 254 DMADNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENVANAIDEL 301 >gi|329928736|ref|ZP_08282585.1| IPT/TIG domain protein [Paenibacillus sp. HGF5] gi|328937517|gb|EGG33935.1| IPT/TIG domain protein [Paenibacillus sp. HGF5] Length = 964 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 49/235 (20%), Positives = 98/235 (41%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L++D SGSM ++ A KNA F+D +D+ H Sbjct: 70 DVILIIDKSGSMQTD-------------------NRINAAKNAAKGFIDLMDMTKH---- 106 Query: 252 VYMGLIGYTTRVEKNI-EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G++GY++ E + + T +Q++ + S T++ A+ QA +L+S Sbjct: 107 -QVGIVGYSSVAETSSLPLTTDTAAAKQFIDPIVAS---GGTETGYAIDQAITLLSSH-- 160 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 Q I+ +TDGE N+ ++ + + AK+ I TI++ Sbjct: 161 ---------------RPEAQPVIVIMTDGEANSSQAALERAQ---AAKDAGIVFYTIALL 202 Query: 371 ASPNG------QRLLKTCVSSPEYHYNVVNADSLIHVFQNI-SQLMVHRKYSVIL 418 + LLK ++ +H+ V+ + L ++ I +++ V Y+V++ Sbjct: 203 GPNDNPDTSAPNELLKQMATTNSHHHFVLGSTGLAEIYAAIVAEIGVASAYNVVV 257 >gi|218528855|ref|YP_002419671.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|254559548|ref|YP_003066643.1| hypothetical protein METDI1003 [Methylobacterium extorquens DM4] gi|218521158|gb|ACK81743.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|254266826|emb|CAX22625.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain; putative membrane protein [Methylobacterium extorquens DM4] Length = 339 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 44/228 (19%), Positives = 83/228 (36%), Gaps = 38/228 (16%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +DLSGSM S + +++AA+K F+ + Sbjct: 100 IVLALDLSGSMERKDFSLDGETV----------SRLAAVKRVGAEFIRR-------RAGD 142 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + P++ T V + + L+ + T + A + L Sbjct: 143 RIGLVEFADQAYVAAAPTFDTATVARTLEEATIGLVGRSTGIGDGLGLALKRL------- 195 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--- 369 + P K ++ L+DG NN ++ + AK+ +++ TI++ Sbjct: 196 APAQVAAADGEGPPPARDKVVVLLSDGANNAGQTAPKDVA--ALAKDLGVRVYTIALGPI 253 Query: 370 -NA--SPNGQRL-----LKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 A N Q + L+ +S + V D L +V I +L Sbjct: 254 DMADNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENVANAIDEL 301 >gi|163850298|ref|YP_001638341.1| von Willebrand factor type A [Methylobacterium extorquens PA1] gi|163661903|gb|ABY29270.1| von Willebrand factor type A [Methylobacterium extorquens PA1] Length = 339 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 44/228 (19%), Positives = 83/228 (36%), Gaps = 38/228 (16%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +DLSGSM S + +++AA+K F+ + Sbjct: 100 IVLALDLSGSMERKDFSLDGETV----------SRLAAVKRVGAEFIRR-------RAGD 142 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + P++ T V + + L+ + T + A + L Sbjct: 143 RIGLVEFADQAYVAAAPTFDTATVARTLEEATIGLVGRSTGIGDGLGLALKRL------- 195 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--- 369 + P K ++ L+DG NN ++ + AK+ +++ TI++ Sbjct: 196 APAQVAAADGEGPPPARDKVVVLLSDGANNAGQTAPKDVA--ALAKDLGVRVYTIALGPI 253 Query: 370 -NA--SPNGQRL-----LKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 A N Q + L+ +S + V D L +V I +L Sbjct: 254 DMADNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENVANAIDEL 301 >gi|254373668|ref|ZP_04989152.1| conserved hypothetical protein [Francisella novicida GA99-3548] gi|151571390|gb|EDN37044.1| conserved hypothetical protein [Francisella novicida GA99-3548] Length = 339 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 54/272 (19%), Positives = 100/272 (36%), Gaps = 56/272 (20%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPA--LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 R ++++LL + I+S L + + + + +DLSGSM + Sbjct: 62 RANYLKYLLGVIWILLIISGSGIQWLGKPVSLPQSGRDLIMAIDLSGSMA---------I 112 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + ++ + FLD+ ++ +GLI + TR ++ Sbjct: 113 QDMKKANGQMESRFDLVMRVANQFLDT-------RKGDRVGLILFGTRAYLQTPLTFDIA 165 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 V++ + +L T A+ A + L S K +I Sbjct: 166 TVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKFPGDS------------------KALI 207 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS------INASPNGQRL---------- 378 LTDGENN + ++ + AK+ IKI TI I + GQRL Sbjct: 208 LLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTT 265 Query: 379 -LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 L+ + + ++ N+ L V+++I +L Sbjct: 266 VLEKIATMTGGKYFRAQNSSDLKKVYESIDKL 297 >gi|94498567|ref|ZP_01305122.1| hypothetical protein SKA58_08339 [Sphingomonas sp. SKA58] gi|94422010|gb|EAT07056.1| hypothetical protein SKA58_08339 [Sphingomonas sp. SKA58] Length = 678 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 59/190 (31%), Gaps = 41/190 (21%) Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L YT R S Y+ + + T M + L+ + Sbjct: 499 LTSYTNRTSTPTGQS---SSFNSYIDNLIA---VGGTYHDIGMLWGARFLSPKGIFASDN 552 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFK----------------------------SN 347 N G I + I+F+TDG+ + ++ N Sbjct: 553 NSAPNGFNIS-----RHIVFMTDGDMSAYQQVYGAYGYQQLDARVAPGNTSDTDLTAIHN 607 Query: 348 VNTIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 +C+ K I I I G Q L+ C +S + +A SL F++I Sbjct: 608 TRLQMLCNAIKAKGITIWVIGFRNQSEGNIQTPLQNCATSSNHWTMAYDATSLSQKFKDI 667 Query: 406 SQLMVHRKYS 415 ++ + + S Sbjct: 668 AKNIGGLRVS 677 Score = 44.9 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 41/254 (16%), Positives = 86/254 (33%), Gaps = 52/254 (20%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + + ++K N + A +++ LIG + + + + M+ A +AA LAG M Sbjct: 11 FMSRLARNQKGNVMAMVAAAIIPLAALIGGGLDMGRAYMARARMQQACDAAALAGRRAMT 70 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + ++ + AK+ F + + + ++ V + + S Sbjct: 71 T-----SSMTQANKDEAKKFFD---FNFPQGTFQAA--TFTPVIRSKPGETTTVQVTAST 120 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 T T+ ++ +D Sbjct: 121 TMPTTVMKIFRYETLPLSVTCEARFDIGNT------------------------------ 150 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + LV+D +GSM A++ T++AALK A+ F D++ S+ Sbjct: 151 --DVMLVLDTTGSMAYAISDGKGGST----------TRLAALKQAVKDFYDTLGAGSNAT 198 Query: 250 EDVYMGLIGYTTRV 263 + G + Y++ V Sbjct: 199 GRIRYGFMPYSSTV 212 >gi|328675375|gb|AEB28050.1| BatA in aerotolerance operon [Francisella cf. novicida 3523] Length = 333 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 54/272 (19%), Positives = 99/272 (36%), Gaps = 56/272 (20%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPA--LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 R ++++LL + I+S L + + + + +DLSGSM + Sbjct: 56 RANYLKYLLGVIWILLIISGSGIQWLGKPISLPQSGRDLIMAIDLSGSMA---------I 106 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + ++ + FLD+ ++ +GLI + TR ++ Sbjct: 107 QDMKKSNGQMESRFDLVMRVANQFLDT-------RKGDRVGLILFGTRAYLQTPLTFDIA 159 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 V++ + +L T A+ A + L S K +I Sbjct: 160 TVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS------------------KALI 201 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS------INASPNGQRLLKTCVS---- 384 LTDGENN + ++ + AK+ IKI TI I + GQRL+ T Sbjct: 202 LLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLINTSEDLDTT 259 Query: 385 --------SPEYHYNVVNADSLIHVFQNISQL 408 + ++ N+ L V+++I +L Sbjct: 260 VLEKIAEMTGGKYFRAQNSSDLKKVYESIDKL 291 >gi|83859216|ref|ZP_00952737.1| hypothetical protein OA2633_12465 [Oceanicaulis alexandrii HTCC2633] gi|83852663|gb|EAP90516.1| hypothetical protein OA2633_12465 [Oceanicaulis alexandrii HTCC2633] Length = 441 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 68/489 (13%), Positives = 139/489 (28%), Gaps = 144/489 (29%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 R + + N +++FA+ + ++ +G + ++SA +A LA Sbjct: 18 RDMRSLLRHFTQDVRGNVAMMFAMLLGPLVVSVGGALDYSRTFTIGAEIQSAMDAGTLAA 77 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 AS + +D + ++N+I +LS ++ V Sbjct: 78 ASL---------------------SQGEDPETIVRNYITAALSEHNGVLE---------- 106 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 R+++ ++ ++S T + V T + ++ + + Sbjct: 107 -RLNVQVSSDLAINSREVTADAVISVPT-------LMLGIIGYD------ALTLNRVSEA 152 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID- 243 I LV+D+SGSM +K+ AL++A F+ + Sbjct: 153 NERVRNLEISLVLDISGSMSG--------------------SKITALRDAAEEFVGVMMD 192 Query: 244 ---------LLSHVKEDVYMGLIGYTTRVEKNIE------------------------PS 270 + V + V Sbjct: 193 PDLEGLTSLSVIPYNGGVRLPQTVTNDLVPGTPNDSGCLELGVSDPVTMDLAANGYDWLD 252 Query: 271 WGTEKVRQY-----------------------VTRDMDSLILKPTDSTPAMKQAYQILTS 307 W R + V D T A + L Sbjct: 253 WQDRDQRGWRSSAFCPEENEATVFLEQTPSVLVNLIRDLDAGGNTGLDVATAWGARALDP 312 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG---------ENNNFKSNVNTIKICDKAK 358 + +F + K ++ +TDG +N I +A+ Sbjct: 313 AWRGRLGGDFASRPAAYDDPSTMKVLVVMTDGAATAQIRRAQNWYGDWYSYEIYSASQAR 372 Query: 359 EN-----------FIKIVTISINAS-PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 +N + I TI+ S + L++ C S PE +Y V N + F +I+ Sbjct: 373 DNMADACDAAEAEGVHIYTIAFQVSGSTNRNLMRDCASRPENYYAVENL-DISAAFNSIA 431 Query: 407 QLMVHRKYS 415 + + + + Sbjct: 432 ADLNNLRLA 440 >gi|320158179|ref|YP_004190557.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933491|gb|ADV88354.1| BatA [Vibrio vulnificus MO6-24/O] Length = 323 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 46/235 (19%), Positives = 85/235 (36%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM ED+ D +++A+KN + F++ ++ Sbjct: 87 DLMLVVDLSGSMQ------QEDILQDGDYID----RLSAVKNVVTQFIEQ-------RQG 129 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + V + + + LI + T + A D + Sbjct: 130 DRLGLVLFADHAYLQTPLTADRQTVANQLNQTIIGLIGQKTAIGDGLALA-TKTFVDSEA 188 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 Q+ +I L+DG N + + AK+ +KI TI I Sbjct: 189 -----------------PQRVVILLSDGSNTAGTLDPIEAA--NIAKKYGVKIYTIGIGA 229 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +A + + L K + ++ +A L ++Q I+QL Sbjct: 230 GEMEVKQFFMTRKVNTSADLDEKTLTKIATMTGGQYFRARDAQELQAIYQAINQL 284 >gi|254443725|ref|ZP_05057201.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198258033|gb|EDY82341.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 339 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 48/252 (19%), Positives = 81/252 (32%), Gaps = 58/252 (23%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 A+ + + I L VDLS SM K+ ++ A+K L F Sbjct: 74 AVTTERHSKSRGYDIVLAVDLSRSME----------AEDYFVDRKRSNRLQAVKPVLSAF 123 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 ++ +E+ +GLI + R ++ + + + R LI T ++ Sbjct: 124 INR-------RENDRIGLIAFAGRAYTVAPLTFDHKWLARQTERLQIGLIEDGTAIGDSL 176 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A L K FI+ LTDGEN + AK Sbjct: 177 AVATSRLLEGAKERAGEREGA------------FIVLLTDGENTAGMMDPMEGAT--LAK 222 Query: 359 ENFIKIVTISINASPNGQ-------------------------RLLKTCVSSPEYHYNVV 393 + I++ TI+ A NG L+K + + Sbjct: 223 DAGIRVYTIA--AGKNGYVPFPRRNERGERIGTTQEFLRVDTETLMKIANETNGEFFRAE 280 Query: 394 NADSLIHVFQNI 405 N+D++ F+ I Sbjct: 281 NSDTIDQAFEKI 292 >gi|83816834|ref|YP_446668.1| von Willebrand factor type A domain-containing protein [Salinibacter ruber DSM 13855] gi|83758228|gb|ABC46341.1| von Willebrand factor type A domain protein [Salinibacter ruber DSM 13855] Length = 289 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 76/235 (32%), Gaps = 62/235 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I +V+D S SM D + T+ A + A F++ + Sbjct: 51 IDIMMVLDASTSMQAE---------------DFQPTRFEAAREAAGAFVE-------GRV 88 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GLI + + +++ + + T A+ A L + Sbjct: 89 SDRVGLIVFAAEAYTQAPLTLDYSFLQRMLEDVEVGAVEDGTAVGTALATAVNRLKDSEA 148 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 S K I LTDG NN + + T + A+ +++ I + Sbjct: 149 ES------------------KVAILLTDGRNNRGQIDPRTAA--EVARTMGVRVYAIGVG 188 Query: 371 AS----------PNGQR----------LLKTCVSSPEYHYNVVNADSLIHVFQNI 405 +S P GQR L VS+ +++ N D+L ++ I Sbjct: 189 SSEDRDTWEEPLPQGQRDESAGVDAEMLRSVSVSTGGQYFSATNRDALERIYAEI 243 >gi|83859217|ref|ZP_00952738.1| hypothetical protein OA2633_12470 [Oceanicaulis alexandrii HTCC2633] gi|83852664|gb|EAP90517.1| hypothetical protein OA2633_12470 [Oceanicaulis alexandrii HTCC2633] Length = 436 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 62/484 (12%), Positives = 136/484 (28%), Gaps = 118/484 (24%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M +L R F + + N +II AL + +G + + ++SA ++ Sbjct: 1 MSVLERI---FSRWSDDRRGNVAIIMALCSGVLVTAVGGALDYSRSTTVSSELQSALDSG 57 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 LA AS + RA ++ A + Sbjct: 58 ALAAASL----------TQDRNPEDVVRAYVEAAL--------------------ADHPQ 87 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 ++ S ++ + + N T M + L+ Sbjct: 88 LLASLQLDVVADISLNSRVVNATASVAMPTT---------MLGLVGINTLTL------EH 132 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + I LV+D+SGSM + + +D + +++ + Sbjct: 133 ASEAIEQVRDVEISLVLDVSGSMGGSKINALQDAAIEFVEIVLAADAAERTSISVIPYNG 192 Query: 241 SIDLLSHVKEDV---------YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + V +D+ G + T + + + +Y + Sbjct: 193 GVRTPREVNQDIVSGNNNHRRQSGCVDMGTDYPVEMTLPYREMEFTEYYGSEQTGNSSSA 252 Query: 292 ------------TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL------------ 327 + + M+ L ++ + Sbjct: 253 FCPRSNMESEFLSQNEGRMRGLINSLRAEGNTGLDVATMWGARALDPAWRGNLGGSFSDR 312 Query: 328 -------PFQKFIIFLTDGEN----------------------------NNFKSNVNTIK 352 K ++ +TDGE + ++ N + Sbjct: 313 PASYDDRDTIKILVVMTDGEATAQIRSEEYTYYDWWGRERTGTRSYELYSARQARENMAE 372 Query: 353 ICDKAKENFIKIVTISINAS-PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 CD A+ N ++I TI+ S + L++ C + P+ +Y V N + F +I+ + Sbjct: 373 ACDIAEGNGVQIYTIAFQLSGQTNRDLMRNCANKPQNYYQVENL-DIAEAFSSIAADINR 431 Query: 412 RKYS 415 + + Sbjct: 432 LRLT 435 >gi|268316013|ref|YP_003289732.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] gi|262333547|gb|ACY47344.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] Length = 329 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 45/267 (16%), Positives = 88/267 (32%), Gaps = 71/267 (26%) Query: 171 QKIVSFIPALLRIEMGERPIF--------LIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + V + L ER +F + LV+DLS SM QD Sbjct: 60 RLAVLTLGILALARPQERQVFEKRTVEGRDLMLVLDLSSSMLA---------------QD 104 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 ++ + + F+ +GL+ + + + P+ + + R Sbjct: 105 FSPSRFEVARRTAIQFV-QGRRAD------RIGLVVFAGQAFTQVPPTLDYRFLLTMLQR 157 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T A+ A L + + RS K II LTDG+NN Sbjct: 158 LQVGRLEDGTAIGTAIATAINRLKNSEARS------------------KVIILLTDGQNN 199 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISIN----AS----------------PNGQRLLKTC 382 + + T + A++ I+I TI ++ A + +++ Sbjct: 200 RGEIDPLTAA--ELARQAGIRIYTIGLSGRGEAPYPVQTPFGTRPQPVPVEIDEAMMREV 257 Query: 383 VS-SPEYHYNVVNADSLIHVFQNISQL 408 + ++ +A +L ++ I +L Sbjct: 258 AEKTGGRYFRATDARTLEAIYAEIDRL 284 >gi|167644155|ref|YP_001681818.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] gi|167346585|gb|ABZ69320.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] Length = 562 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 57/174 (32%), Gaps = 44/174 (25%) Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + T+ + + L+ + + + K K I+ LTDG+N Sbjct: 392 KLDQMIASGNTNVAMGLIWGWHTLSKNAPFADGVDPATTVGK----RTTKVIVLLTDGDN 447 Query: 342 NNFKS----------------------------------------NVNTIKICDKAKENF 361 N + + C AK Sbjct: 448 TNDTYNNPNASIYTGYGYITQGRLLNASNSPLGATSTATNRRDAIDSREARACTNAKAAG 507 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++I I + S + + +L+ C S PE +Y+V +A L VF I+ + + + + Sbjct: 508 VQIYAIGVGVSSHSRGILQDCASKPEMYYDVTDAAQLASVFNTIAGSIQNLRIT 561 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 43/304 (14%), Positives = 88/304 (28%), Gaps = 75/304 (24%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 + A E+ ++ FAL ++ +L LI + +K ++ A +AA L Sbjct: 8 RLVTRLVTRLGADERGAIAVQFALLLIPIAVLTFGLIDISRASVQKRQLQDALDAATLMA 67 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 A R + +N + D A + + ++ F + +V + Sbjct: 68 A------------RSTATTNADLDTIGDAALATEMAGLGVTFGPGNSSFVLGDNNTVVGT 115 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + ++ I L N + + + I Sbjct: 116 IQ----------------------------NVVIKPIISNLWSSTNTPVSATATVMRSIN 147 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID- 243 + LV+D +GSM ++ S +K+ AL A +D + Sbjct: 148 -----HLEVALVLDNTGSMASSLGSGG--------------SKITALITASKSLVDVLSA 188 Query: 244 -------------LLSHVKEDVYMGLI--GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + V +G T+ + ++G + R Sbjct: 189 AAARATEADAVKISVVPFSMTVNIGSTYQTQTSWLTGTQPAAYGVDNFATSQNRFTLLSN 248 Query: 289 LKPT 292 L T Sbjct: 249 LGLT 252 >gi|189219434|ref|YP_001940075.1| hypothetical protein Minf_1423 [Methylacidiphilum infernorum V4] gi|189186292|gb|ACD83477.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Methylacidiphilum infernorum V4] Length = 334 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 85/244 (34%), Gaps = 54/244 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + I LV+D+SGSM + K+ +++ + + FLD Sbjct: 81 PLRKEGYDIILVLDISGSM----------LAEDYEIDQKRVSRLDIVLEVVKTFLDK--- 127 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + + +GL+ + R ++ +++ + + I T A+ A Sbjct: 128 ----RTNDRIGLVAFAGRAYTVCPLTFDHNWLKRKIDQLQAGTIEDGTAIGDALGLALSR 183 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L K+ + KI S F+I LTDG NN N+ I+ A + + Sbjct: 184 LEGKKESG-------ERKKIGS-----FLILLTDGANNC--GNLTPIEAARLAAHAAVPV 229 Query: 365 VTIS----------INASPNGQ------------RLLKTCVS-SPEYHYNVVNADSLIHV 401 TI + + LL+ + ++ ++++++ Sbjct: 230 FTIGAGINGEVTMPVMDEERRKIGSQTVVSEVDEGLLRNIAQLTGGEYFRATDSNAIVSA 289 Query: 402 FQNI 405 FQ I Sbjct: 290 FQAI 293 >gi|192291928|ref|YP_001992533.1| hypothetical protein Rpal_3558 [Rhodopseudomonas palustris TIE-1] gi|192285677|gb|ACF02058.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 455 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 58/460 (12%), Positives = 131/460 (28%), Gaps = 54/460 (11%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 + L ++ ++ + N ++IFAL+++ L IG + + + S+++A ++A Sbjct: 4 ISLAAQLTRAARRFPQANGGNIAVIFALALVPLLGFIGVAVDYSRANNARTSLQNALDSA 63 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA--------- 111 L + + + + + + S Sbjct: 64 ALMLSRDLGVGTITPDQVSSKAQTYFNSLYTNKETGAVTVTATYTAKDGSGSSTIAMSGQ 123 Query: 112 -----------VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFY-NMDVMTSYDYRLQ 159 F I + ++ LD + + M M + L Sbjct: 124 GAVQTQFMKILGFQTMAIGSSTTTTWGGTRLRVAMALDVTGSMASAGKMSAMKTAAKNLV 183 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 Q + +S +P + +G + DL + + +S S+ Sbjct: 184 DSLRASAQTADDVYISVVPFAQMVNVGSSNRNANWVRWDLWDESNGSCSSWWYSTKSSCE 243 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKED-----VYMGLIGYTTRVEKNIEP--SWG 272 + T + + A D K+ + Y ++ + ++ Sbjct: 244 YAGRTWTATSHNQWA-GCVTDRDQPADTTKDVPTSYATRFPAVDYDACPQQLLGMTSAYS 302 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + T+ M A+ L + + K + + Sbjct: 303 LSNATTIKNKIDALSPNGGTNQAIGMHWAWMSLRTGDPLNTPA-------KDSNYKYTDA 355 Query: 333 IIFLTDGENN-----------NFKSNVNTIKICDKAKE-----NFIKIVTISINAS-PNG 375 II L+DG N + + + +CD + N + I TI +N Sbjct: 356 IILLSDGLNTVDRWYGNGRDWSPQVDARQRILCDNIRASATNTNPVVIYTIQVNTDGDPE 415 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +LK C S + + + F I + + + Sbjct: 416 STVLKYCADS-GNFFATTTSSGIGTAFAQIGSSLSKLRVA 454 >gi|188580059|ref|YP_001923504.1| von Willebrand factor type A [Methylobacterium populi BJ001] gi|179343557|gb|ACB78969.1| von Willebrand factor type A [Methylobacterium populi BJ001] Length = 339 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 44/228 (19%), Positives = 81/228 (35%), Gaps = 38/228 (16%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +DLSGSM S + +++AA+K F+ + Sbjct: 100 IVLALDLSGSMERKDFSLDGETV----------SRLAAVKRVGAEFIRR-------RAGD 142 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + P++ T V + + L+ + T + A + L Sbjct: 143 RIGLVEFADQAYVAAAPTFDTAAVARTLEEATIGLVGRSTGIGDGLGLALKRL------- 195 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--- 369 P K ++ L+DG NN ++ + AK+ +++ TI++ Sbjct: 196 APAQLADAEGGGPPPSRDKVVVLLSDGANNAGQTAPKDVA--ALAKDLGVRVYTIALGPI 253 Query: 370 -NA--SPNGQRL-----LKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 A N Q + L+ +S + V D L V I +L Sbjct: 254 DMADNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLESVAAAIDEL 301 >gi|307943460|ref|ZP_07658804.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307773090|gb|EFO32307.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 320 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 19/81 (23%), Positives = 33/81 (40%), Gaps = 3/81 (3%) Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR--LLKTCVSSPEYHYNVV 393 + N+ S +CD+AK I I T+ N L C +SP ++Y V Sbjct: 239 MNSPANSKHNSVAYMKTMCDQAKAKGIIIYTVGFQIRRNTLPDLSLSYCATSPSHYYFVE 298 Query: 394 NADSLIHVFQNISQLMVHRKY 414 ++ L F+ I+ + + Sbjct: 299 SS-DLSAAFKAIASSIKSLRI 318 >gi|152990340|ref|YP_001356062.1| von Willebrand factor type A domain-containing protein [Nitratiruptor sp. SB155-2] gi|151422201|dbj|BAF69705.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 289 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 39/228 (17%), Positives = 84/228 (36%), Gaps = 44/228 (19%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 I++ +R + L +D SGSM ++ + ++K +K+ F Sbjct: 68 SIKLDDRKGRDLVLALDASGSMEESLYDE--------------KSKFEVVKSMAQNFFH- 112 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +G++ + + ++ T+ + + S+ T + Q Sbjct: 113 -KRFDDN-----IGIVIFGSFAYIAAPLTYDTKALDFLINYLEPSIAGNNTAIGEGLWQG 166 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + L +D + QK +I +TDG +N+ + + +KAK+ Sbjct: 167 IKALQADTAK------------------QKVLILITDGHHNSGSISPR--QAVEKAKKLG 206 Query: 362 IKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 IKI TI + + LL+ S + + + L +F +++L Sbjct: 207 IKIYTIGLG--DADKHLLEQIAKESGGKFFYAKSEEDLQSIFSELNKL 252 >gi|294508603|ref|YP_003572662.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] gi|294344932|emb|CBH25710.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] Length = 317 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 77/235 (32%), Gaps = 62/235 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I +V+D S SM D + T+ A + A F++ + Sbjct: 79 IDIMMVLDASTSMQAE---------------DFQPTRFEAAREAAGAFVE-------GRV 116 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GLI + + +++ + + T A+ A L + Sbjct: 117 SDRVGLIVFAAEAYTQAPLTLDYSFLQRMLEDVEVGAVEDGTAVGTALATAVNRLKDSEA 176 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 S K I LTDG NN + + T + A+ +++ I + Sbjct: 177 ES------------------KVAILLTDGRNNRGQIDPRTAA--EVAQTMGVRVYAIGVG 216 Query: 371 AS----------PNGQ---------RLLKTCVSS-PEYHYNVVNADSLIHVFQNI 405 +S P GQ +L++ +S +++ N D+L ++ I Sbjct: 217 SSEDRDTWEEPLPQGQRDESAGVDAEMLRSVSTSTGGQYFSATNRDALERIYAEI 271 >gi|148256121|ref|YP_001240706.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] gi|146408294|gb|ABQ36800.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] Length = 602 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 68/222 (30%), Gaps = 38/222 (17%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 Q ++ + + + + + + +T++ + + S+ ++ Sbjct: 393 RTQPNDANAVSPASSDVATLFPANQHMENNVQYCSSSA---STKLGQIVPLSYNWTSLKS 449 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 V T+ M A Q L + + + + II L+D Sbjct: 450 AVNAME---PTGGTNQAIGMAWAVQSLIPNGVLGAPAEDA-------NTTYNRVIILLSD 499 Query: 339 GENNNFKS---------------NVNTIKICDKAKENF-------IKIVTISINAS---P 373 G N + + +C K I TI +N S Sbjct: 500 GLNTEDRWPDYGNGSTQASGNPIDARQALLCSNLKNTKDSKGNAMYTIYTIQVNTSSPAD 559 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +L+ C SSP+ Y + ++ ++ F +I + + + Sbjct: 560 PTSTVLQNCASSPDKFYMLTSSSQIVTTFNSIGTALSKLRVA 601 Score = 37.2 bits (84), Expect = 4.6, Method: Composition-based stats. Identities = 41/260 (15%), Positives = 83/260 (31%), Gaps = 62/260 (23%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 L+ R ++ N + +FA++++ L IG I + +++M+ A ++ L Sbjct: 17 LVRLCRQVSRRFSGDISGNIATLFAIALLPILAFIGAAIDYSRANAARSAMQGALDSTAL 76 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + + D S + K Sbjct: 77 MLSRDLSQGTITAADVAAKASTYFKALY-------------------------------- 104 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 +S + + +S ++ N+ + S QF+ L+ Sbjct: 105 -TSTDAQSVAVTASYTASTSSSASNIQLNASGQIVTQFM-KLVGFPTMTFNTKATTTWGD 162 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL---LLFL 239 ++M + L +D +GSM + KM AL+NA+ + Sbjct: 163 VKMR------VALALDNTGSMAYS-------------------GKMTALQNAVAGSGGLI 197 Query: 240 DSIDLLSHVKEDVYMGLIGY 259 D + L+ DVY+ LI + Sbjct: 198 DQLSALAKSPGDVYISLIPF 217 >gi|323700353|ref|ZP_08112265.1| von Willebrand factor type A [Desulfovibrio sp. ND132] gi|323460285|gb|EGB16150.1| von Willebrand factor type A [Desulfovibrio desulfuricans ND132] Length = 400 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 62/437 (14%), Positives = 141/437 (32%), Gaps = 97/437 (22%) Query: 27 ALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA--------------SKMVSNL 72 AL + L + G + + + + +++A +A LAG+ ++ V+++ Sbjct: 2 ALLLPVLLGVAGIAVDMGNMYMTHTRLQAAVDAGALAGSLELPYDPDLSKGIVTQAVNDM 61 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT-EIQNIVNSSRISMTH 131 ++ + I K + ++ L + T E + +++ + Sbjct: 62 VETNMEEAVVTEISAGTEIRSVKVTAQAEVRMLLMEVLGMADKTVEASAMAGFNKLEVVF 121 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 + +N S + +D++ L + + + V +P +I +GE Sbjct: 122 VIDN----SGSMKGTPIDLVKQASEELTDLL-IPDGTTPDTKVGLVPFRGKIRLGEAVDG 176 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 E V+ GS++ +N + D +A K+ + Sbjct: 177 YAEGCVNADGSLNTGINEEFMDEYNALPYYYKRYITLDTC-------------------- 216 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + + S + + + T + +K ILT D Sbjct: 217 ---------SDIPTVLPLSKNKSTIIAAIGSQTATGAASGTVISEGIKWGRNILTPDA-- 265 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGE----------------NNNFKSN----VNTI 351 F Q F+K +I LTDG+ NN + + Sbjct: 266 -----PFTQAGSKED--FRKIMIVLTDGDTEDGECGGTYRATYRPNNYWTNAYYGMGVDT 318 Query: 352 KICDK--------------AKENFIKIVTISI-NASPNGQRLLKTCVSS----PEYHYNV 392 C+ AK+ I+I +I ++ L+K SS +++++ Sbjct: 319 AHCNDGGVLNADMLSEAQLAKDAGIEIFSIRFGSSDTTDINLMKEIASSKAGTDDHYFDA 378 Query: 393 VNADSLIHVFQNISQLM 409 + + +F+ I + + Sbjct: 379 PSVYDIPDIFKQIGKQL 395 >gi|27367909|ref|NP_763436.1| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|27359482|gb|AAO08426.1| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 323 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 83/235 (35%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM A D +++A+KN + F++ ++ Sbjct: 87 DLMLVVDLSGSMQQADILQDGDY----------IDRLSAVKNVVTQFIEQ-------RQG 129 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + V + + + LI + T + A D + Sbjct: 130 DRLGLVLFADHAYLQTPLTADRQTVANQLNQTIIGLIGQKTAIGDGLALA-TKTFVDSEA 188 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 Q+ +I L+DG N + + AK+ +KI TI I Sbjct: 189 -----------------PQRVVILLSDGSNTAGTLDPIEAA--NIAKKYGVKIYTIGIGA 229 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +A + + L K + ++ +A L ++Q I+QL Sbjct: 230 GEMEVKQFFMTRKVNTSADLDEKTLTKVATMTGGQYFRARDAQELQTIYQAINQL 284 >gi|197105075|ref|YP_002130452.1| hypothetical protein PHZ_c1612 [Phenylobacterium zucineum HLK1] gi|196478495|gb|ACG78023.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 521 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 69/230 (30%), Gaps = 69/230 (30%) Query: 255 GLIGYTTRVE-KNIEPSWGT--EKVRQYVTRDMDSLILKP------------TDSTPAMK 299 G + V+ + +W + +YV P T ++ Sbjct: 291 GTKFANSYVDDRTSSTNWRVRQGNLTKYVNTKGLGTSKGPNAGCGLRPIIRLTTDFDGLR 350 Query: 300 QAYQILTSDKKRSFFTNFFRQ-GVKIPSLPF-----------QKFIIFLTDGEN------ 341 A L +D + P PF +K ++ +TDGEN Sbjct: 351 DAVDDLVADGSTNIPMGLVWGWHTLAPMAPFPDGVPYLTEKHKKIVVLMTDGENTILYKD 410 Query: 342 --NNFKS--------------------------------NVNTIKICDKAKE--NFIKIV 365 N + +K+C K I+I Sbjct: 411 TPNGSDYSGVGHARQGRVLDPAGRPITESSSQRERTAALDDRLLKLCANMKAPAKDIEIY 470 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I + S +L+TC SS +++Y+V NA + FQ+I+ + S Sbjct: 471 AIRVEVSSGSSSVLQTCASSADHYYDVQNAADMTMAFQSIAGQIAALHLS 520 >gi|149909171|ref|ZP_01897828.1| hypothetical protein PE36_09171 [Moritella sp. PE36] gi|149807695|gb|EDM67641.1| hypothetical protein PE36_09171 [Moritella sp. PE36] Length = 402 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 57/422 (13%), Positives = 142/422 (33%), Gaps = 41/422 (9%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I ++ ++ F + + + L+ ++ + A ++A+LA A + + Sbjct: 3 VIQRQRGAITLTFTFMLPAIVSLLAITVFFAMYSQVVIRAGQAADSAVLACAYQ----QN 58 Query: 74 RLGDRFESISNHAKRALIDDAK-----------RFIKNHIKESLSGYSAVFYNTEIQ-NI 121 G E I ++ + + I + + +A+ + + Sbjct: 59 DTGVVTEGILDYYRPNFVLPELNKSVKLNSNNGCQISAQYRFEPAMVNALPVAIDSDTEV 118 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 V++S+ S + N ++ N + +++ + S E L + I +P+ Sbjct: 119 VSNSQSSAKLVQNVNVNGIQNPVDFSLVLDISGSMTWHLPE--LKKIITDVISDIVPSSN 176 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++ P + V +SG+ ++S+ + +L D Sbjct: 177 QVRFSIVPF---QTGVGVSGA-PWLLSSEASPKCVDGLVYRNGNLDADKTVQSLNYSSDR 232 Query: 242 IDL--LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 +D ++ + + + + +V +YV T S Sbjct: 233 LDFNEVTPGRWLDRCSETSF------ILPLTNNLNRVIRYVESL--DTSGGSTASYQGFI 284 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICDKA 357 + LT Q + S + +I TDG++N +++ + +CD Sbjct: 285 WGVRTLTDQ------WQKEWQVTPVQSSSLTQRLILFTDGDDNRRDYFNDLMSAGLCDVI 338 Query: 358 KEN-FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 +++ I++ I S + + K C ++ N L F++ + + K + Sbjct: 339 QQDLNIQVSFIGFGVSADRIKQFKQCAGRNGSVFDANNTAELADYFEDAININIETKVRI 398 Query: 417 IL 418 +L Sbjct: 399 VL 400 >gi|315498202|ref|YP_004087006.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416214|gb|ADU12855.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 489 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 67/225 (29%), Gaps = 42/225 (18%) Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 Q +TKM L + L + ++ + S Sbjct: 280 YGCVRHQVDSKTKMLVLPDPLTAYTGVLETAQKCPTAIQ--------------PLSNDKT 325 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 V + ++S+ D+ + + T F +K I+ Sbjct: 326 VVTNSIKGLVNSIGSYKPDTFIPGGLHWGVNTLSPPAPFKEGMAYDSKNKEP---KKVIV 382 Query: 335 FLTDGENNNFKSNVNTI------------------------KICDKAKENFIKIVTISIN 370 +TDG N + ++ I C AK I++ I + Sbjct: 383 LMTDGANTLYTNSSGQIVSAATGSPPTISSSLVAPTYTAQDNACKYAKGKNIEVFVIGLG 442 Query: 371 A-SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 P LK+C + +++++ NA+ LI F+ I + + Sbjct: 443 VTDPTALSALKSCATDAQHYFDAQNANDLIEAFEIIGGKLSVVRL 487 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 38/248 (15%), Positives = 83/248 (33%), Gaps = 61/248 (24%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 + N +++F L ++ + + + +++ + A +AA LA A Sbjct: 8 RDRRGNTAVMFGLFFSILIVSMAGAVDYSNVISRRSKAQDALDAATLAVA---------- 57 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + A ++ A+ +K L + N + I Sbjct: 58 ---------VLRPATVEQAQAAVK------LRLDKELGDNPDKVVI-------------G 89 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + + T Y + +Y F+ ++N + V I ++ G + L Sbjct: 90 QFNYDTKTRTYYVTAKGTYK---PFLLGVVNIKEIPYEV--ISETIQAANGT---LELAL 141 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D + SM +N T++ LK A ++++ S K+ V + Sbjct: 142 VLDNTDSMGQILNG--------------SSTRLDVLKTAATNLVNTVMT-SANKDYVKVA 186 Query: 256 LIGYTTRV 263 ++ Y V Sbjct: 187 VVPYADYV 194 >gi|147921050|ref|YP_685140.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] gi|110620536|emb|CAJ35814.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] Length = 1310 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 44/204 (21%), Positives = 80/204 (39%), Gaps = 31/204 (15%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 + V P+ K T + K + + F++S +G++ + T N Sbjct: 887 TYSLMVLKPPLRLGSKLTTDSVAKTSAVSFVESRGDGD------QVGVVSFYTSASLNSA 940 Query: 269 P-SWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + V ++SL TD + +K+A L + K+ + Sbjct: 941 LKQMNSGTNKTTVKNAINSLSASGGTDISSGIKKAIAELDAHKRSTA------------- 987 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSS 385 +++II LTDG + + ++ DKAK I TI + A LK S Sbjct: 988 ---KQYIIVLTDGYSQYPEFDLIEA---DKAKAKGYTIFTIGMGMADE---DTLKKIASK 1038 Query: 386 PEYHYNVVNADSLIHVFQNISQLM 409 PEY+Y V++ + L + +I Q + Sbjct: 1039 PEYYYRVLSPEQLEAAYYDIGQEI 1062 >gi|37676036|ref|NP_936432.1| hypothetical protein VVA0376 [Vibrio vulnificus YJ016] gi|37200576|dbj|BAC96402.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 323 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 85/235 (36%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM ED+ D +++++KN + F++ ++ Sbjct: 87 DLMLVVDLSGSMQ------QEDILQDGDYID----RLSSVKNVVTQFIEQ-------RQG 129 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + V + + + LI + T + A D + Sbjct: 130 DRLGLVLFADHAYLQTPLTADRQTVANQLNQTIIGLIGQKTAIGDGLALA-TKTFVDSEA 188 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 Q+ +I L+DG N + + AK+ +KI TI I Sbjct: 189 -----------------PQRVVILLSDGSNTAGTLDPIEAA--NIAKKYGVKIYTIGIGA 229 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +A + + L K + ++ +A L ++Q I+QL Sbjct: 230 GEMEVKQFFMTRKVNTSADLDEKTLTKIATMTGGQYFRARDAQELQTIYQAINQL 284 >gi|327542784|gb|EGF29248.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 264 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/240 (15%), Positives = 85/240 (35%), Gaps = 36/240 (15%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCA-----------MNSDPEDVNSAPICQDKKRTKMAA 230 R+ + + + LV+D SGSM A ++ P P+ + + + A Sbjct: 43 RVAVSTQSTMDVALVIDRSGSMAYASDETPDPYVNPASAPPGWTYGDPVPPNSRWLDLVA 102 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 NA FL + + L Y++ ++ + + ++ + Sbjct: 103 SVNAFNGFL------VDSPQYEKLCLATYSSTASRDCDLTHTYAEISNELDAISYQFDGG 156 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T ++ +LT F + ++ +TDG +N KS + Sbjct: 157 GTSVGYGLEHGLAVLTDATHARKFA--------------VRVMVLMTDGHHNTGKSPESM 202 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLIHVFQNISQLM 409 + + + + + TI+ + Q + ++ +++ +A L + FQ I++ + Sbjct: 203 MY---HLQNHGVTLFTITFS-DDADQSRMSNLANACGGENFHATDASQLQNAFQKIAKKL 258 >gi|167752252|ref|ZP_02424379.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] gi|167660493|gb|EDS04623.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] Length = 328 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 74/247 (29%), Gaps = 64/247 (25%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 I L +D+S SM QD + ++ A K F+ Sbjct: 81 STSNTEGIDIVLAIDISTSMLA---------------QDLQPDRIQAAKQVAGNFIT--- 122 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + +GL+ + + ++ + R ++ T + A Sbjct: 123 ----DRPGDRIGLVAFAGEAFTQSPLTTDQGTLQTLLGRLRSGVVEDGTAIGNGLATAIN 178 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L +S K II LTDGENN + T + A++ I+ Sbjct: 179 RLRESNAKS------------------KVIILLTDGENNRGEIAPLTAA--EIARDQGIR 218 Query: 364 IVTISI----NAS-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHV 401 + TI + A +++L + ++ + L + Sbjct: 219 VYTIGVGTRGTAPYPTVDFFGNPTVVQAKVQIDEKILGEIADLTGGRYFRATDNAKLQSI 278 Query: 402 FQNISQL 408 + I+QL Sbjct: 279 YDEINQL 285 >gi|327542237|gb|EGF28726.1| BatA aerotolerance operon protein [Rhodopirellula baltica WH47] Length = 345 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 50/244 (20%), Positives = 86/244 (35%), Gaps = 52/244 (21%) Query: 191 FLIELVVDLSGSMHC-AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 IE+V+D SGSM N D + ++ A+KN F+ + L Sbjct: 74 IAIEMVIDRSGSMQAMDFNID-----------GEPVDRLTAVKNVASKFITGGEDLEGRF 122 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD--MDSLILKPTDSTPAMKQAYQILTS 307 D+ +GLI + + P+ V + + + T A+ + + L Sbjct: 123 SDL-VGLITFAAYADAETPPTLDHSFVVSRLNQTEIVSRRDEDGTAIGDAIALSVEKL-- 179 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 RQ K+ S K +I LTDGEN + + I+ + A+ IKI I Sbjct: 180 ------NALDARQERKVQS----KILILLTDGENTAGELDP--IQAAELAETLGIKIYAI 227 Query: 368 S----------INASPNGQRLLKT-------------CVSSPEYHYNVVNADSLIHVFQN 404 + G++ L + ++ + DSL +++ Sbjct: 228 GVGTKGKAPVPVRDPFTGRQRLHYMEVNIDEATLQKVAEITGGKYFRATDTDSLDAIYRE 287 Query: 405 ISQL 408 I QL Sbjct: 288 IDQL 291 >gi|217978613|ref|YP_002362760.1| von Willebrand factor type A [Methylocella silvestris BL2] gi|217503989|gb|ACK51398.1| von Willebrand factor type A [Methylocella silvestris BL2] Length = 325 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 40/259 (15%), Positives = 79/259 (30%), Gaps = 50/259 (19%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELV--VDLSGSMHCAMNSDPEDVNSAPICQ 221 L + +V+ +P ++V +DLSGSM Sbjct: 66 LAWIAWILLVVALAGPRTIAASPAQPASGRDIVFALDLSGSMAAEDFVLDGHA------- 118 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 +++ ALK + + +GL+ + R S+ + V + + Sbjct: 119 ---ASRIDALKRVGAALIKR-------RTGDRIGLVIFAERAYAAAPLSFDVDAVSRTLA 168 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 L+ T + A + LT K S + I+ L+DG N Sbjct: 169 EIPLGLVGHSTAIGEGLGLALKRLTESKAPS------------------RVIVLLSDGAN 210 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINA-----------SPNGQRLLKTCVSSPEYHY 390 + ++ + + A +KI TI + + L + + Sbjct: 211 DAGTTDPTGVA--ELANNLGVKIYTIGLGVVDTQTFNGLGDPVDFLALQRLAEIGGGEAF 268 Query: 391 NVVNADSLIHVFQNISQLM 409 V + L + I +L+ Sbjct: 269 RVRTTEDLAYASAAIERLV 287 >gi|304393172|ref|ZP_07375100.1| Flp pilus assembly protein TadG [Ahrensia sp. R2A130] gi|303294179|gb|EFL88551.1| Flp pilus assembly protein TadG [Ahrensia sp. R2A130] Length = 692 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 77/234 (32%), Gaps = 28/234 (11%) Query: 198 DLSGSMHCAMNSDPEDVNSAPI--CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 D G+ NS+P + + P C + + KN L S+ V MG Sbjct: 470 DDGGNRPSVGNSNPLGMGAGPNSMCSSVSVSDLTDNKNTTQAKLTSMQASGATN--VQMG 527 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 + + R EP + + ++ ++ D K + Sbjct: 528 -VAWGWRTLSPGEPFTEGRPYDAEDNKKIMIIMTDGNNTYYPTNIYGNQYAQDNKSFYGG 586 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKS--------NVNTIKICDKAKENFIKIVTI 367 + +I DGE N + + + + C AK I I +I Sbjct: 587 HGHSVKGRIFDG---------YDGEANPGHNSQTFTKAMDEHLTETCTNAKNAGITIYSI 637 Query: 368 SINASPNG--QRLLKTCVSSP----EYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + + L+ C SS + +++ N +LI F+ I++ + + S Sbjct: 638 AFDVPNGSSVKATLEDCASSDVGGGKLYFDANNNAALIDTFEKIAERLADLRIS 691 >gi|208780564|ref|ZP_03247903.1| von Willebrand factor type A domain protein [Francisella novicida FTG] gi|208743539|gb|EDZ89844.1| von Willebrand factor type A domain protein [Francisella novicida FTG] Length = 333 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 48/235 (20%), Positives = 86/235 (36%), Gaps = 54/235 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + +DLSGSM + + ++ + FLD+ ++ Sbjct: 93 DLIMAIDLSGSMA---------IQDMKKANGQMESRFDLVMRVANQFLDT-------RKG 136 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ V++ + +L T A+ A + L Sbjct: 137 DRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKFPGD 196 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS--- 368 S K +I LTDGENN + ++ + AK+ IKI TI Sbjct: 197 S------------------KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGG 236 Query: 369 ---INASPNGQRL-----------LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 I + GQRL L+ + + ++ N+ L V+++I +L Sbjct: 237 GQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDKL 291 >gi|225028486|ref|ZP_03717678.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] gi|224954191|gb|EEG35400.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] Length = 538 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 40/228 (17%), Positives = 82/228 (35%), Gaps = 52/228 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + K A F+DSI + ++ Sbjct: 226 DIVLTLDISASMDGI--------------------PLDETKKAAAKFVDSI-----LNKN 260 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ Y+ ++ +T + T+ + +AY +L + + Sbjct: 261 SNIGLVSYSDEATSLSGICSNDVFLKNTITSLSSAEN---TNIEDGLSRAYSMLQLGQSK 317 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 +K I+ ++DG K IK +K K+ + I T+ Sbjct: 318 ------------------KKLIVLMSDGLPTLGKDGEELIKYAEKIKDQGVLIYTLGFFQ 359 Query: 370 ---NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 GQ L++ S ++ V +++ L+ F++++ + +KY Sbjct: 360 NTEEYKAEGQYLMEKIASEGCHY-EVSSSEDLVFFFEDVAGQIGGQKY 406 >gi|327541799|gb|EGF28311.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 363 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 62/388 (15%), Positives = 118/388 (30%), Gaps = 69/388 (17%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 ++ A+ + FL+++ F I + H + + S+ +AA A A+ + L R Sbjct: 35 GAMLVLIAIMMFLFLIVVAFSIDIAQMHLARTELRSSTDAAANAAATTLADTLDRNLAIQ 94 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + L F VN A + Sbjct: 95 RGQQIAQA--------NLVNGQ---PLLLADGDFQFGRSDRQVNG---KYAFNAGEAPFN 140 Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 S L F ++ + + I LVVD Sbjct: 141 GVRVNGQRTAGSLSGPVPL-FFGNVTGTSIFEPEAFATATYVER--------DITLVVDR 191 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 SGSM ++ L+ A+ +F D + ++ +GL Y Sbjct: 192 SGSMAG--------------------SRFNDLQAAIRIFTDLLATTPVDEQ---IGLASY 228 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFF 318 R ++++ + + V MD L T + M+ A Q + + Sbjct: 229 NDRASEDVQLTENFAE----VNNAMDRLRTGGFTSISRGMQ-AGQEIALRGR-------- 275 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 P ++ +I +TDG +N + + + I TI+ A + R+ Sbjct: 276 ------PPEFVERTMIVMTDGRHNR---GPEPRVVATDLAADGVTIHTITFGAGADFGRM 326 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNIS 406 H++ N D L +++ I+ Sbjct: 327 QDVARIGGGRHFHATNGDQLRDIYREIA 354 >gi|312794604|ref|YP_004027527.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312181744|gb|ADQ41914.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 900 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 44/227 (19%), Positives = 78/227 (34%), Gaps = 45/227 (19%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + LV+D SGSM +D K+ K+A ++ ++ Sbjct: 399 NKEKEKNIDVMLVLDHSGSMA--------------DTEDAGIPKLEIAKSASAKMVEHLE 444 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAY 302 V G+I + + G ++ V + S+ + T P + +A Sbjct: 445 SSDGV------GVIAFDHNYYWAYKF--GKLVRKEDVIESISSIEVGGGTAIIPPLSEAV 496 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + L K ++ K ++ LTDG + D+AK N I Sbjct: 497 KTLKKSKAKN------------------KLVVLLTDGMGEQSGYEIPA----DEAKRNNI 534 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 KI TI + N L + Y V N L+ VF ++++ Sbjct: 535 KITTIGVGKFVNASVLSWIAAYTSGRFYLVSNPSELVDVFLKETKII 581 >gi|241667423|ref|ZP_04755001.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 333 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 48/269 (17%), Positives = 91/269 (33%), Gaps = 55/269 (20%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 L++I + S I L + + + + +DLSGSM + Sbjct: 60 LKYILSTIWILLI-ISGSGIQWLGKPVSLPQSGRDLMMAIDLSGSMA---------IQDM 109 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 K ++ + FLD+ ++ +GLI + T ++ V+ Sbjct: 110 QKSNGKMESRFDLVMRVANEFLDT-------RQGDRVGLILFGTWAYLQTPLTFDIPTVK 162 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + +L T A+ A + L S K ++ LT Sbjct: 163 KMLDDASIALPGPQTAIGDAIGLAVKKLKRYPGDS------------------KALVLLT 204 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISI-----------------NASPNGQRLLK 380 DGENN + ++ + AK+ IKI TI + + +L+ Sbjct: 205 DGENN--SGALQPLQAAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNTSEDLDTEVLQ 262 Query: 381 TCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + + + N+ L V+++I QL Sbjct: 263 KIATMTGGKFFRAQNSTDLKQVYESIDQL 291 >gi|89255637|ref|YP_512998.1| hypothetical protein FTL_0203 [Francisella tularensis subsp. holarctica LVS] gi|134302613|ref|YP_001122584.1| hypothetical protein FTW_1793 [Francisella tularensis subsp. tularensis WY96-3418] gi|156501587|ref|YP_001427652.1| hypothetical protein FTA_0219 [Francisella tularensis subsp. holarctica FTNF002-00] gi|167009921|ref|ZP_02274852.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella tularensis subsp. holarctica FSC200] gi|224456527|ref|ZP_03665000.1| hypothetical protein FtultM_01598 [Francisella tularensis subsp. tularensis MA00-2987] gi|254367031|ref|ZP_04983067.1| hypothetical protein FTHG_00206 [Francisella tularensis subsp. holarctica 257] gi|290953465|ref|ZP_06558086.1| hypothetical protein FtulhU_03745 [Francisella tularensis subsp. holarctica URFT1] gi|295313263|ref|ZP_06803900.1| hypothetical protein FtulhU_03730 [Francisella tularensis subsp. holarctica URFT1] gi|89143468|emb|CAJ78644.1| hypothetical membrane protein [Francisella tularensis subsp. holarctica LVS] gi|134050390|gb|ABO47461.1| conserved membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. tularensis WY96-3418] gi|134252857|gb|EBA51951.1| hypothetical protein FTHG_00206 [Francisella tularensis subsp. holarctica 257] gi|156252190|gb|ABU60696.1| conserved membrane protein with von Willebrand factor, type A domain [Francisella tularensis subsp. holarctica FTNF002-00] gi|282158589|gb|ADA77980.1| hypothetical protein NE061598_01650 [Francisella tularensis subsp. tularensis NE061598] Length = 333 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 47/235 (20%), Positives = 86/235 (36%), Gaps = 54/235 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + +DLSGSM + + ++ + F+D+ ++ Sbjct: 93 DLIMAIDLSGSMA---------IQDMKKANGQMESRFDLVMRVANQFIDT-------RKG 136 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ V++ + +L T A+ A + L Sbjct: 137 DRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGD 196 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS--- 368 S K +I LTDGENN + ++ + AK+ IKI TI Sbjct: 197 S------------------KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGG 236 Query: 369 ---INASPNGQRL-----------LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 I + GQRL L+ + + ++ N+ L V+++I +L Sbjct: 237 GQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDKL 291 >gi|32475535|ref|NP_868529.1| BatA [Rhodopirellula baltica SH 1] gi|32446077|emb|CAD75906.1| BatA [Rhodopirellula baltica SH 1] Length = 357 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 49/244 (20%), Positives = 84/244 (34%), Gaps = 52/244 (21%) Query: 191 FLIELVVDLSGSMHC-AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 IE+V+D SGSM N D + ++ A+KN F+ + L Sbjct: 86 IAIEMVIDRSGSMQALDFNID-----------GEPVDRLTAVKNVASKFITGGEDLEGRF 134 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD--MDSLILKPTDSTPAMKQAYQILTS 307 D+ +GLI + + P+ V + + + T A+ + + L Sbjct: 135 SDL-VGLITFAAYADAETPPTLDHSFVVSRLNQTEIVSRRDEDGTAIGDAIALSVEKL-- 191 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 RQ K+ S K +I LTDGEN + + + A+ IKI I Sbjct: 192 ------NALDARQERKVQS----KILILLTDGENTAGELDPVQAA--ELAETLGIKIYAI 239 Query: 368 S----------INASPNGQRLLKT-------------CVSSPEYHYNVVNADSLIHVFQN 404 + G++ L + ++ + DSL +++ Sbjct: 240 GVGTTGKAPVPVRDPFTGRQRLHYMEVNIDEATLQKVAEITGGKYFRATDTDSLDAIYRE 299 Query: 405 ISQL 408 I QL Sbjct: 300 IDQL 303 >gi|301058342|ref|ZP_07199375.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447578|gb|EFK11310.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 331 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 75/240 (31%), Gaps = 58/240 (24%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P I L +D SGSM + K T++ A+K + F+ Sbjct: 85 PGVDIMLCLDTSGSMQAL----------DFKVEGKSVTRLEAVKKVVADFIGK------- 127 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 +E +GL+ + + + + V R + T A+ + L Sbjct: 128 RETDRIGLVVFGEEAFTQSPLTIDKGLLLELVNRMKIGMAGDRTAIGSAIAIGGKRLKDL 187 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K +S K +I LTDG NN + + +E IK+ TI Sbjct: 188 KSKS------------------KILILLTDGRNNAGEISPQAAAR--AVREFGIKLYTIG 227 Query: 369 IN----ASPNGQRL----------------LKTCV-SSPEYHYNVVNADSLIHVFQNISQ 407 + A + L L+ + ++ N+ L ++ I + Sbjct: 228 VGGKGPAPFRMKTLFGTRLVPQHVDLDEVTLRNVAKTGGGKYFRAANSQELQEIYDIIDR 287 >gi|315122852|ref|YP_004063341.1| hypothetical protein CKC_05540 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496254|gb|ADR52853.1| hypothetical protein CKC_05540 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 494 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 71/171 (41%), Gaps = 21/171 (12%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 + F+ K S+K NF++I A+ + S LL+G ++ ++ K S+ESA+ A+ G Sbjct: 22 KIHFFNKLLFFSKKGNFAMISAIMIPSLALLLGIVLVTSNYLLHKYSVESASEEALSHG- 80 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 +S + D ++ + +++D +S F E + +S Sbjct: 81 ---MSLICYQNDIE---RDNLAKIILNDL----------IVSLKKNNFTKQEADLVAKNS 124 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 +I +T + N+ + Y+ + + Y L I + + + + Sbjct: 125 KIDITTLINDSTNVK----SYHFYIKSVYKMPLNKITKIFYPKDLTIVTNV 171 >gi|224370036|ref|YP_002604200.1| hypothetical protein HRM2_29490 [Desulfobacterium autotrophicum HRM2] gi|223692753|gb|ACN16036.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 598 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 63/189 (33%), Gaps = 45/189 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ I + +D S SM QD K T++ K ++ + I Sbjct: 85 EQKGVDIMICLDCSRSMLA---------------QDIKPTRLERAKREIIDLMGMIQSD- 128 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQIL 305 GL+ + R + ++ D L + TD A++ A Sbjct: 129 ------RAGLVAFAGRAILQCPLTLDHSAFNLFLNALEPDYLPVGGTDLGGAIETALNGF 182 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + +K II +TDGEN ++I++ KA + +KI Sbjct: 183 EKEVES------------------EKAIILITDGENT----TGDSIEMAKKAADQGVKIF 220 Query: 366 TISINASPN 374 I + + Sbjct: 221 CIGVGSPEG 229 >gi|254875972|ref|ZP_05248682.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254841993|gb|EET20407.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 339 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 48/269 (17%), Positives = 91/269 (33%), Gaps = 55/269 (20%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 L++I + S I L + + + + +DLSGSM + Sbjct: 66 LKYILSTIWILLI-ISGSGIQWLGKPVSLPQSGRDLMMAIDLSGSMA---------IQDM 115 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 K ++ + FLD+ ++ +GLI + T ++ V+ Sbjct: 116 QKSNGKMESRFDLVMRVANEFLDT-------RQGDRVGLILFGTWAYLQTPLTFDIPTVK 168 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + +L T A+ A + L S K ++ LT Sbjct: 169 KMLDDASIALPGPQTAIGDAIGLAVKKLKRYPGDS------------------KALVLLT 210 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISI-----------------NASPNGQRLLK 380 DGENN + ++ + AK+ IKI TI + + +L+ Sbjct: 211 DGENN--SGALQPLQAAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNTSEDLDTEVLQ 268 Query: 381 TCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + + + N+ L V+++I QL Sbjct: 269 KIATMTGGKFFRAQNSTDLKQVYESIDQL 297 >gi|187932172|ref|YP_001892157.1| protein of unknown function containing a von Willebrand factor type A (vWA) domain [Francisella tularensis subsp. mediasiatica FSC147] gi|187713081|gb|ACD31378.1| protein of unknown function containing a von Willebrand factor type A (vWA) domain [Francisella tularensis subsp. mediasiatica FSC147] Length = 333 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 47/235 (20%), Positives = 86/235 (36%), Gaps = 54/235 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + +DLSGSM + + ++ + F+D+ ++ Sbjct: 93 DLIMAIDLSGSMA---------IQDMKKANGQMESRFDLVMRVANQFIDT-------RKG 136 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ V++ + +L T A+ A + L Sbjct: 137 DRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGD 196 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS--- 368 S K +I LTDGENN + ++ + AK+ IKI TI Sbjct: 197 S------------------KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGG 236 Query: 369 ---INASPNGQRL-----------LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 I + GQRL L+ + + ++ N+ L V+++I +L Sbjct: 237 GQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDKL 291 >gi|167626845|ref|YP_001677345.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167596846|gb|ABZ86844.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 333 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 48/269 (17%), Positives = 91/269 (33%), Gaps = 55/269 (20%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 L++I + S I L + + + + +DLSGSM + Sbjct: 60 LKYILSAIWILLI-ISGSGIQWLGKPVSLPQSGRDLMMAIDLSGSMA---------IQDM 109 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 K ++ + FLD+ ++ +GLI + T ++ V+ Sbjct: 110 QKSNGKMESRFDLVMRVANEFLDT-------RQGDRVGLILFGTWAYLQTPLTFDIPTVK 162 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + +L T A+ A + L S K ++ LT Sbjct: 163 KMLDDASIALPGPQTAIGDAIGLAVKKLKRYPGDS------------------KALVLLT 204 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISI-----------------NASPNGQRLLK 380 DGENN + ++ + AK+ IKI TI + + +L+ Sbjct: 205 DGENN--SGALQPLQAAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNTSEDLDTEVLQ 262 Query: 381 TCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + + + N+ L V+++I QL Sbjct: 263 KIATMTGGKFFRAQNSADLKQVYESIDQL 291 >gi|86134839|ref|ZP_01053421.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821702|gb|EAQ42849.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 336 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 46/250 (18%), Positives = 74/250 (29%), Gaps = 66/250 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + I + +D+S SM D K ++ ALK Sbjct: 83 SVSKRTKTNRGIDIVMAIDVSASMLAR---------------DLKPNRLEALKKVA---- 123 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 +D + D +G++ Y + V++ + R + T + Sbjct: 124 --VDFVDRRPND-RIGIVVYAGESFTQTPITSDKTIVKRTINRLQWGQLEGGTAIGMGLG 180 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 L K +S K II LTDG NN + T + AKE Sbjct: 181 SRVNRLKDSKAKS------------------KVIILLTDGVNNAGNIDPTTAT--ELAKE 220 Query: 360 NFIKIVTISI----NA-------SPNG------------QRLLKTCV-SSPEYHYNVVNA 395 IK+ TI I A G + LLK + ++ + Sbjct: 221 LGIKVYTIGIGTNGMADFPWSKDPRTGMLNFRKQQVQIDEDLLKNIAEETQGKYFRATDN 280 Query: 396 DSLIHVFQNI 405 SL ++ I Sbjct: 281 TSLKEIYDEI 290 >gi|209809314|ref|YP_002264852.1| hypothetical protein VSAL_II0524 [Aliivibrio salmonicida LFI1238] gi|208010876|emb|CAQ81278.1| putative membrane protein [Aliivibrio salmonicida LFI1238] Length = 320 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 82/235 (34%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM ED+ + D ++ A+K + F++ ++ Sbjct: 84 DMMLVVDLSGSMS------EEDMKTDSGFVD----RLTAVKRVVSDFIEK-------RKG 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ V++ + R + L+ + T + A + Sbjct: 127 DRLGLVLFGDHAYLQTPLTFDRNTVQEQLNRTVLGLVGQRTAIGEGLGLATKTFIESNA- 185 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 Q+ II L+DG N + AK+N KI T+ I Sbjct: 186 -----------------PQRTIILLSDGANTAGVLDPIEAAQ--LAKDNNAKIYTVGIGA 226 Query: 370 ---------------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 A + L + + ++ NAD L ++Q I +L Sbjct: 227 GEMQVRGFFGNQTVNTARDLDEDTLTKIATMTGGQYFRARNADELAEIYQTIDKL 281 >gi|116329598|ref|YP_799317.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116332487|ref|YP_802204.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116122491|gb|ABJ80384.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116127354|gb|ABJ77446.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 312 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 41/254 (16%), Positives = 83/254 (32%), Gaps = 49/254 (19%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIE---MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 LL +V + + E+ + + +D+SGSM + + PE Sbjct: 53 FLPLLRPIAISLVVVALAGPGKKTTFLPNEKKGVDVMIALDVSGSMSRSRDFLPE----- 107 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 T++ K L F+D +GL+ + + E + Sbjct: 108 --------TRLGVSKKLLRKFIDKRKSD-------RLGLVVFAGAAYLQAPLTGDRESLN 152 Query: 278 QYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + + + + T A+ + L + + RS K I+ + Sbjct: 153 EILGTIEEETVAEQGTAIGDAIILSTYRLRASQARS------------------KVIVLI 194 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR-----LLKTCVSSPEYHYN 391 TDG +N K + T D A+ +KI ++ I L + S+ + Sbjct: 195 TDGVSNTGKIDPVTAT--DLAEHIGVKIYSVGIGKEDGSYEINFEILRELSASTGGKFFR 252 Query: 392 VVNADSLIHVFQNI 405 + + + V +I Sbjct: 253 AEDPEEMKAVLTSI 266 >gi|56707447|ref|YP_169343.1| hypothetical protein FTT_0293 [Francisella tularensis subsp. tularensis SCHU S4] gi|110669918|ref|YP_666475.1| hypothetical protein FTF0293 [Francisella tularensis subsp. tularensis FSC198] gi|115314141|ref|YP_762864.1| hypothetical protein FTH_0198 [Francisella tularensis subsp. holarctica OSU18] gi|254370860|ref|ZP_04986865.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254874284|ref|ZP_05246994.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] gi|56603939|emb|CAG44926.1| hypothetical membrane protein [Francisella tularensis subsp. tularensis SCHU S4] gi|110320251|emb|CAL08309.1| hypothetical membrane protein [Francisella tularensis subsp. tularensis FSC198] gi|115129040|gb|ABI82227.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica OSU18] gi|151569103|gb|EDN34757.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254840283|gb|EET18719.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] Length = 339 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 47/235 (20%), Positives = 86/235 (36%), Gaps = 54/235 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + +DLSGSM + + ++ + F+D+ ++ Sbjct: 99 DLIMAIDLSGSMA---------IQDMKKANGQMESRFDLVMRVANQFIDT-------RKG 142 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ V++ + +L T A+ A + L Sbjct: 143 DRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGD 202 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS--- 368 S K +I LTDGENN + ++ + AK+ IKI TI Sbjct: 203 S------------------KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGG 242 Query: 369 ---INASPNGQRL-----------LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 I + GQRL L+ + + ++ N+ L V+++I +L Sbjct: 243 GQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDKL 297 >gi|254368552|ref|ZP_04984568.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] gi|157121455|gb|EDO65646.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] Length = 339 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 47/235 (20%), Positives = 86/235 (36%), Gaps = 54/235 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + +DLSGSM + + ++ + F+D+ ++ Sbjct: 99 DLIMAIDLSGSMA---------IQDMKKANGQMESRFDLVMRVANQFIDT-------RKG 142 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ V++ + +L T A+ A + L Sbjct: 143 DRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGD 202 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS--- 368 S K +I LTDGENN + ++ + AK+ IKI TI Sbjct: 203 S------------------KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGG 242 Query: 369 ---INASPNGQRL-----------LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 I + GQRL L+ + + ++ N+ L V+++I +L Sbjct: 243 DQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDKL 297 >gi|295691296|ref|YP_003594989.1| TadE family protein [Caulobacter segnis ATCC 21756] gi|295433199|gb|ADG12371.1| TadE family protein [Caulobacter segnis ATCC 21756] Length = 531 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 72/531 (13%), Positives = 152/531 (28%), Gaps = 120/531 (22%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 +SR +F++ ++ +I FAL + +L+ LI + +++ M+ A +AA L Sbjct: 1 MSRLTRFFRRLGRDDRGAIAIQFALLAIPMSILVFALIDLGRISLQRHQMQDALDAATLM 60 Query: 64 GASK---------------MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 A ++ ++ L + + K + +K ++ Sbjct: 61 AARSTAVTDAELESVGDPAFLAEIAGLNLGLSASNASFKAGAGNHIIGTATATVKPIIAN 120 Query: 109 --YSAVFYNTEIQNIVNSSR---ISMTHMANNRLDSSNNT-------------------I 144 + F T ++V SS+ +++ + S T Sbjct: 121 LWTTDDFNLTATSDVVRSSKNLEVAVVLDITGSMSGSRITDLKTGASDLVDIVVKDQQAP 180 Query: 145 FYNMDVMTSYDYRLQ---FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 FY+ + Y + + + + + I + F++ V +SG Sbjct: 181 FYSKVAIVPYSVGVNVGTYADAVRGAVIARTITGVSKTNAAVVASAAHGFIVGDKVTISG 240 Query: 202 SMHCAMNSDPEDVNSAP-------ICQDKKRTKMAALKNAL-------LLFLDSIDLLSH 247 M + +A K + A + S+ Sbjct: 241 VSGPTMLNGNTYNITAASADSFTINANTSNAPKYVSGGVATCDTSTNPGCLNFTFTSASN 300 Query: 248 VKED--------VYMGLIGYTT---------RVEKNIEPSWGT-------------EKVR 277 KE G YT R + R Sbjct: 301 TKETRTLSTCVTERTGTYAYTDIAPSIAPVGRNYPTTKLENSMQPNPCPTATITPLSSDR 360 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG-VKIPSLPFQKFIIFL 336 + +++L + + + + A+ + Q S K ++ + Sbjct: 361 VTLKGQINALSIGGS-TAGQIGFAWGWYMVSPNFGYLWPNATQRPAPYNSKDLVKVVVLM 419 Query: 337 TDGENNNF----------------------------KSNVNTIKICDKAKENFIK--IVT 366 TDG N + T K+CD K+ +K I T Sbjct: 420 TDGAFNTPYCKGVIAKDAGSGSGAVDDHINCVATNGDAFTQTRKLCDAMKDPSLKLTIFT 479 Query: 367 ISINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + N +LK C + ++ Y L F++I+Q + + + Sbjct: 480 VGFDVGGDANAVNMLKYCATDAQHVYFPATGSELKTAFKSIAQEISSLRIA 530 >gi|306823858|ref|ZP_07457232.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309802423|ref|ZP_07696530.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] gi|304552856|gb|EFM40769.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221023|gb|EFO77328.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] Length = 1136 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 59/341 (17%), Positives = 108/341 (31%), Gaps = 77/341 (22%) Query: 106 LSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 + + NIVNS S+T A L + + +Y Sbjct: 532 FTNTATATGTDTDGNIVNSPEASVTIKAIKPLGAPEKHKRIKKNADNTY----------- 580 Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 V A + LV+D+SGSM M S Sbjct: 581 -----TVNVDVTGAASSSTITTTQPVDFTLVLDVSGSMRENMGSV--------------- 620 Query: 226 TKMAALKNALLLFLDSIDLLSH----VKEDVYMGLIGYTTRVEKNIEP-----SWGTEKV 276 TK+ AL++A+ FLD ++ E V +GL+ + K I W T Sbjct: 621 TKLQALQSAVNNFLDEAAKINKGAQSGSEPVRVGLVKFAGNATKKIGNKTYQDKWNTYNY 680 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTS--DKKRSFFTNFFRQGVKIPSLPFQKFII 334 Q V + T T +K LT+ + + + +K +I Sbjct: 681 SQIVKKL--------TADTDGLKNEVNKLTAGGATRADYGFQHAFTVMSEARTEAKKVVI 732 Query: 335 FLTDGENNNFKS-----NVNTIKICDKAKENFIKIVTISI-------NASPNGQRLLKTC 382 F TDG+ + K+ + ++ + K++ + +I + + + + + + Sbjct: 733 FFTDGKPTSEKTFDGKVANDAVEYAKQLKDSGAIVYSIGVFDGANPASTATSENKFMHAV 792 Query: 383 V---------------SSPEYHYNVVNADSLIHVFQNISQL 408 S+ Y+ +A L +F+ I + Sbjct: 793 SSNYPNAANYEDLSEGSNAGYYKTATDASGLNSIFEEIRKS 833 >gi|90417299|ref|ZP_01225225.1| batB protein, putative [marine gamma proteobacterium HTCC2207] gi|90330884|gb|EAS46147.1| batB protein, putative [marine gamma proteobacterium HTCC2207] Length = 330 Score = 64.9 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 80/235 (34%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VD+SGSM + ++ A+K + F+ +E Sbjct: 91 DILLAVDISGSME----------REDMQLSGQTVNRLMAVKAVVGNFVTE-------REG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + ++ + ++ + T A+ + + L + Sbjct: 134 DRLGLILFGEKAYLQTPLTFDRKTMQTLLYEAQLGFAGNGTAIGDAIGLSVKRLQQRPEN 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 + +I LTDG NN + + +K + A +KI TI + Sbjct: 194 ------------------HRVVILLTDGANNAGELDP--LKAAELASSAKVKIYTIGVGA 233 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +A + Q L ++ ++ N + L+ ++Q +++L Sbjct: 234 ETQEAWGLFGKRVTNPSADLDEQTLTAIAEATGGQYFRARNPEELMAIYQELNRL 288 >gi|149176865|ref|ZP_01855475.1| BatA [Planctomyces maris DSM 8797] gi|148844302|gb|EDL58655.1| BatA [Planctomyces maris DSM 8797] Length = 356 Score = 64.9 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 49/241 (20%), Positives = 87/241 (36%), Gaps = 52/241 (21%) Query: 191 FLIELVVDLSGSMHC-AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 IE+VVD SGSM D E V+ ++ A+KN F++ + L Sbjct: 85 IAIEMVVDRSGSMQAMDFKIDGEHVD-----------RLTAIKNVAGKFVEGKEELEGRF 133 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT--RDMDSLILKPTDSTPAMKQAYQILTS 307 D+ +GL+ + + P+ + + + + + T A+ A + L + Sbjct: 134 NDL-VGLMTFAGYADGITPPTLDHPYLVSQLNNIQIVTNRSEDGTAIGDAISLAVEKLNA 192 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 R K II LTDGENN V I+ + A+ IK+ TI Sbjct: 193 LDARRDEKVKS------------KVIILLTDGENNA--GEVEPIQAAELAETLGIKVYTI 238 Query: 368 SINA----------SPNGQRL------------LKTCVS-SPEYHYNVVNADSLIHVFQN 404 + +G+++ L+ + ++ + DSL ++ Sbjct: 239 GVGTKGEAPVPVTDPFSGKQVVQWMPVNIDEATLQKVADLTHGKYFRATDTDSLEKIYHE 298 Query: 405 I 405 I Sbjct: 299 I 299 >gi|301064759|ref|ZP_07205139.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300441134|gb|EFK05519.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 332 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 43/249 (17%), Positives = 81/249 (32%), Gaps = 59/249 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + I L VD+SGSM + ++ ++ +K + F+ Sbjct: 83 DSSGIDIVLAVDVSGSMEALDFTI----------NNEPANRVDVVKKVVFRFIGE----- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + D +GL+ + R + + + + + ++ T A+ + L Sbjct: 128 --RPDDRIGLVAFAGRPYMVSPLTLDHDWLGRRLQTIHPGMVEDGTAIGSAIGSSINRLR 185 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K +S K +I LTDG NN K T + A+ IKI T Sbjct: 186 DQKAKS------------------KVVILLTDGMNNAGKILPVTAA--EAAETLGIKIYT 225 Query: 367 IS------INASPNGQ----------------RLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I + + L K + +Y + DSL ++ Sbjct: 226 IGAGSRGEVPVPITDKFGNQKIVRAKVDIDEATLEKVAQMTGAKYYRATDTDSLKKIYSE 285 Query: 405 ISQLMVHRK 413 I++L ++ Sbjct: 286 INKLETTKR 294 >gi|332291974|ref|YP_004430583.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332170060|gb|AEE19315.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 334 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 43/280 (15%), Positives = 85/280 (30%), Gaps = 70/280 (25%) Query: 157 RLQFIEHLLNQRYNQKIVS----FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 +L+ + +L I+ + + I + +D+S SM Sbjct: 54 KLRPLLFILRLAALSLIIVALARPRNVEVSTKTKTTKGIDIVIAIDVSASMLAK------ 107 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + ++ ALK F+ + + + +GLI Y + Sbjct: 108 ---------DLRPNRLEALKKVAASFI-------NGRPNDRIGLIEYAGESFTKTPITSD 151 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 V + +++I T + L K K Sbjct: 152 KSIVLSALKSIQYNNIIEGGTAIGMGLATGVNRLKDSKA------------------LSK 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS----------------------- 368 II +TDGENN + + + A+E IK+ TI Sbjct: 194 VIILMTDGENNAGQIDPRIAA--ELAQEFGIKVYTIGMGTNGMALSPYARNANGTFVYEN 251 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I + + + L + ++ ++ N + L ++ I +L Sbjct: 252 IQVTIDEELLEEIAATTGGQYFRATNNEKLQEIYDEIDKL 291 >gi|209884898|ref|YP_002288755.1| hypothetical protein OCAR_5764 [Oligotropha carboxidovorans OM5] gi|209873094|gb|ACI92890.1| conserved hypothetical protein [Oligotropha carboxidovorans OM5] Length = 600 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 28/150 (18%), Positives = 57/150 (38%), Gaps = 21/150 (14%) Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 ++ T+ + +Q L + + F+ K P+ +Q +I+ L+DG Sbjct: 456 SKVNAMNPSGNTNQAIGLFWGWQTLNTAN------DPFKAPSKDPNWVYQDYIVILSDGL 509 Query: 341 NNN------------FKSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSP 386 N + +CD K + I I TI +N + ++LK C SS Sbjct: 510 NTQNRWYTCPNAGPCPTIDGREKTLCDNIKADKITIFTIQVNINSKDPESQVLKDCASSG 569 Query: 387 EYHYN-VVNADSLIHVFQNISQLMVHRKYS 415 ++ + +A+ F N+ + + + Sbjct: 570 SGYFQLITSANDTATAFDNVLNKIAKLRIA 599 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 47/309 (15%), Positives = 103/309 (33%), Gaps = 72/309 (23%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + RF + N +IIF L + + L+G + + +M+SA ++A L Sbjct: 6 IKRFASLANGFRKDARGNVAIIFTLVAIPLVALVGAAVDYTRVSSARTAMQSALDSAAL- 64 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + + + D I ++ + ++++ NTE Sbjct: 65 ---MISKDAATMSDSE------------------ITTRARQYV---NSLYTNTETPIQTF 100 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 S+ + NN + + + F+ ++ ++ ++ Sbjct: 101 SAVY-----------TPNNGSGATILLNAGGNMPTYFM-KIVGTNFSTLPINTASTTKWG 148 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL----FL 239 + LV+D +GSM + KM ALK A + Sbjct: 149 SS----RMRVALVLDNTGSMD-------------------QNGKMTALKKAAANATTGLI 185 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVT-RDMDSLILKPTDSTPA 297 + + + DVY+ ++ + V GT V ++ + ++ TD++ Sbjct: 186 KKLSAFNTNEGDVYISVVPFAKDVNV------GTSNVGASWLNWSEWEAAPRILTDNSYP 239 Query: 298 MKQAYQILT 306 +K Y +T Sbjct: 240 IKVKYNNIT 248 >gi|158425008|ref|YP_001526300.1| von Willebrand factor type A domain-containing protein [Azorhizobium caulinodans ORS 571] gi|158331897|dbj|BAF89382.1| von Willebrand factor type A domain protein [Azorhizobium caulinodans ORS 571] Length = 343 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 42/231 (18%), Positives = 82/231 (35%), Gaps = 51/231 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + ++ +K F+ ++ Sbjct: 92 DLMLAVDLSGSMS----------RQDLSYDNIPVDRLTIIKGVADDFIAK-------RKG 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI ++TR ++ VR + + + T A+ A + L + + Sbjct: 135 DRIGLILFSTRAYVQAPLTFDRNVVRDLLRTSSIGMTGQETAIGDAIALAVKTLRTRPQE 194 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ ++ LTDG NN+ + + AK N +KI TI + A Sbjct: 195 ------------------QRVLVLLTDGANNSGMLSPIPAA--EIAKANGVKIYTIGVGA 234 Query: 372 S--PNGQRLL------------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 GQR++ + + ++ +A L ++ +I ++ Sbjct: 235 DAFAVGQRMVNPSFDLDEGALEQIAQMTGGRYFRARDAAGLAAIYNDIDRM 285 >gi|332534652|ref|ZP_08410484.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] gi|332035932|gb|EGI72413.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] Length = 328 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 42/269 (15%), Positives = 87/269 (32%), Gaps = 55/269 (20%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 RL +E ++ + I + I L VDLSGSM Sbjct: 54 RLNPLEWIIWLLLVTAAANPTWLDEPISLPNEGR-DIMLAVDLSGSM----------TEQ 102 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + ++ +K L F++ ++ +GLI + + + V Sbjct: 103 DMAYNGQYVDRLTMVKAVLSDFIEQ-------RQGDRLGLILFGDTAFLQTPLTRDVKTV 155 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + ++ L+ + T A+ + + + + + + ++ L Sbjct: 156 SKMLSEAQIGLVGRATAIGDALGLSVKRFANKDESN------------------RIVVLL 197 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEY------- 388 TDG+N N + A+E IK+ TI + + +P G L SS Sbjct: 198 TDGQNTAGNLNPEDALL--LAREEGIKVYTIGVGSDNPRGFSLFNVGGSSGSNLDESLLK 255 Query: 389 ---------HYNVVNADSLIHVFQNISQL 408 ++ + L ++ + +L Sbjct: 256 KIAEQTGGLYFRAKDVAGLQQIYAELDKL 284 >gi|110634434|ref|YP_674642.1| hypothetical protein Meso_2084 [Mesorhizobium sp. BNC1] gi|110285418|gb|ABG63477.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 549 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 65/192 (33%), Gaps = 54/192 (28%) Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 S +++ V+R T+ + ++L+ + + K P+ Sbjct: 368 TPLSNDYAALKREVSRF---TADGNTNIMEGVAWGMRVLSPREPFT--------EGKEPA 416 Query: 327 LPFQKFIIFLTDGENN-------------NFKS----------------------NVNTI 351 +K +I LTDG NN ++ S N T+ Sbjct: 417 SDVEKIMIVLTDGANNMGLSNNRNHALGSSYSSFGYLVEDRLTRERSQRRVTEEMNRRTL 476 Query: 352 KICDKAK-------ENFIKIVTISINASP-NGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 C+ AK E+ + I TI + LL+ C + P Y+++ + L +F+ Sbjct: 477 AACENAKREYTPSKEDDVTIYTIRLEEPDVATGTLLQECATGPGYYFDSPSRTQLNAIFK 536 Query: 404 NISQLMVHRKYS 415 I + + S Sbjct: 537 EIRDGITKLRLS 548 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 39/260 (15%), Positives = 81/260 (31%), Gaps = 71/260 (27%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R R + NF++I ALS + G + + ++ +++A +A Sbjct: 4 IRRLRSACSALCRDRRGNFAVILALSALPVFGAAGLAVDYTNMSRTRSELQNALDA---- 59 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + R + IS+ R++ A F+ ++ + + N Sbjct: 60 -------AVLAVAQRGDKISDAEARSI---AASFLTGNLSSAYKNMAVE---------RN 100 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + ++ A L V S + F Sbjct: 101 GTSVKLSAEATMPLSFGGLIGRKEATVGASSTADMAF----------------------- 137 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + I LV+D +GSM + K+ A+K A+ +D + Sbjct: 138 -----AYYEIALVLDTTGSM--------------------RGGKLQAMKEAVNGLIDDLS 172 Query: 244 LLSHVKEDVYMGLIGYTTRV 263 KE + L+ + + V Sbjct: 173 SRVTDKERLKFALVPFASFV 192 >gi|115525407|ref|YP_782318.1| hypothetical protein RPE_3406 [Rhodopseudomonas palustris BisA53] gi|115519354|gb|ABJ07338.1| conserved hypothetical protein [Rhodopseudomonas palustris BisA53] Length = 580 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 57/170 (33%), Gaps = 30/170 (17%) Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 E + + V + T+ + +Q L F VK Sbjct: 419 TEMNNNWATMNTTVDGL---FPVGGTNQPIGLVWGWQSLVG------GGPFPTPPVKDEQ 469 Query: 327 LPFQKFIIFLTDGEN-------NNFKSNVN---------TIKICDKAKENFIKIVTISIN 370 +Q I+ ++DG N N + +N + T C K IK+ T+ +N Sbjct: 470 YTYQDIIVLMSDGLNTVDRWYGNGWDTNTSVDNRMYASATTGTCVNVKAAGIKVYTVHVN 529 Query: 371 ASPN-GQRLLKTCVSSPE----YHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + LLK C S + V +A L F +I+ + + + Sbjct: 530 TNGSPESTLLKNCASPADDGGKEFQMVTSASGLNAAFNSIATKLTDLRVA 579 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 42/261 (16%), Positives = 76/261 (29%), Gaps = 60/261 (22%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 L ++ R + IA N +++F ++ + + +G + + +M+SA ++ L Sbjct: 5 LFTQLRKSAARFIADRSGNIAVLFGIACVPLITFVGAAVDYSRAVAARTAMQSALDSTAL 64 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A N K E ++F Sbjct: 65 MVAKDYSLN---------------------------KISASEIDGKAKSIFSALYTNKSA 97 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 NS + N + V + F+ L+N S Sbjct: 98 NSVEVVAVLTP-------NTGKGSTIKVDGTGKVPTDFM-KLVNISQIDIGASSTTTWGS 149 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + LV+D +GSM K+ ALK A L + Sbjct: 150 T------RLRVALVLDTTGSM-------------------NDNGKIGALKTATQNLLTQL 184 Query: 243 DLLSHVKEDVYMGLIGYTTRV 263 + EDVY+ +I ++ V Sbjct: 185 KDAAGKPEDVYVSIIPFSKDV 205 >gi|170750695|ref|YP_001756955.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] gi|170657217|gb|ACB26272.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] Length = 345 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 81/227 (35%), Gaps = 35/227 (15%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I + +DLSGSM + + + ++ A+K F+ + Sbjct: 105 IMIAMDLSGSMERRDFALDGETVN----------RLTAVKRVGTDFIRR-------RAGD 147 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + PS+ T V + + + + T + A + L Sbjct: 148 RIGLVIFADQAYVAAAPSFDTAAVARALDEATIGISGRSTGIGDGLGLALRRL-----DP 202 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--- 369 P K +I L+DG NN ++ + + A+E IK+ TI++ Sbjct: 203 RDAGGEAASGSKPGEKPAKAVILLSDGANNAGQTAPKDVA--ELARELGIKVYTIALGPR 260 Query: 370 -NASPNGQ------RLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 A +G+ L+ +S + V + L+ V I +L Sbjct: 261 DMADADGEQDVVDTETLRDMARASGGEAFRVRTTEDLVRVADAIDRL 307 >gi|194334883|ref|YP_002016743.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] gi|194312701|gb|ACF47096.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] Length = 327 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 87/238 (36%), Gaps = 52/238 (21%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 R+ GE I L +D+S SM + D + +++ A K L F++ Sbjct: 88 RLSPGEAKGIDIMLALDISKSM---LQEDFDGK-----------SRLDAAKTVALQFIE- 132 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQ 300 + +GL+ + + + + + V +D++ T + A+ Sbjct: 133 ------NRRRDRIGLVLFKGKSFTQCPLTLDHDVLSMLVRAASVDAVPESGTATGSAILI 186 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A L + S ++ +I LTDGE+N + + T A Sbjct: 187 AVNRLRA------------------SESPERVLILLTDGEHNAGEVDPVTAA--GIAAGE 226 Query: 361 FIKIV--TISINASPNGQRLLKTCVS--------SPEYHYNVVNADSLIHVFQNISQL 408 ++I T+S+ S +G+ +L + + + +A+SL F I QL Sbjct: 227 GVRIYMATVSVPGSRSGEDMLASARDLSGEVSRITGGRSFRANDANSLNRTFSEIDQL 284 >gi|312126757|ref|YP_003991631.1| hypothetical protein Calhy_0520 [Caldicellulosiruptor hydrothermalis 108] gi|311776776|gb|ADQ06262.1| protein of unknown function DUF1355 [Caldicellulosiruptor hydrothermalis 108] Length = 909 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 49/247 (19%), Positives = 85/247 (34%), Gaps = 46/247 (18%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L N + +P ++++ E+ + LV+D SGSM Sbjct: 379 LGNYSNSVLEKMLPVKMQLKNKEKERNVAVVLVIDHSGSMG--------------ESNLG 424 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K+ K+A +D ++ V G+I + + G K + V + Sbjct: 425 NINKLEIAKSAAAKMIDHLESSDSV------GVIAFDHNFYWASKF--GKLKSKNEVIEN 476 Query: 284 MDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + T P + +A L K + K I+ LTD Sbjct: 477 ISGIQIGGGTAIIPPLTEAVNTLRKSKAK------------------DKVIVLLTD---- 514 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + AK N IKI TI + +S N L + Y V +A +LI VF Sbjct: 515 GYGEEGGYEYPASIAKRNNIKITTIGVGSSINAPILSWMAAYTSGRFYYVKDASNLIDVF 574 Query: 403 QNISQLM 409 ++++ Sbjct: 575 LKEAKII 581 >gi|154488145|ref|ZP_02029262.1| hypothetical protein BIFADO_01716 [Bifidobacterium adolescentis L2-32] gi|154083618|gb|EDN82663.1| hypothetical protein BIFADO_01716 [Bifidobacterium adolescentis L2-32] Length = 835 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 44/280 (15%), Positives = 81/280 (28%), Gaps = 65/280 (23%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 ++ + R + LV+D SGSM+ M+ C ++ A Sbjct: 266 TLNLNVVGKDTRESHETTEKIEVVLVLDTSGSMNYCMDGSQRG------CNKSNPKRLTA 319 Query: 231 LKNALLLFLDSI----DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 LK A F+D+ D + V + + + + T ++ V+R + Sbjct: 320 LKEAATSFIDATETTNDTIQDENSKVRIAIAQFGQTSGVVSSLTSDTAALKSSVSRLSAN 379 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG---ENNN 343 T + M A L +K +IF DG N Sbjct: 380 ---GATPADKGMAAAQTALLRA-----------------RPGAKKVVIFFADGVPTTQNT 419 Query: 344 FKSNV--NTIKICDKAKENFIKIVTISI-------------NASPNGQRLLKTCV----- 383 F + V + + K I +I I + + + Sbjct: 420 FSTRVANDAVTTALAMKSAGTLIYSIGIFEGANPEQQSFGNRENDQANQFMHAVSSNYPN 479 Query: 384 ------------SSPEYHYNVVNADSLIHVFQNISQLMVH 411 S+ Y+ +AD L +F +I + + Sbjct: 480 ATAYNKTNWGTGSNLGYYKATNSADDLTKIFDDIQKEITT 519 >gi|254283762|ref|ZP_04958730.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] gi|219679965|gb|EED36314.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] Length = 325 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 44/233 (18%), Positives = 77/233 (33%), Gaps = 43/233 (18%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + L +DLS SM D P K ++ A++ + F+ Sbjct: 88 TKTETARDVMLAIDLSASM---------DYRDFPGPDGKPVSRFDAVQRVVDQFVA---- 134 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + + R V + T ++ A + Sbjct: 135 ---NREGDRVGLIVFGAKAYLQLPFTRDLNTARALVDLMQVGMAGPQTALGDSIGLAIRA 191 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 S + + +I LTDG N+ S + I + A+ N I+I Sbjct: 192 FESSEVD------------------DRVLILLTDG--NDTASKMTPINAAEIAQLNGIEI 231 Query: 365 VTISI-NASPNGQR------LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 TI I +A G+ L S ++ + +L V+ I L V Sbjct: 232 YTIGIGDAEATGEDRIDFETLASIAERSGGQFFDAQDETALRQVYDRIDALAV 284 >gi|326792960|ref|YP_004310781.1| von Willebrand factor A [Clostridium lentocellum DSM 5427] gi|326543724|gb|ADZ85583.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 903 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 48/255 (18%), Positives = 84/255 (32%), Gaps = 45/255 (17%) Query: 188 RPIFLIELVVDLSGSMH---CAMNSDPEDVNSAPICQDKKR----------TKMAALK-- 232 + + LV+D SGSM N+ + C + R + LK Sbjct: 224 QEATDVVLVLDRSGSMGQGVVDKNNPNAQKCTVLTCTNSNRWHRHNADCYDEEYYILKCT 283 Query: 233 -----NALLLFLDS---IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 F+ + + VK+ Y L + + NI ++ Sbjct: 284 QNHTHTLPGDFIANSCYVSRADKVKDASYTFLDTLQEKEDVNIS----VVTYAGTASKVT 339 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 +S + +S AY +L +D + + S K I+ L+DGE+N Sbjct: 340 NSNLKSGIES------AYNVLGTDGTNTGRGIEIASQILSNSTAPNKMIVVLSDGESNA- 392 Query: 345 KSNVNTIKICDKAKENFIKIVTI--SINASPNGQRLLKTCVSSP-----EYHYNVVNA-D 396 N+ + AK + TI I + NG + L C S Y + + Sbjct: 393 ---GNSRTAANSAKNKGCIVYTIGAGIASGSNGAKELFDCASVDQSTNKAKFYLADDTGN 449 Query: 397 SLIHVFQNISQLMVH 411 +L +F I+ + Sbjct: 450 ALNEIFAEIAGEIQE 464 >gi|319956032|ref|YP_004167295.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418436|gb|ADV45546.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 306 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 50/258 (19%), Positives = 99/258 (38%), Gaps = 41/258 (15%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 +++E L +++ +L E E + I LV+D S SM+ DP Sbjct: 49 RWLEILKWIGIISLLLALASPVLTNEYKEIKKKGRDIMLVIDSSDSMN-QWGFDPG---- 103 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 D ++K +K + F+D +++ +GLI + + ++ + + Sbjct: 104 -----DPNKSKFDVVKEVVGDFIDK-------RKNDRIGLINFASVAFVASPLTFEKDFL 151 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R+ + + K T A+ Q Y IL+ +S K I L Sbjct: 152 RKILQMQEPGIAGKRTAINDALLQTYNILSKSDAKS------------------KIAILL 193 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKTCVSSP-EYHYNVVN 394 TDG +N + + + I+ ++ IK+ TI I + + LK + + + Sbjct: 194 TDGIDNASRISFDEIRR--LISDSDIKLYTIGIGSYRDFDAPYLKALAQAGHGRFFAASD 251 Query: 395 ADSLIHVFQNISQLMVHR 412 SL +++ I +L + Sbjct: 252 RRSLQKIYEAIDRLETSK 269 >gi|330447847|ref|ZP_08311495.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328492038|dbj|GAA05992.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 321 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 44/235 (18%), Positives = 84/235 (35%), Gaps = 54/235 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM S P+ V + ++ A+K+ L F++ ++ Sbjct: 84 DMLLAVDLSGSM-----SIPDMVTK----NGQSIDRLTAVKHVLSDFIEK-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ V Q + R + LI + T + A + + K Sbjct: 128 DRLGLVLFADHAYLQTPLTFDRNTVEQQLDRTVLGLIGQSTAIGEGLGIATKTFINSKA- 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ II L+DG N + + AKE+ +KI T+ + A Sbjct: 187 -----------------PQRVIILLSDGANTSGVIDPLEAA--KLAKESGVKIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + + ++ N L ++ I++L Sbjct: 228 DQMVQKGFFGDRLVNPSQDLDEKTLTEIAKMTGGEYFRARNPQQLEKIYDIINKL 282 >gi|149187170|ref|ZP_01865468.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] gi|148838706|gb|EDL55645.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] Length = 324 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 41/232 (17%), Positives = 77/232 (33%), Gaps = 55/232 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+DLS SM QD ++ A+K+ + F+D ++ Sbjct: 86 DMMLVIDLSYSMS----------QQDMAYQDDYIDRLTAVKHVVSDFVDR-------RKG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ E V+ + + + LI T + A + Sbjct: 129 DRVGLVYFADHAYLQTPLTFDRETVKTQLNQTVLKLIGTQTAIGDGIGLATKTFVDSNA- 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ +I L+DG NN + + AK+ I TI + A Sbjct: 188 -----------------PQRVMILLSDGSNNAGVLDPVQAA--EIAKKYGTTIYTIGVGA 228 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + L+K + ++ NAD L ++ I Sbjct: 229 GEMQVKDFFMTRTVNTAEDLDEKTLIKIANITGGQYFRARNADELATIYDTI 280 >gi|307942638|ref|ZP_07657986.1| hypothetical protein TRICHSKD4_1260 [Roseibium sp. TrichSKD4] gi|307774277|gb|EFO33490.1| hypothetical protein TRICHSKD4_1260 [Roseibium sp. TrichSKD4] Length = 403 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 61/436 (13%), Positives = 148/436 (33%), Gaps = 69/436 (15%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 + + + + + K + +I FAL L I I + + K++ + + L Sbjct: 1 MYTGIKNRIQALKGNIKGSIAIPFALLATLILAAISVGIDMSFAYNKRDQSQLVADEVSL 60 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALID-DAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 + ++ +S + R + DA++F+ K SL G + F NI Sbjct: 61 FAVTTFRKYVAD------GMSKNQARKRAETDARKFLTARTK-SLDGTTEKFSI--KINI 111 Query: 122 VNS----SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 V+ + ++ + ++ F N+D + + F + +Y + + Sbjct: 112 VDREAKVVKANVNISGKHESYMTHAMGFDNIDYTADSESTISFGQG----KYEFIFLVDV 167 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK-KRTKMAALKNALL 236 + I R +++ ++ C +S + R ++ +K+AL Sbjct: 168 SPSMGIGASNRDRQIMQ------RAIGCQFACHEPWYSSVSRAKSAGARLRIDVVKDALK 221 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM---DSLILKPTD 293 + ++ + V D+ GL ++ + + G K ++ + + L T+ Sbjct: 222 SLVTQLEEATEV--DLRTGLYSFSNYLHIQTGLNKGISKFKREANKIAIHREYLRGGGTN 279 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN---------- 343 N + +K P ++ II ++DG N+ Sbjct: 280 FHGVFS--------------DFNGVLRSLK-PKADVKQHIIIISDGVNHLNLRSGTNRHL 324 Query: 344 FKSNVNTI--------KICDKAKENFIKIVTISINASPNGQ------RLLKTCVSSPEYH 389 + N + CD+ K+ ++ V + ++ C +S ++ Sbjct: 325 WNQTPNWRPYNYSFNPRWCDEFKKGEVRTVHTMLVEPDRAHYVRASTSSMRACATSADFF 384 Query: 390 YNVVNADSLIHVFQNI 405 Y+ +A + + + Sbjct: 385 YSANSAAEIDKASKTV 400 >gi|146295744|ref|YP_001179515.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus DSM 8903] gi|145409320|gb|ABP66324.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus DSM 8903] Length = 909 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 49/247 (19%), Positives = 89/247 (36%), Gaps = 46/247 (18%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L N + +P ++++ E+ + LV+D SGSM + + Sbjct: 379 LGNYSNSVLEKMLPVKMQLKNKEKERNVAVVLVIDHSGSMGGS--------------NLR 424 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K+ K+A +D ++ V G+I + + G K + V + Sbjct: 425 NINKLEIAKSAAAKMIDHLESSDSV------GVIAFDHNFYWASKF--GKLKSKNEVIEN 476 Query: 284 MDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + ++ + T P + +A +L K + K I+ LTD Sbjct: 477 ISTIQVGGGTAIIPPLTEAVNLLKKSKAK------------------DKVIVLLTD---- 514 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + AK N IKI TI + +S N L + Y V +A +LI VF Sbjct: 515 GYGEEGGYEYPASIAKRNNIKITTIGVGSSINAPILSWMAAYTSGRFYYVKDASNLIDVF 574 Query: 403 QNISQLM 409 ++++ Sbjct: 575 LKEAKII 581 >gi|326335930|ref|ZP_08202107.1| aerotolerance protein BatA [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691894|gb|EGD33856.1| aerotolerance protein BatA [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 332 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 47/276 (17%), Positives = 81/276 (29%), Gaps = 66/276 (23%) Query: 157 RLQFIEHLLNQR-YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + FI LL + + + I L +D+S SM Sbjct: 56 PILFILRLLALSCLIIALARPQSSSEITKTKTTEGIDIILAIDMSSSMLAK--------- 106 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 D K ++ ALK F++ +G++ Y+ + + Sbjct: 107 ------DLKPNRIEALKRVASQFIEERKSD-------RIGIVVYSGESYTKVPATTDKSI 153 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 V Q + I T + A L K +S K II Sbjct: 154 VLQSLKDIKQGEIEDGTAIGMGLGTAINRLKDSKTKS------------------KVIIL 195 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISI---------------------NASPN 374 +TDG NN + + + AKE I++ TI I Sbjct: 196 MTDGVNNTGVIDPLSAA--ELAKEYGIRVYTIGIGTNGKALSPVAYNPDGSLQYDMVPVE 253 Query: 375 -GQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 ++LL S+ ++ + L ++ I +L Sbjct: 254 IDEKLLGEIAQSTGGKYFRATDNKKLAQIYTEIDKL 289 >gi|90577284|ref|ZP_01233095.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] gi|90440370|gb|EAS65550.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] Length = 321 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 44/235 (18%), Positives = 85/235 (36%), Gaps = 54/235 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM S P+ V + ++ A+K+ L F++ ++ Sbjct: 84 DMLLAVDLSGSM-----SIPDMVTK----NGQSIDRLTAVKHVLSDFIEK-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ + V Q + R + LI + T + A + + K Sbjct: 128 DRLGLVLFADHAYLQTPLTFDRKTVEQQLDRTVLGLIGQSTAIGEGLGIATKTFINSKA- 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ II L+DG N + + AKE+ +KI T+ + A Sbjct: 187 -----------------PQRVIILLSDGANTSGVIDPLEAA--KLAKESGVKIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + + ++ N L ++ I++L Sbjct: 228 DQMVQQGFFGDRIVNPSQDLDEKTLTEIAKMTGGEYFRARNPQQLEKIYDIINKL 282 >gi|254281808|ref|ZP_04956776.1| von Willebrand factor, type A [gamma proteobacterium NOR51-B] gi|219678011|gb|EED34360.1| von Willebrand factor, type A [gamma proteobacterium NOR51-B] Length = 328 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 78/227 (34%), Gaps = 47/227 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L +DLSGSM + +++ A+K A+ S + Sbjct: 92 DLMLAIDLSGSM----------QIEDMQVGARLVSRIEAVK-AIASDFTSQRVGD----- 135 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ T V +++ + T A+ A + L Sbjct: 136 -RVGLILFGTRAYVQAPLTFDTATVTRFIREAQLGFAGEDTAIGDALGLAIKRLRERPAE 194 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 S + +I LTDG++ + A E+ IK+ TI I+ Sbjct: 195 S------------------RVLILLTDGQDTASTVDPMEAT--ALAAESGIKVYTIGISR 234 Query: 371 ---ASPNG-----QRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 A G + LL ++ ++ N L ++ + QL Sbjct: 235 RIGARAGGSGEVDEALLNAIAEATGGEYFRARNPAELQSIYGVVDQL 281 >gi|323135950|ref|ZP_08071033.1| hypothetical protein Met49242DRAFT_0420 [Methylocystis sp. ATCC 49242] gi|322399041|gb|EFY01560.1| hypothetical protein Met49242DRAFT_0420 [Methylocystis sp. ATCC 49242] Length = 432 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 53/450 (11%), Positives = 130/450 (28%), Gaps = 67/450 (14%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + + + + +I F L++M L+ G + +K+++ A +A +L + Sbjct: 3 RSFLQNRRGGVAIFFGLALMPLALMAGGAVDFSQISRQKSALNQAADAGVLTALKEAREQ 62 Query: 72 LSRLGDRFESISNHAK-RALIDDAKRF--IKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 L + ++SI+ +A ++A + + S + + N+ Sbjct: 63 LKQGKPDWQSIAEKQGGKAFTNNASKIGGVSGTGATINLSLSGGVLSGSLNYAANAPTHF 122 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 + N ++ + M D V + A + I + Sbjct: 123 LRIAGLNTINLKG-SASATMSAAQYRDI---------------HFVIDVSASMGIGATKA 166 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 ++ V + + H A +DP N A + ++ ++ A++ L I Sbjct: 167 DQQAMQNSVGCAVACHHAEAADPATDNLAAVRAIGATLRIDVVRKAVMDALAKI----PN 222 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMKQAYQILT 306 + + ++ ++ S + T+ ++ Q +L Sbjct: 223 DGSTRVAIHSFSNSLKTVFPLSTNIAGAISATQSIDLTNENGQGGTNFHYSLNQLNNLLA 282 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG----------ENNNFKSNVNTIKI--- 353 S + + + F DG E N N + Sbjct: 283 SAGNGLTASQPRGFVLLATDAVEDSSLFFYADGVAPPFARQWVEPNFVVGNPSYFAWGLH 342 Query: 354 ---------CDKAKENFIKIVTI-------------------SINASPNGQRLLKTCVSS 385 C K ++T+ + + C S+ Sbjct: 343 YVQAPDAANCSAIKAKGYTMMTLETEYLIPDGVYNPTFDAVRGDMGPAMTKS-MTDCASA 401 Query: 386 PEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 P+Y+++ + + Q + V+ + Sbjct: 402 PDYYFHAESPQEIDRAVQTMVSKTVNLSLT 431 >gi|196231436|ref|ZP_03130294.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196224289|gb|EDY18801.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 341 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 44/255 (17%), Positives = 79/255 (30%), Gaps = 60/255 (23%) Query: 178 PALLRIEMGERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 P L R + I L +D+SGSM + ++ +++ +K Sbjct: 80 PQLGRTFSQVQASGIDIMLALDVSGSM----------IAEDFTIGGERASRVDVVKQVTQ 129 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 F++ +G+I + R + + Q + R L+ T Sbjct: 130 KFIE----ARPND---RIGMIAFAARPYLVSPLTLDHGWLIQNLDRVKLGLVEDGTAIGS 182 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ L K + ++ LTDG+NN K + T + Sbjct: 183 AIASCTTRLIERKDSK-----------------SRIVVLLTDGDNNAGKVSPLTAA--EA 223 Query: 357 AKENFIKIVTI-----SINASPNGQRL-----------------LKTCVS-SPEYHYNVV 393 A +K+ TI P G+ + LK + Y Sbjct: 224 ASALGVKVYTIGAGTKGFAPMPVGRDVFGRKVYQNVKVDVDEDTLKKIADMTKAKFYRAT 283 Query: 394 NADSLIHVFQNISQL 408 + SL +++ I QL Sbjct: 284 DTKSLTQIYEEIDQL 298 >gi|269104787|ref|ZP_06157483.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] gi|268161427|gb|EEZ39924.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] Length = 321 Score = 63.0 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 79/235 (33%), Gaps = 54/235 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + + ++ A+K+ L F++ ++ Sbjct: 84 DMMLAVDLSGSMA---------IKDMQTQSGQSIDRLTAIKHVLSNFIEK-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ V Q + R + L+ + T + A + K Sbjct: 128 DRLGLVLFGDHAYLQTPLTFDRHTVEQQLDRTVLGLVGQSTAIGEGLGIATKTFIKSKA- 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ II L+DG N + AKE+ + I T+ I A Sbjct: 187 -----------------PQRVIILLSDGANTAGVIDPLEAA--KLAKESGVTIYTVGIGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L K + ++ N L ++Q I+QL Sbjct: 228 DEMLQRSIFGVQKVNPSQDLDEKTLTKIAQMTGGKYFRARNPQELDKIYQIINQL 282 >gi|32474888|ref|NP_867882.1| hypothetical protein RB7557 [Rhodopirellula baltica SH 1] gi|32445428|emb|CAD75429.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 327 Score = 63.0 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 62/386 (16%), Positives = 118/386 (30%), Gaps = 69/386 (17%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 ++ A+ + FL+++ F I + H + + S+ +AA A A+ + L R Sbjct: 1 MLVLIAIMMFLFLIVVAFSIDIAQMHLARTELRSSTDAAANAAATTLADTLDRNLAIQRG 60 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 + L F VN A + Sbjct: 61 QQIAQA--------NLVNGQ---PLLLADGDFQFGRSDRQVNG---KYAFNAGEAPFNGV 106 Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 S L F ++ + + I LVVD SG Sbjct: 107 RVNGQRTTGSLSGPVPL-FFGNVTGTSIFEPEAFATATYVER--------DITLVVDRSG 157 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 SM ++ L+ A+ +F D + ++ +GL Y Sbjct: 158 SMAG--------------------SRFNDLQAAIRIFTDLLATTPVDEQ---IGLASYND 194 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 R ++++ + + V MD L T + M+ A Q + + Sbjct: 195 RASEDVQLTENFAE----VNNAMDRLRTGGFTSISRGMQ-AGQEIALRGR---------- 239 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 P ++ +I +TDG +N + + + I TI+ A + R+ Sbjct: 240 ----PPEFVERTMIVMTDGRHNR---GPEPRVVATDLAADGVTIHTITFGAGADFGRMQD 292 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNIS 406 H++ N D L +++ I+ Sbjct: 293 VARIGGGRHFHATNGDQLRDIYREIA 318 >gi|307825379|ref|ZP_07655598.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] gi|307733554|gb|EFO04412.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] Length = 326 Score = 63.0 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 83/235 (35%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + + ++ A K F++ + Sbjct: 90 DLMLAVDLSGSME----------EQDFVINKRSVDRLTAAKMVAADFINR-------RVG 132 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ + V + + L T A+ A + L S++ Sbjct: 133 DRVGLILFGTQAYLQTPLTFDRKTVMTLLNEAVIGLAGDNTAIGDAIGLAVKRLKSEQVN 192 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG N V+ +K + A N +KI TI I A Sbjct: 193 S------------------RVLVLMTDGANTA--GEVSPLKAAELAAANHLKIYTIGIGA 232 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L+K S+ +Y N D L +++ + +L Sbjct: 233 DEMIVRSFFGNRKINPSVDLDEKTLIKIAESTGGQYYRARNTDELNNIYMRLDEL 287 >gi|315498201|ref|YP_004087005.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416213|gb|ADU12854.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 570 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 65/178 (36%), Gaps = 19/178 (10%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ + +Y T + + + VR + + T+ T ++ ++ Sbjct: 404 IASNTDTLYPAAKCATNNLLPVMGLTTDIAAVRAHAQKL---TPAGNTNITIGVQWGMEL 460 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN-------NNFKSNVNTIKICDKA 357 L + F T K++I +TDGEN + N T+ C A Sbjct: 461 L--SPELPFNT-----AKPYSDKTNYKYMIVITDGENTQNRWSTSASTINARTLLACQAA 513 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 K+ I + TI + +LK+C S PEY Y+V + L + + + + Sbjct: 514 KDLGITVYTIRVM--EGNSDMLKSCASRPEYFYDVTASSQLTSTLAKVFYSIQSTRLT 569 >gi|83951473|ref|ZP_00960205.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] gi|83836479|gb|EAP75776.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] Length = 550 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 12/76 (15%), Positives = 31/76 (40%), Gaps = 3/76 (3%) Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEYHYNVVNADSLI 399 + + + C K + + +I G +++LK C SS +++ + + Sbjct: 475 DGSEKDTRMKASCTATKNEGVVVFSIGFEIDQGGTAEQVLKNCASSENHYFRAEGIN-IN 533 Query: 400 HVFQNISQLMVHRKYS 415 F I+ +V+ + + Sbjct: 534 DAFSAIASNVVNLRLT 549 >gi|89072369|ref|ZP_01158948.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] gi|89051901|gb|EAR57353.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] Length = 321 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 84/235 (35%), Gaps = 54/235 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM S P+ V + ++ A+K+ L F++ ++ Sbjct: 84 DMLLAVDLSGSM-----SIPDMVTK----NGQSVDRLTAVKHVLSDFIEK-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ + V + + R + LI + T + A + + K Sbjct: 128 DRLGLVLFADHAYLQTPLTFDRKTVEKQLDRTVLGLIGQSTAIGEGLGIATKTFINSKA- 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ II L+DG N + + AKE+ +KI T+ + A Sbjct: 187 -----------------PQRVIILLSDGANTSGVIDPLEAA--KLAKESGVKIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ N L ++ I++L Sbjct: 228 DQMVQQGFFGDRIVNPSQDLDEKTLTDIAKMTGGEYFRARNPQQLEKIYDIINKL 282 >gi|146307954|ref|YP_001188419.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] gi|145576155|gb|ABP85687.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] Length = 334 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 75/226 (33%), Gaps = 53/226 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM ++ +++ +K L F++ + Sbjct: 91 DLLLAVDVSGSMA----------YEDMHWDEQPISRLELVKRLLGDFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ M + K T A+ A + L + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRTWLDEAMIGIAGKNTAIGDAIGLAVKRLRQRPAQ 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG NN + + A E ++I I I A Sbjct: 194 S------------------RVLVLITDGANNGGEIDPMVAAQ--LAAEEGVRIYAIGIGA 233 Query: 372 SPN---------------GQRLLKTCV-SSPEYHYNVVNADSLIHV 401 P + L+ ++ ++ N L + Sbjct: 234 DPRQSGVLGAFGFSALDLDETSLRAIAEATGGEYFRARNQAELTQI 279 >gi|109009638|ref|XP_001105446.1| PREDICTED: epithelial chloride channel protein-like [Macaca mulatta] Length = 829 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 60/374 (16%), Positives = 127/374 (33%), Gaps = 73/374 (19%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + ++N++E+ + + G + +++ + K L + FI N Sbjct: 170 VDQPFYISRRNTIEATRCSTHITGVNMVLNECQGGSCIQRPCRRNPKTRLYEAKCTFIPN 229 Query: 101 ---HIKESLS-GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 KES+ + F ++ N + N+ +++ + + Sbjct: 230 RSQTAKESIMFMQNLDFVTEFCTEKTHNKEA-----PNLQNKMCNHRSTWDVIMSSEDFQ 284 Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 L + + + S + + R+ + LV+D SGSM Sbjct: 285 HLSPMTEINSPP--HPTFSLLQSKQRV---------VCLVLDKSGSM------------- 320 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EK 275 ++ + +M A L + I+ S V G++ + + E + E Sbjct: 321 ---NREDRLFRMNQA--AELYLIQIIEKGSLV------GMVTFDSSAEIQNNLTKIIDEN 369 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 Q +T ++ T +K +Q ++ + + + II Sbjct: 370 TYQKITANLPQKPSGGTSICGGLKAGFQAISQSNQSTSGSE----------------IIL 413 Query: 336 LTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNV 392 LTDGE+N S C ++ K++ I TI++ + L L ++ Y Sbjct: 414 LTDGEDNQMSS-------CFEEVKQSGAIIHTIAL-GPSADRELETLSN-MTRGRRFYAH 464 Query: 393 VNADSLIHVFQNIS 406 + + LI F IS Sbjct: 465 KDINGLIDAFSRIS 478 >gi|162147499|ref|YP_001601960.1| hypothetical protein GDI_1715 [Gluconacetobacter diazotrophicus PAl 5] gi|161786076|emb|CAP55658.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 571 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 31/80 (38%), Gaps = 5/80 (6%) Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISI-----NASPNGQRLLKTCVSSPEYHYNVVNA 395 N + N +CD K + I I I A Q +L+ C S P +Y+ A Sbjct: 491 NILPEMNSLVSTVCDNIKNSGITIYVILYTHEGEEADATTQAMLQNCASKPGNYYDAPTA 550 Query: 396 DSLIHVFQNISQLMVHRKYS 415 S+ F ++ + + S Sbjct: 551 ASMKQAFSDLGGQLSALRIS 570 >gi|88801581|ref|ZP_01117109.1| batA protein [Polaribacter irgensii 23-P] gi|88782239|gb|EAR13416.1| batA protein [Polaribacter irgensii 23-P] Length = 334 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 43/249 (17%), Positives = 75/249 (30%), Gaps = 66/249 (26%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + I + +D+S SM D K ++ ALK Sbjct: 82 VSKKTKTNSGIDIIMAIDVSASMLAR---------------DLKPNRLEALKKVA----- 121 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ID + D +G++ Y + V++ ++ + T + Sbjct: 122 -IDFVDRRPND-RIGIVVYAGESFTQTPITSDKNIVKRTISELQWGQLDGGTAIGMGLGS 179 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 L S K II LTDG NN + T + A+E Sbjct: 180 GVNRL------------------KESTAKSKVIILLTDGVNNAGNIDPRTAT--ELAREL 219 Query: 361 FIKIVTISI----NA-------SPNGQ------------RLLKTCVSS-PEYHYNVVNAD 396 IK+ TI I A G+ +LL+ ++ ++ + Sbjct: 220 EIKVYTIGIGTNGMADFPWSKDPRTGKLNFRKQQVEIDEKLLQEIATATDGKYFRATDNQ 279 Query: 397 SLIHVFQNI 405 SL ++ I Sbjct: 280 SLKEIYDEI 288 >gi|126341666|ref|XP_001379908.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2347 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 58/344 (16%), Positives = 114/344 (33%), Gaps = 55/344 (15%) Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIVNSSR 126 D + +SN + L A +F+ ++ S + +++ + R Sbjct: 693 DAIDRMSNIDRETLTGSALKFVSDYFHPSKGARPGVRKFLILITDGESQDPVKDPAMALR 752 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + + + +N T + + ++ + L + + P R E Sbjct: 753 QDGVIIYSVGVYGANETQLVEISGKPEMIFYVETFDILKHIEDDLVFGICNP---REECK 809 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + I V+D SGS+ + K + + Sbjct: 810 QIEVLDIVFVIDSSGSID---------------YNEYNIMK--------EFMIKLVKKAD 846 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 K+ V G + Y+ + I R V + + K D+ A A+ Sbjct: 847 VAKDRVRFGALKYS--YDPTILFYLDEFDTRSKVISLLQNDSPKGGDTYTAKALAFSEHM 904 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA-KENFIKIV 365 + R N K+P + +I +TDGE + N + KA ++ I I+ Sbjct: 905 FTEARGSRIN-----QKVPQV-----LIVITDGE----SHDANQLNATAKALRDKGILIL 950 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + I A N + LL S + ++ V L +FQN+S + Sbjct: 951 AVGI-AGANTEELL-AMAGSTDKYFFVETFGGLKGIFQNVSDSI 992 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 38/276 (13%), Positives = 94/276 (34%), Gaps = 57/276 (20%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 + +N T + Y + + L + + + + E + + + Sbjct: 578 VKEANRTQLLEIAGTEKRVYYVYNFDSLKDIKN-----QVVQGICSKEACKDVKADVMFL 632 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 VD S S+ + KM K + ++ + ++ V +G+ Sbjct: 633 VDSSRSIGL-----------------ENFGKM---KTFMKNLVNKSQI---GEDQVQVGI 669 Query: 257 IGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 + ++ ++ + + W ++ + R S I + T + A+K F Sbjct: 670 VQFSDVNKEEFQLNRYWTQHEIFDAIDRM--SNIDRETLTGSALK--------------F 713 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 + + K +KF+I +TDGE+ + + +++ + I ++ + + Sbjct: 714 VSDYFHPSKGARPGVRKFLILITDGESQDPVKDPAM-----ALRQDGVIIYSVGVYGANE 768 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 Q L PE + V + ++I +V Sbjct: 769 TQ--LVEISGKPEMIFYVET----FDILKHIEDDLV 798 >gi|209545606|ref|YP_002277835.1| hypothetical protein Gdia_3496 [Gluconacetobacter diazotrophicus PAl 5] gi|209533283|gb|ACI53220.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 568 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 31/80 (38%), Gaps = 5/80 (6%) Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISI-----NASPNGQRLLKTCVSSPEYHYNVVNA 395 N + N +CD K + I I I A Q +L+ C S P +Y+ A Sbjct: 488 NILPEMNSLVSTVCDNIKNSGITIYVILYTHEGEEADATTQAMLQNCASKPGNYYDAPTA 547 Query: 396 DSLIHVFQNISQLMVHRKYS 415 S+ F ++ + + S Sbjct: 548 ASMKQAFSDLGGQLSALRIS 567 >gi|315649824|ref|ZP_07902907.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315274798|gb|EFU38179.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 1316 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 48/227 (21%), Positives = 87/227 (38%), Gaps = 53/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L++D SGSM KM + N+ F+D +DL H Sbjct: 69 DVILIIDRSGSM-------------------NDENKMQSAINSAKGFIDLMDLSKH---- 105 Query: 252 VYMGLIGYTT-RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G++ Y++ + S E V+ YV + T + A+K+A ++L + Sbjct: 106 -KVGIVDYSSANNISSFPLSTDKEAVKNYVNGLRAN---GGTATGDAIKKARELLVNH-- 159 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV--NTIKICDKAKENFIKIVTIS 368 Q I+ LTDG+ N + ++AK+ I TI+ Sbjct: 160 ---------------RPDAQPVIVLLTDGDATEPNGNAYNYALTNSNEAKQEGIVFYTIA 204 Query: 369 I-----NASPNGQR-LLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + N +G LLK ++ +H+ V+ + L ++ I Q + Sbjct: 205 LLNTNANPDTSGPNLLLKQMATTSHHHHFVLGSVGLGDIYAAIVQEI 251 >gi|146298482|ref|YP_001193073.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146152900|gb|ABQ03754.1| BatA-like protein [Flavobacterium johnsoniae UW101] Length = 334 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 47/263 (17%), Positives = 86/263 (32%), Gaps = 66/263 (25%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + + I + +D+SGSM D K +M ALK Sbjct: 74 LARPRTVDISNQTKTTKGIDIVMAIDVSGSMLAK---------------DLKPNRMEALK 118 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKP 291 F++ + + +GL+ Y + + + + + D+++ Sbjct: 119 RVAADFVEE-------RPNDRIGLVLYASEAYTKTPVTSDKPIILEAIKGIRYDTVLQDG 171 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A L K +S + II LTDG NN T Sbjct: 172 TGIGMGLATAVNRLKDSKAKS------------------RVIILLTDGVNNAGFIEPETA 213 Query: 352 KICDKAKENFIKIVTISI-----------NASPNG-----------QRLLKTCV-SSPEY 388 D AK+ IK+ TI + A G +RL+K+ + Sbjct: 214 A--DIAKQYGIKVYTIGLGTNGMAESPYAYAPNGGFLFKMQKVEIDERLMKSIAKKTDGT 271 Query: 389 HYNVVNADSLIHVFQNISQLMVH 411 ++ + D L ++ +I++L Sbjct: 272 YFRATSNDKLAEIYNSINKLETT 294 >gi|87121300|ref|ZP_01077190.1| batB protein, putative [Marinomonas sp. MED121] gi|86163457|gb|EAQ64732.1| batB protein, putative [Marinomonas sp. MED121] Length = 333 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 35/277 (12%), Positives = 83/277 (29%), Gaps = 56/277 (20%) Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 ++ + + ++ + I + L + + + +DLSGSM Sbjct: 48 SVKKQSFSKHIIRLFLWISWLLLVVAIARPV-WLGEPKSVTPSGRDLLIALDLSGSM--- 103 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 + + ++ A K L F+ ++ +G+I + ++ Sbjct: 104 -------QTADMKINQQAANRLDAAKQVLNRFITE-------RQGDRIGIIVFGSKAYLQ 149 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 S+ + + Q V + T A+ + L + Sbjct: 150 APLSYDLDTIAQLVNETQIGFAGENTAIGDAIGLGIKRLANIDAD--------------- 194 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN------------ 374 ++ +I +TDG N + + A + +KI TI I A Sbjct: 195 ---KRVMILMTDGANTAGRVKPDQAAQ--FAAKQGVKIHTIGIGAEQMVSQGFFGPRVIN 249 Query: 375 -----GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 + LL+ + ++ + L ++ + Sbjct: 250 PSTDLDEELLQKVADLTQGQYFRAKSTQELASIYATL 286 >gi|330504126|ref|YP_004380995.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] gi|328918412|gb|AEB59243.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] Length = 334 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 67/182 (36%), Gaps = 37/182 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM + D+ +++ +K L F++ + Sbjct: 91 DLLLAVDVSGSMD----------YADMQWDDEPISRLELVKRLLGDFIE-------GRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + + K T A+ A + L + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRTWLDEALIGIAGKNTAIGDAIGLAVKRLRQRPAQ 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG NN + + A + ++I TI I A Sbjct: 194 S------------------RVLVLITDGANNGGEIDPMVAAQ--LAADEGVRIYTIGIGA 233 Query: 372 SP 373 P Sbjct: 234 DP 235 >gi|332716075|ref|YP_004443541.1| hypothetical protein AGROH133_11102 [Agrobacterium sp. H13-3] gi|325062760|gb|ADY66450.1| hypothetical protein AGROH133_11102 [Agrobacterium sp. H13-3] Length = 429 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 72/437 (16%), Positives = 134/437 (30%), Gaps = 72/437 (16%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K + NF I+ AL ++ G + + K ++ A ++A LA + M ++ Sbjct: 2 KNFWQEKSGNFGILTALLMVPLCGAAGVALDITRGMSVKADLQQAADSAALAAVADMSAS 61 Query: 72 LSRL----GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + GD + N RA D +R ++ S+ N ++ + + Sbjct: 62 VQAAKKMSGDGVIPVGNEEARAFFDGNQRGDADYTITSVDVSVIKHGNVVESSVSFKASV 121 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE--HLLNQRYNQKIVSFIPALLR--- 182 S T L F ++ + Y + +LL V P + Sbjct: 122 STT------LSGLLGKDFVSVAGTATAKYETETFSDFYLLLDNTPSMGVGATPTDVATLV 175 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 G++ F +V D +DP T++ + A +D+ Sbjct: 176 ANTGDKCAFACHIVKD--------GVADPNSYYFKAKKLGVT-TRIDVVAKATASLMDTA 226 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA--MKQ 300 M + + R E V L + M Sbjct: 227 KSTRKSSNQYRMAVYTFGERAEDTKLLE---------VVSLTSDLDAAKKKAGEINLMSI 277 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIP-------SLPFQKFIIFLTDGENNNFKSNVNTIKI 353 YQ +D++ F + G K+ S K I F++DG +++K + T K+ Sbjct: 278 PYQGYNNDQQTDFDRALIQIGDKVGSSGTGASSANPDKVIFFVSDGVGDSYKPSSCTKKL 337 Query: 354 -------------CDKAKENFIKI---VTISINASPNGQ-------------RLLKTCVS 384 C K KE +I T + N +++C Sbjct: 338 TGGRCQEPIDIKDCTKLKEKGFRIAVLYTTYLPLPTNDWYNSWIKPFQAEIGSRMQSCA- 396 Query: 385 SPEYHYNVVNADSLIHV 401 SP ++ V + + Sbjct: 397 SPGLYFEVSPSQGISDA 413 >gi|307720603|ref|YP_003891743.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978696|gb|ADN08731.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 310 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 45/254 (17%), Positives = 86/254 (33%), Gaps = 39/254 (15%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNS 216 + + L I++ + + +P I L++D SGSM Sbjct: 50 KLLFFLKWLTIIMMILALMSPVKDEPYELKPKHGHEIALILDASGSM----------KER 99 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + ++ +K+ + F+ + MGL+ + + ++ + Sbjct: 100 GFDPVNPAASRFDVVKSIVKDFISQ-RTNDN------MGLVVFGSYSFIASPLTYDKHIL 152 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + V++ + K T A+ Q +L K +S K I L Sbjct: 153 SRIVSQLEVGMAGKYTALYEALAQGVNLLKMSKAKS------------------KVAILL 194 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR--LLKTCVSSPEYHYNVVN 394 TDG + + + D AK+ +K+ I I R LLK + + N Sbjct: 195 TDGYSTAGADKIPLDVVLDMAKKEGVKVYPIGIGGPDEYNRAVLLKIAKETGGVAFGASN 254 Query: 395 ADSLIHVFQNISQL 408 A L V++ I +L Sbjct: 255 ASQLKEVYKKIDEL 268 >gi|310829014|ref|YP_003961371.1| hypothetical protein ELI_3449 [Eubacterium limosum KIST612] gi|308740748|gb|ADO38408.1| hypothetical protein ELI_3449 [Eubacterium limosum KIST612] Length = 684 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/167 (22%), Positives = 64/167 (38%), Gaps = 35/167 (20%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + LV+D SGSM ++TKM LK+ F+ Sbjct: 62 KASSSAVGAADVVLVIDRSGSMG-------------ERYDGSRQTKMEVLKDTAKDFITQ 108 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSL-ILKPTDSTPAMK 299 + S + ++ Y + + NI S T++ Q + R +D L T S ++ Sbjct: 109 LSAQSPASQV---SVVSYASDSKTNIGLTSLDTQENIQSLNRAIDKLWASGATRSDLGLE 165 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 AY +L + + ++F+IFLTDGE N++ Sbjct: 166 DAYSVLGAADSGN-----------------KQFVIFLTDGEPNSYSG 195 >gi|225377140|ref|ZP_03754361.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] gi|225211045|gb|EEG93399.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] Length = 1406 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 70/195 (35%), Gaps = 27/195 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY-----TTRVEKNIEPSWGTEKVRQY- 279 +K++ALK+A F+ I S +G+ + + + G KV + Sbjct: 822 SKISALKDAASGFVTGISDTSPDS---LVGIATFYGIGNGWNSSTEGKLNHGLSKVNKNE 878 Query: 280 -VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T ++ AY L Q + + +K++I +D Sbjct: 879 MLKSVNALFADGGTSPQKGLEHAYSEL--------------QKAEDGN---KKYVILFSD 921 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 GE ++ + T K KE ++T+ + + L V+S + A+ L Sbjct: 922 GEPSDSNDKMETEASAVKLKEAGYTVITVGLGLNNETATWLGEKVASAGCAFTADTAEEL 981 Query: 399 IHVFQNISQLMVHRK 413 +FQNI + + Sbjct: 982 NKIFQNIQSTITQSR 996 >gi|284046349|ref|YP_003396689.1| von Willebrand factor A [Conexibacter woesei DSM 14684] gi|283950570|gb|ADB53314.1| von Willebrand factor type A [Conexibacter woesei DSM 14684] Length = 319 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 80/240 (33%), Gaps = 59/240 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV D+SGSM D + +M A K A F+D + Sbjct: 87 SIALVTDVSGSMLA---------------TDVQPNRMIAAKRAARRFVDEV------PRT 125 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V +G+I + P+ V + R S T + A+ A ++L + Sbjct: 126 VNLGVISFNNTATVLQSPTRNRSDVLTAIDRLAVS---GGTATGEAIATATEMLRNQPGE 182 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 + G + PS I+ ++DG + N + + +A+ I I T++ Sbjct: 183 N--------GRRPPSA-----IVLISDGTSTNGRDPIEAAA---EARRLRIPIYTVAFGT 226 Query: 370 -----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 P+ L + + + +AD L VF+ + + R Sbjct: 227 DQGTITVPGRDGVERTERVPPDPTALAQIAEMTGGETFTADSADRLDTVFERLGSQLGTR 286 >gi|152995759|ref|YP_001340594.1| von Willebrand factor type A [Marinomonas sp. MWYL1] gi|150836683|gb|ABR70659.1| von Willebrand factor type A [Marinomonas sp. MWYL1] Length = 342 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 74/232 (31%), Gaps = 55/232 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + +DLSGSM + + ++ A K+ L F+ + Sbjct: 89 DLLIALDLSGSM----------QVTDMALNGQPANRLEAAKSVLSDFIQE-------RRG 131 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+I + ++ S+ T+ + Q V + T A+ + L Sbjct: 132 DRIGIIVFGSKAYLQAPLSFDTKTINQLVQEAQIGFAGEQTAIGDAIGLGIKRLEDKPSD 191 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +K +I +TDG N + A +KI TI I A Sbjct: 192 ------------------KKVLILMTDGANTAGRVQPQQAAT--FAASQNVKIHTIGIGA 231 Query: 372 -----------------SPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 S + LLK + + ++ + + L ++Q + Sbjct: 232 DSMIVQSFFGPKAINPSSDLDETLLKNIAAQTGGEYFRAKSTEDLQAIYQTL 283 >gi|59713864|ref|YP_206639.1| hypothetical protein VF_A0681 [Vibrio fischeri ES114] gi|59482112|gb|AAW87751.1| hypothetical membrane spanning protein [Vibrio fischeri ES114] Length = 321 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 45/233 (19%), Positives = 81/233 (34%), Gaps = 56/233 (24%) Query: 192 LIELVVDLSGSMH-CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVDLSGSM M + D ++ A+K + F+D ++ Sbjct: 84 DMMLVVDLSGSMAEEDMKTSNGDFVD----------RLTAVKQVVSDFIDQ-------RK 126 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + ++ VR+ + R + +L+ + T + A + Sbjct: 127 GDRLGLVLFGDHAYLQTPLTFDRNTVREQLDRTVLNLVGQRTAIGEGLGLATKTFIESNA 186 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 Q+ II L+DG N + ++ AK+N KI T+ I Sbjct: 187 ------------------PQRTIILLSDGANTAGV--LEPLEAAQLAKDNHAKIYTVGIG 226 Query: 370 ----------------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 A + L + + ++ NAD L ++Q I Sbjct: 227 AGEMQVRGFFGKQTVNTARDLDEDTLTKIATMTGGQYFRARNADELAEIYQTI 279 >gi|317483048|ref|ZP_07942050.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915549|gb|EFV36969.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 813 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 50/284 (17%), Positives = 81/284 (28%), Gaps = 97/284 (34%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS--------- 241 I LV+D+SGSM ++ P K+ ALK A+ F+D+ Sbjct: 123 LDIVLVLDVSGSMADNLSGGP--------------KKIDALKTAVNGFIDATADENAKIT 168 Query: 242 ---------------IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + S + G Y + ++ + V S Sbjct: 169 DQSQRNRIALVKFAGTEKTSVGNDFYREGWSSYN-YTQIVSNLTYDVSGLTSTVNGLSAS 227 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK- 345 T + A +A LT P +K +IF TDGE N+ Sbjct: 228 ---GATSADYAFNRAQAALTYQ----------------PRANAKKVVIFFTDGEPNHGSG 268 Query: 346 -SNVNTIKICDKAKE---NFIKIVTISI----NASPNGQRLLKT-----------CVSSP 386 +KAK I +I + N L K +S Sbjct: 269 FDPTVAATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSS 328 Query: 387 EY-------------------HYNVVNADSLIHVFQNISQLMVH 411 E+ + +A L ++F++I Q + Sbjct: 329 EHLWGKSWNANLGDRAETSSYYKAATDAGQLNNIFESIYQEITK 372 >gi|95928343|ref|ZP_01311091.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] gi|95135614|gb|EAT17265.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] Length = 329 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 80/235 (34%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + ++ ALK F+D ++ Sbjct: 91 DLMLAVDLSGSM----------RTDDFQLSGRSVDRLTALKAVAGAFIDQ-------RQG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + ++ V + + + L T A+ A + L D + Sbjct: 134 DRIGLILFGEQPYIQAPLTFDHNTVTRLLHEAVVGLAGNKTAIGDAIGLAVKRLRKDPQA 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +I LTDG +N+ + +K A + +K+ TI I A Sbjct: 194 ------------------KNVLILLTDGASNSGSLDP--LKAAKLAAQRGLKVYTIGIGA 233 Query: 372 SPN-----------------GQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 ++ LK ++ ++ + + L ++Q + QL Sbjct: 234 EAVEVGSFFFKRTVNPSLDLDEKTLKAIAETTGGRYFRARDTEELAQIYQQLDQL 288 >gi|328545070|ref|YP_004305179.1| von Willebrand factor type A [polymorphum gilvum SL003B-26A1] gi|326414812|gb|ADZ71875.1| von Willebrand factor type A [Polymorphum gilvum SL003B-26A1] Length = 552 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 58/179 (32%), Gaps = 40/179 (22%) Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 T + + T+ + +++LT + S + ++ K ++ Sbjct: 377 TNDKQALLDAVAAMRADGYTNIHQGVVWGWRVLTPQEPFSRGRSPDQKREKD----HRRI 432 Query: 333 IIFLTDGENNNFKS----------------------------------NVNTIKIC-DKA 357 +I +TDG N + T C + A Sbjct: 433 MIVMTDGANTYQDKSSSHNRTEYNAYGYGTEQRLGSGIDTAGEIAAKMDERTALACRNAA 492 Query: 358 KENFIKIVTISINASP-NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++ TI+ ++LL+ C SSPE ++ + L+ F+ I + + + + Sbjct: 493 TYEATQVYTIAFQVGDYATRKLLRDCASSPEMAFDAGSNSELVTAFERIGKEISRLRLA 551 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 30/158 (18%), Positives = 49/158 (31%), Gaps = 31/158 (19%) Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF-YNMDVMTSYDYRLQFIEHLLNQR 168 S + +I+ + + S N T+ + V F LL Sbjct: 60 SGTYTAEQIRQHAEAYFDGLYTAPERGSVSRNVTVGDGTISVEAGVTMP-TFFAPLLGVE 118 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 V + F + LV+D SGSM ++M Sbjct: 119 EISFAVMAESKVGTAS------FDVVLVLDNSGSMAG--------------------SRM 152 Query: 229 AALKNALLLFLD---SIDLLSHVKEDVYMGLIGYTTRV 263 LK A + SI+ +S + V +GL+ +T V Sbjct: 153 TTLKQAASDLIRTLMSINEISTEDDRVMVGLVPFTAFV 190 >gi|156308416|ref|XP_001617662.1| hypothetical protein NEMVEDRAFT_v1g225902 [Nematostella vectensis] gi|156195093|gb|EDO25562.1| predicted protein [Nematostella vectensis] Length = 273 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 45/262 (17%), Positives = 81/262 (30%), Gaps = 68/262 (25%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + + I + +D+SGSM D K ++ ALK Sbjct: 11 LARPRSVDVTAKSRTTKGIDIVMAIDVSGSMLAK---------------DFKPNRLDALK 55 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV--TRDMDSLILK 290 F++ D ++ +GL+ Y + + Q + DS+I Sbjct: 56 RVASTFIE--DRIND-----RIGLVVYAGESYTRTPITSDKTVILQSLKTVEYDDSIIAD 108 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A + K +S + II LTDG NN + Sbjct: 109 GTGIGVGLATAINRIKDSKAKS------------------RVIILLTDGVNNAGTIDPRM 150 Query: 351 IKICDKAKENFIKIVTISI-----------------------NASPNGQRLLKTCVS-SP 386 D AK+ IK+ TI I ++L+K + Sbjct: 151 AA--DIAKQYGIKVYTIGIGTNGMALFPYAKDQETGKFLFRNMQVEIDEKLMKEIAEMTD 208 Query: 387 EYHYNVVNADSLIHVFQNISQL 408 ++ + L ++ I++L Sbjct: 209 GKYFRATDDKKLKAIYAEINKL 230 >gi|312877126|ref|ZP_07737097.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311796100|gb|EFR12458.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 900 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 44/227 (19%), Positives = 78/227 (34%), Gaps = 45/227 (19%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + LV+D SGSM +D K+ K+A ++ ++ Sbjct: 399 NKEKEKNIDVVLVLDHSGSMA--------------DTEDAGIPKLEIAKSASAKMVEHLE 444 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAY 302 V G+I + + G ++ V + S+ + T P + +A Sbjct: 445 SSDGV------GVIAFDHNYYWAYKF--GKLVRKEDVIESISSIEVGGGTAIIPPLSEAV 496 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + L K ++ K ++ LTDG + D+AK N I Sbjct: 497 KTLKKSKAKN------------------KLVVLLTDGMGEQSGYEIPA----DEAKRNNI 534 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 KI TI + N L + Y V N L+ VF ++++ Sbjct: 535 KITTIGVGKFVNASVLSWIADYTSGRFYLVSNPSELVDVFLKETKII 581 >gi|307943468|ref|ZP_07658812.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] gi|307773098|gb|EFO32315.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] Length = 479 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 17/70 (24%), Positives = 29/70 (41%), Gaps = 2/70 (2%) Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPN--GQRLLKTCVSSPEYHYNVVNAD 396 G + +C+ KE + I T+ + G+ L+K+C S P+ Y + D Sbjct: 400 GATKRDAPATQALALCEAMKEQDVVIYTVYFETTGAKFGKDLMKSCASDPDKFYLAEDRD 459 Query: 397 SLIHVFQNIS 406 L F I+ Sbjct: 460 GLKAAFSAIA 469 >gi|315649108|ref|ZP_07902201.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275543|gb|EFU38898.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 983 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/221 (17%), Positives = 75/221 (33%), Gaps = 54/221 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E P + LV+D SGSM TK+ K + + ++ + Sbjct: 403 EIPSLGLILVIDRSGSMDG--------------------TKIELAKESAMRTVELLRSKD 442 Query: 247 HVKEDVYMGLIGYTTR---VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 V G++ + + V + E+V + + T+ PA+ A + Sbjct: 443 TV------GVVAFDDQPWWVVPPQKLG-NKEEVLSSIQSIPSA---GGTNIYPAVSSALE 492 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + K + ++ II +TDG + S + D EN I Sbjct: 493 EMLKIKSQ------------------RRHIILMTDG-QSAMNSGYQDLT--DTMVENKIT 531 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + ++++ + L ++ +Y V + +L VF Sbjct: 532 MSSVAVGTDADTHLLQSLAEAAKGRYYFVEDETTLPAVFSR 572 >gi|304312669|ref|YP_003812267.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] gi|301798402|emb|CBL46626.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] Length = 347 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 46/252 (18%), Positives = 81/252 (32%), Gaps = 60/252 (23%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + I R + L VD S SM + ++ +K+ + Sbjct: 82 VGDPVSINPHAR---DLMLAVDTSQSME----------IQDMRLHGEPVDRLTVIKSVVD 128 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 D +SH K D +GLI + T+ ++ + VR + + T Sbjct: 129 ------DFISHRKND-RIGLILFGTQAYLQTPLTFDHKTVRTLLNESRIGIAGGQTAIGD 181 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ A + L + K S K +I LTDG N + + Sbjct: 182 AIGLALKRL--------------KNHKTGS----KVLILLTDGANTAGSVSPVQAA--EL 221 Query: 357 AKENFIKIVTISINA-----------------SPNGQR--LLKTCVS-SPEYHYNVVNAD 396 A +KI T+ + A +K S + ++ N D Sbjct: 222 AARQGMKIYTVGVGADEMRIPGVLGFGSQIVNPSADLDEVTMKKIASLTGAQYFRARNTD 281 Query: 397 SLIHVFQNISQL 408 L ++Q+I +L Sbjct: 282 ELRRIYQHIDKL 293 >gi|239620965|ref|ZP_04663996.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239516066|gb|EEQ55933.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 816 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 49/284 (17%), Positives = 82/284 (28%), Gaps = 94/284 (33%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-------- 242 I LV+D+SGSM + S + TK+ +LK A+ F+++ Sbjct: 123 LDIVLVLDVSGSMAEKIASGW-----------NQPTKIDSLKTAVNKFINATAAENAKIT 171 Query: 243 ----------------DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + S + G Y + ++ + V S Sbjct: 172 DQSQRNRIALVKFAGTEKTSVGNDFYREGWSSYN-YTQIVSNLTYDVSGLTSTVNGLSAS 230 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK- 345 T + A +A LT P +K +IF TDGE N+ Sbjct: 231 ---GATSADYAFNRAQAALTYQ----------------PRANAKKVVIFFTDGEPNHGSG 271 Query: 346 -SNVNTIKICDKAKE---NFIKIVTISI----NASPNGQRLLKT-----------CVSSP 386 +KAK I +I + N L K +S Sbjct: 272 FDPTVAATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSS 331 Query: 387 EY-------------------HYNVVNADSLIHVFQNISQLMVH 411 E+ + +A L ++F++I Q + Sbjct: 332 EHLWGKSWNANLGDRAETSSYYKAATDAGQLNNIFESIYQEITK 375 >gi|152990152|ref|YP_001355874.1| von Willebrand factor A [Nitratiruptor sp. SB155-2] gi|151422013|dbj|BAF69517.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 305 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 46/271 (16%), Positives = 101/271 (37%), Gaps = 44/271 (16%) Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG---ERPIFLIELVVDLS 200 IF ++D++ + F+ L + + + I+ ++ + I L +D S Sbjct: 33 IFPHIDILKKAANKKAFLPEFLKITAIIGTLFALAGPVHIDNMVHLKKKGYDIVLAIDAS 92 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 GSM ++ P D ++TK +++ + F+ + +G++ + Sbjct: 93 GSM--------QEKGFDP--TDPQKTKFDVVRSLVKAFISKRRNDN-------IGVVIFG 135 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + ++ E V++ + + T A+ ++ ++L Sbjct: 136 SFAYIASPLTFNKEAVKKILDYLDIGVAGSKTAIDDALIESVRLL--------------- 180 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG--QRL 378 S K +I LTDG + K+ + AK+ +KI TI I G + Sbjct: 181 ---KESQAKSKIVILLTDGIDTASKTPPDVAV--KMAKKYGVKIYTIGI-GDKRGIDEAF 234 Query: 379 LKTCVSSP-EYHYNVVNADSLIHVFQNISQL 408 L+ Y++ +A L ++ I++L Sbjct: 235 LRWLAQQGHGYYFYAKDASMLRKIYDEINRL 265 >gi|326795817|ref|YP_004313637.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] gi|326546581|gb|ADZ91801.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] Length = 337 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 48/265 (18%), Positives = 93/265 (35%), Gaps = 63/265 (23%) Query: 164 LLNQRYNQKIVSF-----IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 LL + +V+ + + +I R +F + +DLSGSM S Sbjct: 64 LLMFAWISLVVAIARPIWVGSPTQITPSGRDLF---VALDLSGSM----------QISDM 110 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 Q + ++ K+ L F++ ++ +G+I + T+ S+ T+ VRQ Sbjct: 111 YYQSRPVNRLVISKHVLSDFIEK-------RKGDRIGVIVFGTKAYLQAPLSFDTKTVRQ 163 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + T A+ + L+ +K +I +TD Sbjct: 164 LIQETQIGFAGEKTAIGDAIGLGIKQLSELPSD------------------KKVLILMTD 205 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINAS-----------------PNGQRLLKT 381 G N + V+ ++ + A E + I TI I A + LL+ Sbjct: 206 GANTAGR--VSPLQAANFAAEQGVTIHTIGIGADEMEVQGFFGPQTVNPSEDLDEALLEN 263 Query: 382 CVS-SPEYHYNVVNADSLIHVFQNI 405 S + +Y + L ++ +I Sbjct: 264 VASLTGGKYYRAKSTSDLEEIYGDI 288 >gi|290769676|gb|ADD61455.1| putative protein [uncultured organism] Length = 816 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 49/284 (17%), Positives = 82/284 (28%), Gaps = 94/284 (33%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-------- 242 I LV+D+SGSM + S + TK+ +LK A+ F+++ Sbjct: 123 LDIVLVLDVSGSMAEKIASGW-----------NQPTKIDSLKTAVNKFINATAAENAKIT 171 Query: 243 ----------------DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + S + G Y + ++ + V S Sbjct: 172 DQSQRNRIALVKFAGTEKTSVGNDFYREGWSSYN-YTQIVSNLTYDVSGLTSTVNGLSAS 230 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK- 345 T + A +A LT P +K +IF TDGE N+ Sbjct: 231 ---GATSADYAFNRAQAALTYQ----------------PRANAKKVVIFFTDGEPNHGSG 271 Query: 346 -SNVNTIKICDKAKE---NFIKIVTISI----NASPNGQRLLKT-----------CVSSP 386 +KAK I +I + N L K +S Sbjct: 272 FDPTVAATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSS 331 Query: 387 EY-------------------HYNVVNADSLIHVFQNISQLMVH 411 E+ + +A L ++F++I Q + Sbjct: 332 EHLWGKSWNANLGDRAETSSYYKAATDAGQLNNIFESIYQEITK 375 >gi|220921017|ref|YP_002496318.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] gi|219945623|gb|ACL56015.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] Length = 324 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 46/258 (17%), Positives = 86/258 (33%), Gaps = 49/258 (18%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 F+ LL + L + I L +DLSGSM S Sbjct: 64 TFLAWLLWIALVVALAGPRLVLPAAALPASGR-EIMLALDLSGSMERVDFSLDG------ 116 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + ++AA+K F+ + +GL+ + + + S+ T V Sbjct: 117 ----RNVARLAAVKRVGADFIRR-------RAGDRIGLVIFADQADVAASLSFDTASVAH 165 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + L+ + T + A + L + R +K ++ L+D Sbjct: 166 ALDEAQIGLVGRSTGIGDGLGLALKRLDATPAR------------------EKVVVLLSD 207 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISI----NASPNG------QRLLKTCV-SSPE 387 G NN ++ + + A+E I++ TI++ + G L+ +S Sbjct: 208 GANNAGQTTPHDVA--ALARELGIRVHTIALGPRDLSDAEGDPDVVDTEALRDVATTSGG 265 Query: 388 YHYNVVNADSLIHVFQNI 405 + V D L V +I Sbjct: 266 RFFRVRTTDDLAAVADSI 283 >gi|306821351|ref|ZP_07454960.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] gi|304550638|gb|EFM38620.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] Length = 467 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 39/230 (16%), Positives = 76/230 (33%), Gaps = 47/230 (20%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 L+ E + I V+D SGSM +DP + Sbjct: 14 TFLIEDEKDKYDGINIAFVIDSSGSM---FYNDPNGLRREVTH----------------- 53 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + ++ +IG+ + + + EK+ V + T+ A Sbjct: 54 -----KFIDRLTDNDMAAVIGFDYKATVLEQFTSNKEKLHDAVDKIRSD---GGTNIGRA 105 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + AY + + + KF+I LTDG+ + + A Sbjct: 106 VSIAYDLFNNLDNNRKE-------------KYPKFLILLTDGDGDYSEEYTIL------A 146 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 K+ IKI TI + + + L + +++ +A L +F+ I+ Sbjct: 147 KKAGIKIYTIGLGNGVSEKLLKDIAKGTDGEYFHAKDASKLNKIFEKIAD 196 >gi|126730249|ref|ZP_01746060.1| hypothetical protein SSE37_10854 [Sagittula stellata E-37] gi|126708982|gb|EBA08037.1| hypothetical protein SSE37_10854 [Sagittula stellata E-37] Length = 666 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 15/83 (18%), Positives = 34/83 (40%), Gaps = 6/83 (7%) Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + F T G N + N C A++ + + ++ + L+ C SS + ++ Sbjct: 588 YTTFSTRGNNLD-----NLHTQCQLARDLGVTVFAVAFETTDADADELRLCASSDSHFFH 642 Query: 392 VVNADSLIHVFQNISQLMVHRKY 414 V +I F I++ + + + Sbjct: 643 VQGT-EIIDAFDTIARQINNLRL 664 >gi|84498078|ref|ZP_00996875.1| putative secreted protein [Janibacter sp. HTCC2649] gi|84381578|gb|EAP97461.1| putative secreted protein [Janibacter sp. HTCC2649] Length = 659 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 79/220 (35%), Gaps = 42/220 (19%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 L++D SGSM TK+ A K AL + ++ + + Sbjct: 40 LMLDASGSMKAK--------------DPSGLTKIEAAKKALTGVVGAL------PDTAQV 79 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 GL Y V+ +P+ Q +T T PA+ A + + + + Sbjct: 80 GLRVYGATVDGKGKPTPAACADTQLITPI-------GTIDKPALTSAISAINALGE-TPI 131 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE-----NFIKIVTISI 369 + + +K ++ I+ ++DGE + C K+ ++I T+ Sbjct: 132 AHSLTEALKDLGPTGKRNIVLVSDGEESCTPDP------CPAVKKLTAGGVDLQIDTVGF 185 Query: 370 NASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQ 407 + + L+ C++ +Y+ +A +L +SQ Sbjct: 186 GVNAKARSQLQ-CIADAGKGTYYDAKDAPALAASLSKLSQ 224 >gi|319784437|ref|YP_004143913.1| hypothetical protein Mesci_4754 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170325|gb|ADV13863.1| hypothetical protein Mesci_4754 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 553 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 54/168 (32%), Gaps = 41/168 (24%) Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 M T+ + + ++L+ + + G+ K ++ LTDGEN Sbjct: 392 SQMTEWNGSGTNVSEGLSWGMRVLSPAAPYTDGAPWKTPGI-------SKIVLLLTDGEN 444 Query: 342 NNFKSNVN---------------------------------TIKICDKAKENFIKIVTIS 368 + ++ T +C + K ++I T+ Sbjct: 445 VVYGASEQEPTKSDYTSYGYLAGGRFGSDNQTTAARNVDGWTKNVCTQLKNEGVQIYTMV 504 Query: 369 INAS-PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + + L C S P +Y V + L +VF I+ + + Sbjct: 505 LQSDTAANRALYSACASDPSNYYAVNDPTKLPNVFLQIANNFTKLQLT 552 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 53/262 (20%), Positives = 84/262 (32%), Gaps = 69/262 (26%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 K S+ NF+++ L + L + F V K+++++A ++A LA Sbjct: 5 KFWRSKSGNFTLVLGLGLPVILTAVAFATDVSTLMRAKSNLQNALDSANLAS-------- 56 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 S LGD D R + F NIV + Sbjct: 57 SHLGDL--------------DITR-------------NDAFNRYFQANIVGHGELD---N 86 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 A L F + S D L F L ++ IV A+ Sbjct: 87 AQATLTVDKGVNFVKTKAVASADVHLNF-AFLFG--DSKHIVVDASAVESNNQ-----LE 138 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D +GSM A +M AL+ A LD+++ + Sbjct: 139 VVLVLDNTGSMAGA--------------------RMTALRTATKSLLDTLEAAKSPTRKI 178 Query: 253 YMGLIGYTTRVEKN---IEPSW 271 + + T V N +PSW Sbjct: 179 RASPVPFVTAVNVNGDGFDPSW 200 >gi|327399949|ref|YP_004340788.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327315457|gb|AEA46073.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 527 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 70/224 (31%), Gaps = 52/224 (23%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + +D SGSM S K A F+D ++ + Sbjct: 71 IDVVFALDSSGSMGWNDPSGLR-------------------KTAAKSFVDKLNSTTD--- 108 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 G++ + ++ + V+ + S TD + A +L + K+ Sbjct: 109 --QAGVVSWDNNIDFTQTLTNNFSLVKSKIDAVDSS---GGTDLNVGLNAAISLLDTGKQ 163 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + IIFL++G+ S A + TI + Sbjct: 164 ANSSW----------------VIIFLSNGQGTYSHSTAVV------AANKGYTVYTIGLA 201 Query: 371 ASPNG---QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 SP L ++ +Y+ NA +L VF +I + +V Sbjct: 202 ISPGSTAESNLKDIANTTGGKYYSSPNATNLDAVFNDIYKEVVT 245 >gi|86131264|ref|ZP_01049863.1| aerotolerance-related exported protein BatA [Dokdonia donghaensis MED134] gi|85818675|gb|EAQ39835.1| aerotolerance-related exported protein BatA [Dokdonia donghaensis MED134] Length = 334 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 45/280 (16%), Positives = 86/280 (30%), Gaps = 70/280 (25%) Query: 157 RLQFIEHLLNQRYNQKIVS----FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 +L+ + +L I+ + + I + +D+S SM Sbjct: 54 KLRPLLFILRLAALSLIIVALARPRNVEVSTKTKTTKGIDIVIAIDVSASMLAK------ 107 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + ++ ALK F+ + + + +GL+ Y + Sbjct: 108 ---------DLRPNRLEALKKVASSFI-------NGRPNDRIGLVEYAGESFTKTPITSD 151 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 V + +S+I T + + K K Sbjct: 152 KSIVLSALKGIQYNSIIEGGTAIGMGLATGVNRIKDSKA------------------LSK 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS----------INASPNG------ 375 II +TDGENN + + + A+E IK+ TI +PNG Sbjct: 194 VIILMTDGENNAGQIDPRIAA--ELAQEFGIKVYTIGMGTNGTALSPYARNPNGTFVYEN 251 Query: 376 ------QRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 + LL+ ++ ++ N L ++ I +L Sbjct: 252 IQVTIDEELLEEIAETTGGQYFRATNNKKLQEIYDEIDKL 291 >gi|149180101|ref|ZP_01858606.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] gi|148852293|gb|EDL66438.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] Length = 931 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 46/214 (21%), Positives = 78/214 (36%), Gaps = 52/214 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E P + +V+D SGSM K+ K A I Sbjct: 404 ELPSLGMVIVLDRSGSMAGY--------------------KIQLAKEAA------IRSAE 437 Query: 247 HVKEDVYMGLIGYTTR--VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++E +G I + R + EP EKV + + T+ P+++ AY+ Sbjct: 438 LLREKDTLGFIAFDDRPWQIIDTEPIKDKEKVIEKINGL---TSGGGTNIFPSLELAYEQ 494 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 LT P +K II LTDG++ + TI+ + KEN I + Sbjct: 495 LT------------------PLELQRKHIILLTDGQSATSPDYLTTIQ---EGKENNITL 533 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 T++I + L + Y+V ++ ++ Sbjct: 534 STVAIGEGSDSVLLEELSDEGGGRFYDVNDSSTI 567 >gi|322689979|ref|YP_004209713.1| cell surface protein [Bifidobacterium longum subsp. infantis 157F] gi|320461315|dbj|BAJ71935.1| putative cell surface protein [Bifidobacterium longum subsp. infantis 157F] Length = 794 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 48/284 (16%), Positives = 80/284 (28%), Gaps = 94/284 (33%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-------- 242 I LV+D+SGSM + S + TK+ +LK A+ F+++ Sbjct: 101 LDIVLVLDVSGSMAEKIASGW-----------NQPTKIDSLKTAVNKFINATAAENAKIT 149 Query: 243 ----------------DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + S + G Y + ++ + V S Sbjct: 150 DQSQRNRIALVKFAGTEKTSVGNDFYREGWSSYN-YTQIVSNLTYDVSGLTSTVNGLSAS 208 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK- 345 T + A +A LT P +K +IF TDGE N+ Sbjct: 209 ---GATSADYAFNRAQAALTYQ----------------PRANAKKVVIFFTDGEPNHGSG 249 Query: 346 -SNVNTIKICDKAKE---NFIKIVTISI----NASPNGQRLLKTC--------------- 382 +KAK I +I + N L K Sbjct: 250 FDPTVAATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSS 309 Query: 383 ---------------VSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + Y+ +A L ++F++I Q + Sbjct: 310 EHLWGKSWNANLGDRAETSSYYKAATDAGQLNNIFESIYQEITK 353 >gi|312133821|ref|YP_004001160.1| von willebrand factor (vwf) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] gi|311773110|gb|ADQ02598.1| Von Willebrand factor (VWF) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] Length = 794 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 48/284 (16%), Positives = 80/284 (28%), Gaps = 94/284 (33%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-------- 242 I LV+D+SGSM + S + TK+ +LK A+ F+++ Sbjct: 101 LDIVLVLDVSGSMAEKIASGW-----------NQPTKIDSLKTAVNKFINATAAENAKIT 149 Query: 243 ----------------DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + S + G Y + ++ + V S Sbjct: 150 DQSQRNRIALVKFAGTEKTSVGNDFYREGWSSYN-YTQIVSNLTYDVSGLTSTVNGLSAS 208 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK- 345 T + A +A LT P +K +IF TDGE N+ Sbjct: 209 ---GATSADYAFNRAQAALTYQ----------------PRANAKKVVIFFTDGEPNHGSG 249 Query: 346 -SNVNTIKICDKAKE---NFIKIVTISI----NASPNGQRLLKTC--------------- 382 +KAK I +I + N L K Sbjct: 250 FDPTVAATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSS 309 Query: 383 ---------------VSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + Y+ +A L ++F++I Q + Sbjct: 310 EHLWGKSWNANLGDRAETSSYYKAATDAGQLNNIFESIYQEITK 353 >gi|13476511|ref|NP_108081.1| hypothetical protein mlr7847 [Mesorhizobium loti MAFF303099] gi|14027272|dbj|BAB54226.1| mlr7847 [Mesorhizobium loti MAFF303099] Length = 548 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 56/167 (33%), Gaps = 40/167 (23%) Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 +M T+ + + ++L+ + + G+ K ++ LTDGEN Sbjct: 388 SEMTEWNGSGTNVSEGLSWGMRVLSPAAPYTDGAPWKTPGI-------SKIVLLLTDGEN 440 Query: 342 NNFKSNVN--------------------------------TIKICDKAKENFIKIVTISI 369 + ++ T +C + K ++I T+ + Sbjct: 441 VVYGASEQPTKSDYTSYGYLAGGRFGSDDQTAAARNVDGWTKSVCTQLKNQGVQIYTMVL 500 Query: 370 NAS-PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + L C S P +Y V + L VFQ+I+ + + Sbjct: 501 QSDTAANRALYSACASDPSGYYAVNDPAKLPDVFQHIANKFSRLQLT 547 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 49/262 (18%), Positives = 83/262 (31%), Gaps = 69/262 (26%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 K S+ NF+++ L + + L + F + V K+++++A +AA LA Sbjct: 5 KFWRSKSGNFALLMGLGLPAILSAVAFAVDVSTVMRAKSNLQNALDAANLAS-------- 56 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 +H+ + + F NI + Sbjct: 57 ---------------------------SHLGDLDISRTDAFDRYFQANIAGHGEL---AN 86 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 A L F + S D L F L N+ I A+ Sbjct: 87 AQATLTVDRGVNFIKTKAVASADVNLNF-GFLFGH--NRHIAVDASAVESDNQ-----LE 138 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D +GSM A +M AL+ A LD+++ + Sbjct: 139 VVLVLDNTGSMAGA--------------------RMTALRTATKSLLDTLEATKSPTRQI 178 Query: 253 YMGLIGYTTRVEKN---IEPSW 271 L+ + T V N +PSW Sbjct: 179 RASLVPFVTAVNVNGDEFDPSW 200 >gi|146276888|ref|YP_001167047.1| hypothetical protein Rsph17025_0838 [Rhodobacter sphaeroides ATCC 17025] gi|145555129|gb|ABP69742.1| hypothetical protein Rsph17025_0838 [Rhodobacter sphaeroides ATCC 17025] Length = 563 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 33/70 (47%), Gaps = 2/70 (2%) Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLIHVFQNI 405 + +C+ AK N + I I+ A NG+ ++ C +S ++++ + F+ I Sbjct: 494 DAQLQALCNLAKSNNVTIFGIAFEAPANGKTQIQNCSTSRSSHYFDASGL-EIQTAFRAI 552 Query: 406 SQLMVHRKYS 415 + + + + S Sbjct: 553 ASQISYLRLS 562 >gi|116252440|ref|YP_768278.1| hypothetical protein RL2693 [Rhizobium leguminosarum bv. viciae 3841] gi|115257088|emb|CAK08182.1| conserved hypothetical exported protein [Rhizobium leguminosarum bv. viciae 3841] Length = 427 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 63/446 (14%), Positives = 132/446 (29%), Gaps = 94/446 (21%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I+ NF I+ AL V+ G + + + +A +AA + + Sbjct: 9 ISDRSGNFGIMTALLVVPLFGAAGMAVDFAHALSLRTQLYAAADAAAVGS---IAEKSGA 65 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + N DDA+ + + L+ I + Sbjct: 66 VAAAMTMSGNGTISLGKDDARNIFMSQMSGELT----------------DVHIDLGINVT 109 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + N+ + ++ V T+ +L + I E Sbjct: 110 KTANKLNSQVSFSATVPTT-------FMRILGRDSIT-----ISGAATAEYQTAAFMDFY 157 Query: 195 LVVDLSGSMH----------------CAMNSDPEDVNSAPICQDKK---RTKMAALKNAL 235 +++D + SM CA D ++ K ++ ++ A Sbjct: 158 ILLDNTPSMGVGATANDVSKLQAKTGCAFACHQMDQSTNNYTIAKGLGVAMRIDVVRQAT 217 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D+ + MG+ + T+ E + +T+ + TD+ Sbjct: 218 QALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTT--ISSPTSDLTKVKNY-----TDTV 270 Query: 296 PAMKQAYQILTSDKKRSFFTNFFR-------QGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 M YQ D+ SF + + G ++ +K + F++DG +++K + Sbjct: 271 DLMTIPYQNYNQDQLTSFDSALTQMNTIIDPAGDGTSNISPEKILFFVSDGVGDSYKPST 330 Query: 349 NTIKI-------------CDKAKENFIKI---VTISINASPNG-------------QRLL 379 T K C K+ +KI T + N + Sbjct: 331 CTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPSNDWYNKWISPFQSEIPTKM 390 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNI 405 + C SP +++ V + + + + Sbjct: 391 QACA-SPGFYFEVTPTEGITDAMKAL 415 >gi|23466092|ref|NP_696695.1| hypothetical protein BL1539 [Bifidobacterium longum NCC2705] gi|322691915|ref|YP_004221485.1| cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] gi|23326823|gb|AAN25331.1| hypothetical protein with gram positive cell wall anchoring domain [Bifidobacterium longum NCC2705] gi|320456771|dbj|BAJ67393.1| putative cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 794 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 48/284 (16%), Positives = 80/284 (28%), Gaps = 94/284 (33%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-------- 242 I LV+D+SGSM + S + TK+ +LK A+ F+++ Sbjct: 101 LDIVLVLDVSGSMAEKIASGW-----------NQPTKIDSLKTAVNKFINATAAENAKIT 149 Query: 243 ----------------DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + S + G Y + ++ + V S Sbjct: 150 DQSQRNRIALVKFAGTEKTSVGNDFYREGWSSYN-YTQIVSNLTYDVSGLTSTVNGLSAS 208 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK- 345 T + A +A LT P +K +IF TDGE N+ Sbjct: 209 ---GATSADYAFNRAQAALTYQ----------------PRANAKKVVIFFTDGEPNHGSG 249 Query: 346 -SNVNTIKICDKAKE---NFIKIVTISI----NASPNGQRLLKTC--------------- 382 +KAK I +I + N L K Sbjct: 250 FDPTVAATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSS 309 Query: 383 ---------------VSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + Y+ +A L ++F++I Q + Sbjct: 310 EHLWGKSWNANLGDRAETSSYYKAATDAGQLNNIFESIYQEITK 353 >gi|303240108|ref|ZP_07326629.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] gi|302592377|gb|EFL62104.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] Length = 323 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 44/250 (17%), Positives = 83/250 (33%), Gaps = 65/250 (26%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + I + +D+SG+M D + ++ + + F+D Sbjct: 73 KNSPISKQGIDIAVALDVSGTMQS---------------VDFEPNRLEVARKTIQDFVDQ 117 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQ 300 + + LI + + + VR+ + S+ + T A+ Sbjct: 118 -------RPSDRIALIAFAGTAYTRVPLTLDHNVVRESLQDISFKSVNEEGTAIGMAISV 170 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 L S K +I LTDG+NN + NT AK++ Sbjct: 171 GLNRL------------------KKSTSPSKIMILLTDGDNNAGSIDPNTAST--LAKDS 210 Query: 361 FIKIVTISINASPN---------------------GQRLLKTCV-SSPEYHYNVVNADSL 398 IKI TI + + + LLK ++ +Y ++++L Sbjct: 211 GIKIYTIGVGSDKTIIPGTNEFGQTVYQEYESGLLNEDLLKKIAETTNGQYYRAKDSNAL 270 Query: 399 IHVFQNISQL 408 VF NI++L Sbjct: 271 SQVFANINKL 280 >gi|91201136|emb|CAJ74195.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 331 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 41/237 (17%), Positives = 80/237 (33%), Gaps = 59/237 (24%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L VD+SGSM + K++ ++ +K + F++ Sbjct: 86 IDIVLAVDISGSM----------LAEDFEMDGKRQNRLYVVKQVVKDFINK-RSTDP--- 131 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ ++ + + Q++ + L+ T A+ + L + K Sbjct: 132 ---IGLVVFSANAYTQCPLTLDYGILLQFLEKTEIGLLEDGTAIGSAIASSVDRLRNTKA 188 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS-- 368 +S K I+ LTDG NN+ + + T + A+ IKI TI Sbjct: 189 QS------------------KVIVLLTDGRNNSGQIDPLTAA--ELAQAFNIKIYTIGAG 228 Query: 369 --------------------INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + L + + +Y + SL ++Q I Sbjct: 229 SKGLVPYPARDLFGNRVMRQVKIDIDDESLAEIANITGGRYYRATDTGSLKEIYQQI 285 >gi|294054315|ref|YP_003547973.1| hypothetical protein Caka_0779 [Coraliomargarita akajimensis DSM 45221] gi|293613648|gb|ADE53803.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 345 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 72/193 (37%), Gaps = 39/193 (20%) Query: 178 PALLRIEMGERP-IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 P L + G+ I L +DLSGSM S E++ T++ A K + Sbjct: 75 PQLGNLNEGQEAEGIDIVLALDLSGSMRALDLSTRENIV----------TRLDAAKEVVQ 124 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDST 295 F+ + +GL+ + + + +++ V R + + L T Sbjct: 125 EFIGK-------RPHDRIGLVAFAADAFVVSPLTLNHDWLKKNVQRLELGDINLSGTAIG 177 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + L + RS + +I LTDGENN+ + + + Sbjct: 178 TALGASVNRLRDHESRS------------------RIVILLTDGENNSGTLSPLSAA--E 217 Query: 356 KAKENFIKIVTIS 368 AK +K+ TI+ Sbjct: 218 AAKSLNVKVYTIA 230 >gi|149197908|ref|ZP_01874957.1| BatA [Lentisphaera araneosa HTCC2155] gi|149139129|gb|EDM27533.1| BatA [Lentisphaera araneosa HTCC2155] Length = 341 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 57/253 (22%), Positives = 89/253 (35%), Gaps = 51/253 (20%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + +R I+++VD+S SM M +++ T+M K + F Sbjct: 74 TEVTKKRQDRQGIAIQVLVDVSSSMDINM-----------KYGEERLTRMDVAKIVVEKF 122 Query: 239 LD-SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDST 295 + D L +D+ +GLI + + S E + V + T Sbjct: 123 IGGDGDELVGRPDDL-IGLITFARYADTIAPLSLAHEALISIVQDVTINTRPNEDGT--- 178 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 AY T+ Q +K K II LTDGENN ++ Sbjct: 179 -----AYGDATALAAAQLDLLQGDQDIKS------KIIILLTDGENNCGNHLP--LQAAS 225 Query: 356 KAKENFIKIVTISIN-------------------ASPNGQRLLKTCVSSPEYHYN-VVNA 395 AKE IKI TISI G ++LK S + + Sbjct: 226 LAKEWGIKIYTISIQNKPTPERKKTDQGTFFVPPTPSAGDQVLKKMAESTGGVFRLAHDY 285 Query: 396 DSLIHVFQNISQL 408 DSL V++ I++L Sbjct: 286 DSLKSVYKEINKL 298 >gi|149911739|ref|ZP_01900346.1| von Willebrand factor type A domain protein [Moritella sp. PE36] gi|149805212|gb|EDM65230.1| von Willebrand factor type A domain protein [Moritella sp. PE36] Length = 330 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 79/234 (33%), Gaps = 55/234 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L VDLS SM ++ +++ +K + F+ ++ Sbjct: 87 MMLAVDLSRSM----------QAEDMQINNRMVDRLSLVKTVVADFIQQ-------RKGD 129 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + ++ + V Y+ + + L+ + T + A + Sbjct: 130 RVGLIFFADNAYLQAPLTFDLKTVSGYMQQAVLGLVGEQTAIGEGIGLAL--------KR 181 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 F Q K +I LTDG+N+ V + A+E +KI TI + A Sbjct: 182 FDAADNPQ----------KVLILLTDGQNSA--GEVKPLDAAKFAQEQGVKIYTIGVGAD 229 Query: 373 PNGQRLL------------------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +R L + ++ +A SL ++ + +L Sbjct: 230 AYYKRTLFGNQKVDPSRDLDEVTLKTIAAQTGGQYFRARDASSLAAIYAELDKL 283 >gi|159896929|ref|YP_001543176.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159889968|gb|ABX03048.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 579 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 79/226 (34%), Gaps = 43/226 (19%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + I LV+D SGSM + + + A K A +D + H Sbjct: 356 QDPLNIMLVIDTSGSMGPSKEGLTDGG-------------LDAAKIAA------LDFIDH 396 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + +GLI + T V + + VRQ ++ T A+ +Y L Sbjct: 397 LPSNANVGLIHFGTLVTVDHSLTNDIGAVRQSISELKPE---GQTAIYDALAISYTQLRR 453 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K ++ FI+ ++DG + K + N I KA + I I Sbjct: 454 AKGQT-------------------FIVLISDGADTASKGD-NYDSIVAKATKANIPTYII 493 Query: 368 SINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + +LL+ + Y + + L + ++Q + + Sbjct: 494 GLTSPEFDGQLLEDLQRDTKAMIYQTPSKEQLGGFYTEVAQEVSGQ 539 >gi|46190503|ref|ZP_00121395.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium longum DJO10A] gi|189440499|ref|YP_001955580.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] gi|189428934|gb|ACD99082.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] Length = 794 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 48/284 (16%), Positives = 80/284 (28%), Gaps = 94/284 (33%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-------- 242 I LV+D+SGSM + S + TK+ +LK A+ F+++ Sbjct: 101 LDIVLVLDVSGSMAEKIASGW-----------NQPTKIDSLKTAVNKFINATAAENAKIT 149 Query: 243 ----------------DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + S + G Y + ++ + V S Sbjct: 150 DQSQRNRIALVKFAGTEKTSVGNDFYREGWSSYN-YTQIVSNLTYDVSGLTSTVNGLSAS 208 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK- 345 T + A +A LT P +K +IF TDGE N+ Sbjct: 209 ---GATSADYAFNRAQAALTYQ----------------PRANAKKVVIFFTDGEPNHGSG 249 Query: 346 -SNVNTIKICDKAKE---NFIKIVTISI----NASPNGQRLLKTC--------------- 382 +KAK I +I + N L K Sbjct: 250 FDPTVAATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSS 309 Query: 383 ---------------VSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + Y+ +A L ++F++I Q + Sbjct: 310 EHLWGKSWNANLGDRAETSSYYKAATDAGQLNNIFESIYQEITK 353 >gi|86357991|ref|YP_469883.1| hypothetical protein RHE_CH02376 [Rhizobium etli CFN 42] gi|86282093|gb|ABC91156.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 427 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 64/446 (14%), Positives = 135/446 (30%), Gaps = 94/446 (21%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I++ NF I+ AL ++ + G + V + + +A +AA + Sbjct: 9 ISNRSGNFGIMTALLMVPLMGAAGMAVDVAHALSLRTQLYAAADAAAV------------ 56 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 G E A ++ + + +F + + + I + Sbjct: 57 -GSIAEKSGAVAAAMTMNG-----NGTVSLGKTDARNIFMSQTSGELTD-IHIDLGIDVT 109 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + N+ + + V F+ R + I E Sbjct: 110 KTANKLNSQVSFTATV------PTTFM------RIFGRDSIIISGTATAEYQTAAFMDFY 157 Query: 195 LVVDLSGSMHC-AMNSDPEDVNSAPIC---------QDKKRT---------KMAALKNAL 235 +++D + SM A SD + + C T ++ ++ A Sbjct: 158 ILLDNTPSMGVGATASDVSKLQAKTGCAFACHQMDQSTNNYTIAKSLGVTMRIDVVRQAT 217 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D+ + MG+ + T+ E + + +T+ + T++ Sbjct: 218 QALTDTAKAERVSSDQFRMGVYTFGTKAEDAKLTT--ISGLTSDLTKVKNY-----TNAV 270 Query: 296 PAMKQAYQILTSDKKRSFFTNFFR-------QGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 M YQ SD+ SF + + G ++ +K + F+ DG +++K + Sbjct: 271 DLMTIPYQNYNSDQLTSFDSAMTQINTIIDPAGDGTSNISPEKILFFVADGVGDSYKPST 330 Query: 349 NTIKI-------------CDKAKENFIKI---VTISINASPNG-------------QRLL 379 T K C K+ +KI T + N + Sbjct: 331 CTKKTTGGRCQEPIDTTFCKPLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQNEIPTKM 390 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNI 405 + C SP ++ V D + + + Sbjct: 391 QACA-SPGLYFEVTPTDGIADAMKAL 415 >gi|116753518|ref|YP_842636.1| von Willebrand factor, type A [Methanosaeta thermophila PT] gi|116664969|gb|ABK13996.1| von Willebrand factor, type A [Methanosaeta thermophila PT] Length = 795 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 42/223 (18%), Positives = 77/223 (34%), Gaps = 52/223 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L +D SGSM SDP D+ K+A F+ +DL Sbjct: 65 DVVLSIDSSGSM---TTSDPGDLR----------------KSAAKEFVTGLDLSMD---- 101 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKK 310 +G++ + T SW + + +DS T +K A +L Sbjct: 102 -RVGVVSWNT-----SAISWPLTNNTKDIESAIDSTGADGNTCLDTGLKSAIDLL----- 150 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDG---ENNNFKSNVNTIKICDKAKENFIKIVTI 367 K I+ LTDG + ++ D+A+ I + TI Sbjct: 151 --------------SECSGSKVIVLLTDGISTDGGHYTPPGVPGSPVDEARSKGILVFTI 196 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + + L + S+ Y+ +A++L +++ I + Sbjct: 197 GLGPDADARNLTEIAHSTGGEFYSAPDANALAGIYKRIRSSIT 239 >gi|218671335|ref|ZP_03521005.1| hypothetical protein RetlG_06538 [Rhizobium etli GR56] Length = 49 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 10/43 (23%), Positives = 17/43 (39%) Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 A GQ LL+ C S +++ + L F+ I + Sbjct: 1 MAPEGGQALLQYCASDASHYFQAEKMEDLFAAFKAIGAKASTQ 43 >gi|294011439|ref|YP_003544899.1| Flp pilus assembly protein TadG [Sphingobium japonicum UT26S] gi|292674769|dbj|BAI96287.1| Flp pilus assembly protein TadG [Sphingobium japonicum UT26S] Length = 771 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 56/177 (31%), Gaps = 12/177 (6%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILKPTDSTPA---MK 299 + V G + +I WG + + ++ M Sbjct: 601 SAQDLSNYV--GTLVPHNNTYHDIGLLWGARLMSPTGIFASENATTGGGAQIQRHLIFMT 658 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 T + S+ ++ + P+ P +N N +N + +C K Sbjct: 659 DGATATTVNNYASYGLEWWDRRQIAPAGPNDANY-----DDNLNAVNNARSNALCTAIKN 713 Query: 360 NFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I + I ++ + L C +SP Y Y N LI F+ I+ + + + + Sbjct: 714 KNITLWVIYYGSSDTATKTRLTNCATSPSYFYEARNTTLLIGKFREIADRISNLRLT 770 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 51/327 (15%), Positives = 93/327 (28%), Gaps = 87/327 (26%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R F + ++ N I A +++ + L+G + + K ++SA +A LA Sbjct: 5 IKRALFILMRLYHNQAGNILAITAAAIIPTIGLVGGAFDMARIYAVKTRLQSACDAGALA 64 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G M S G ++ AL F + S + +E Sbjct: 65 GRRIMGS-----GRWTDNNGRPNTTALATFDLNF----AQNSFGAENRTRSYSESDG--- 112 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + S D + + +LN + V+ + Sbjct: 113 -----------------------TVSGTASADVPMTLM-RVLNVPTKRVEVTCEGQMRIP 148 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI- 242 + V+D SGSM+ + D + KMA L+ A+ F +++ Sbjct: 149 NT------DVMFVLDNSGSMNEVIPGDTTGLK-----------KMAGLQLAIRCFYEALA 191 Query: 243 -------------DLLSHVKED-----VYMGLIGYTTRVEKNIEP-------SW------ 271 + + G + Y V SW Sbjct: 192 RQNITDVSAAQCGTSADPTGDLSSQVQLRFGFVNYDNMVNVGKLLPNNYLADSWTYRSRT 251 Query: 272 --GTEKVRQYVTRDMDSLILKPTDSTP 296 G + V + + P STP Sbjct: 252 ATGIQTVYAWTAGTQSATTWGPWSSTP 278 >gi|114571146|ref|YP_757826.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] gi|114341608|gb|ABI66888.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] Length = 500 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 57/154 (37%), Gaps = 10/154 (6%) Query: 266 NIEPSWGTEKVRQYV--TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 I SWG + + T + + + L ++ ++ Sbjct: 352 PIGISWGVRVLSPGMPFTEGVSYDEEGTIKAMVVLTDGENYLDGRNNPNYSHYSGYGYMR 411 Query: 324 IPSLPFQKFIIFLTDGENNNFKS-NVNTIKICDKAKENFIKIVTISINASPNG-QRLLKT 381 L Q T ++ + N T C+ AK I++ TI+ + + + +++ Sbjct: 412 DGRLGIQ------TSSDSTIRNALNDRTEAACEYAKSLGIRVYTITFQVNSSSTRDMMRD 465 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 C + P +++ + D+L F+ I+ + + + S Sbjct: 466 CATHPTLYFDSPSDDALRSAFEMIAGDLTNLRLS 499 >gi|89098674|ref|ZP_01171556.1| hypothetical protein B14911_00755 [Bacillus sp. NRRL B-14911] gi|89086636|gb|EAR65755.1| hypothetical protein B14911_00755 [Bacillus sp. NRRL B-14911] Length = 920 Score = 60.7 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 75/214 (35%), Gaps = 56/214 (26%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E P + +V+D SGSM +K+ K A + Sbjct: 399 EMPSLGLMIVMDRSGSMAG--------------------SKLELAKEAAA------RSVE 432 Query: 247 HVKEDVYMGLIGYTTR---VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 ++E +G I + R + + + V + T+ ++++AY+ Sbjct: 433 LLREKDTLGFIAFDDRPWVIVETGPLE----DKKDAVDKIGSVTPGGGTEIFTSLEKAYE 488 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + K + +K II LTDG++ + I + KEN I Sbjct: 489 ELENLKLQ------------------RKHIILLTDGQSARSTDYESMI---ETGKENNIT 527 Query: 364 IVTISINASPNGQRLLKTCVS-SPEYHYNVVNAD 396 + T+++ S + LL+ Y+V ++ Sbjct: 528 LSTVAL-GSDADRNLLEELAGLGAGRFYDVTDSS 560 >gi|78776847|ref|YP_393162.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497387|gb|ABB43927.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 307 Score = 60.7 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 48/233 (20%), Positives = 87/233 (37%), Gaps = 34/233 (14%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 R++ R I L +D SGSM+ D E + P +K ++ K F+ Sbjct: 73 RVDPLNRNGKDIVLAIDASGSMNS-TGFDFEGEAALP----QKLSRFEIAKIVASEFIQK 127 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 L +V G++ Y ++ + + ++ + + T A+ + Sbjct: 128 -RLSDNV------GIVLYGDFAFIASPITYEKNIIIEMLSYLNQGMAGQNTAIGEAIAMS 180 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K +S K ++ LTDGE+N+ + + AKE Sbjct: 181 LRAFKHSKAKS------------------KIVVLLTDGEHNSGDISPKDALV--LAKEEN 220 Query: 362 IKIVTISI-NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 IKI TI + N + LLK S + NA L ++++I +L + Sbjct: 221 IKIYTIGMGNRGEADEALLKKIADESGGEFFYATNAKELKEIYEHIDELESSK 273 >gi|145299821|ref|YP_001142662.1| flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|88866595|gb|ABD57363.1| FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|142852593|gb|ABO90914.1| putative flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] Length = 460 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 13/108 (12%), Positives = 41/108 (37%), Gaps = 6/108 (5%) Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS---NV 348 T++ + +++L+ + + +K ++ +DGE+ ++ + Sbjct: 337 TNTAEGVMWGWRLLSPQWQGRWGQGAAELPRPYGQADNRKIMVLFSDGEHMGPEAALRDR 396 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 + +C + K I++ T++ + C S + Y + + Sbjct: 397 KQLLLCREMKRKGIQVYTVAFEGDAR---FVAQCASDRSHAYKATSGN 441 >gi|126649837|ref|ZP_01722073.1| hypothetical protein BB14905_16605 [Bacillus sp. B14905] gi|126593556|gb|EAZ87501.1| hypothetical protein BB14905_16605 [Bacillus sp. B14905] Length = 865 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 43/236 (18%), Positives = 81/236 (34%), Gaps = 60/236 (25%) Query: 165 LNQRYNQKIVSFIPALLRIEMGER-PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L + I + +P + I+ E+ P + +V+D SGSM Sbjct: 380 LGGYFKTPIETLLPVEMEIKGKEQLPSLGLAIVLDRSGSMSG------------------ 421 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT---TRVEKNIEPSWGTEKVRQYV 280 +K+ K A ++ + +G I + + + + V Sbjct: 422 --SKLELAKEAAARSVEMLRDEDT------LGFIAFDDRPWEIIETGPL----NNKEEAV 469 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T+ ++ +AY+ L K + +K II LTDG+ Sbjct: 470 DTILSVTPGGGTEIYGSLAKAYENLADIKLQ------------------RKHIILLTDGQ 511 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKTCVSSPEYHYNVVN 394 N + ++ K+N I + T++I +A N L S YNV++ Sbjct: 512 ----SQPGNYEDLIEQGKDNGITLSTVAIGQDADANLLEALSEMGS--GRFYNVID 561 >gi|320102588|ref|YP_004178179.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] gi|319749870|gb|ADV61630.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] Length = 688 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 72/222 (32%), Gaps = 53/222 (23%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L++D+S SM + + A FLD D Sbjct: 511 LAILLLIDVSSSMAGP--------------------PLDEAREAARSFLDQCDF-----T 545 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GLI Y+ +V + + KV + R T+ A++ + L + Sbjct: 546 TTRVGLISYSDQVVLQTDLTDNVRKVEAGLARLEAD---GTTNLAGALELGRRKLAT--- 599 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 V + K+++ LTDG N + AK + I+IV I Sbjct: 600 -----------VPTGHV---KYLVVLTDG---YPDDPDNALLEAAHAKGSGIEIVAIG-- 640 Query: 371 ASPNGQRLLKTCVSS-PEYHYNVVNADSLIHVFQNISQLMVH 411 Q L S+ + L+ F +I++++ Sbjct: 641 TGEADQAYLDRIASTQAGSIFARK--GELVRAFGHIARVIAE 680 >gi|78484419|ref|YP_390344.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362705|gb|ABB40670.1| Type A von Willebrand factor-like [Thiomicrospira crunogena XCL-2] Length = 349 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 78/232 (33%), Gaps = 52/232 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + + + ++ A+K+ + F+ ++ Sbjct: 104 DLMLAVDLSGSME----------KTDMPLRGVEVDRLTAVKSVVKNFIQK-------RQG 146 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 MGL+ + ++ ++ V + + T A+ A + L + ++ Sbjct: 147 DRMGLVVFGSQAFLQSPLTYDLNTVETLLNETEIGMAGNNTAIGDAIGIALKHLHQNSEK 206 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +I LTDG N V + +A+E +KI TI I Sbjct: 207 ------------------KAVLILLTDGSNTAGA--VQPLDAAKQAQEMGLKIYTIGIGQ 246 Query: 372 SPN--------------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + L+ + + + + L ++Q I QL Sbjct: 247 NQATGLDAFIFGPNRNMDTTTLQKIAELTQGRFFMAKDTNQLNEIYQLIDQL 298 >gi|150024244|ref|YP_001295070.1| BatA protein [Flavobacterium psychrophilum JIP02/86] gi|149770785|emb|CAL42250.1| BatA protein [Flavobacterium psychrophilum JIP02/86] Length = 333 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 48/260 (18%), Positives = 83/260 (31%), Gaps = 66/260 (25%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + + I + +DLS SM D K +M ALK Sbjct: 73 MARPRTVDVSNKRNTTNGIDIVMAIDLSSSMLAK---------------DFKPNRMEALK 117 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR-DMDSLILKP 291 F+++ ++ +G++ YT + V + D+++ Sbjct: 118 EVAASFVEA-------RQSDRIGVVVYTAEAYTKTPVTSDKAVVLDAINTIKYDNVLQDG 170 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A L K +S K II +TDG NN T Sbjct: 171 TGIGMGLATAVNRLKDSKAKS------------------KVIILMTDGVNNAGFIEPVTA 212 Query: 352 KICDKAKENFIKIVTISI-----------NASPNG-----------QRLLKTCV-SSPEY 388 + AKE IK+ TI I A G ++L+K + Sbjct: 213 A--EFAKEFGIKVYTIGIGTNGNAPFPYAIAPNGGFLYKMLPVEIDEQLMKDIAKKTGGK 270 Query: 389 HYNVVNADSLIHVFQNISQL 408 ++ + SL ++ I++L Sbjct: 271 YFRAQSNSSLESIYSEINKL 290 >gi|329888464|ref|ZP_08267062.1| hypothetical protein BDIM_03870 [Brevundimonas diminuta ATCC 11568] gi|328847020|gb|EGF96582.1| hypothetical protein BDIM_03870 [Brevundimonas diminuta ATCC 11568] Length = 650 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 19/173 (10%), Positives = 47/173 (27%), Gaps = 40/173 (23%) Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 ++ T + A+ L+ D F Q K I +TDGE Sbjct: 479 SKIDQMSAGGSTAGQIGIAWAWYALSPDFASLFSGEG--QPGAYAPSDTLKVAILMTDGE 536 Query: 341 NNNFKSNV----------------------------NTIKICDKAKENFIKIVTISIN-- 370 N + ++ +C + + + T+ + Sbjct: 537 FNTPFRDGVIALDAGTGSGGLDSHIDLNSSNGDPFAQSVALCQAMQAKGVVVYTVGFDLG 596 Query: 371 -------ASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +++ C ++ + + + L F+ I + + + + Sbjct: 597 SATGREGVVDTALDVMRECATNEQTHFFQADDGTDLKEAFRAIGRDITRLRIA 649 >gi|223558081|gb|ACM91085.1| aerotolerance protein BatA [uncultured bacterium Rlip1] Length = 332 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 74/239 (30%), Gaps = 63/239 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I + +D+SGSM D K ++ A KN F+ + Sbjct: 92 IDIVMAMDVSGSMLAR---------------DLKPDRLTAAKNVASDFVK-------GRP 129 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 MGL+ ++ + + + + + LI T + A L Sbjct: 130 GDRMGLVIFSGETFTQVPLTTDHGVMLNMLAEMKNGLIDDGTAIGDGLATAISRL----- 184 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 S K +I LTDG NN + T + AK I++ TI + Sbjct: 185 -------------KDSEAISKVVILLTDGMNNAGSVDPYTAA--EIAKLYGIRVYTIGVG 229 Query: 370 -------------------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 ++LL + S + ++ + L +++ I +L Sbjct: 230 SYGTAPYPVQTPFGTQIQQMKVEIDEKLLASVASMTGGKYFRATSNQKLDEIYEEIDKL 288 >gi|325954650|ref|YP_004238310.1| von Willebrand factor type A [Weeksella virosa DSM 16922] gi|323437268|gb|ADX67732.1| von Willebrand factor type A [Weeksella virosa DSM 16922] Length = 338 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 42/255 (16%), Positives = 79/255 (30%), Gaps = 67/255 (26%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + ++ I L VD S SM D + ++ ALK A+ + Sbjct: 81 VSTKIKSDKGVDIMLTVDTSLSMLAR---------------DLEPDRLTALK-AVAVKFS 124 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 +GL+ Y+ + + E + + + + T + Sbjct: 125 KERQAD------RLGLVSYSGEALTRVPLTTDREVLIREINALESGELEDGTAIGIGLAT 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTI---KICDK 356 A + K +S K II +TDG E+ N +++ I + Sbjct: 179 AINHIKDSKAKS------------------KVIILMTDGVESINPTNDLMYISPQTAAEM 220 Query: 357 AKENFIKIVTISI----------------------NASPNGQRLLKTCVS-SPEYHYNVV 393 A IK+ TI I ++LL+ + ++ Sbjct: 221 ATSRGIKVYTIGIGTRGLAPFPTAYDMYGNYIFDMMPVDIDEKLLQNIADLTGGLYFRAT 280 Query: 394 NADSLIHVFQNISQL 408 + SL ++Q I +L Sbjct: 281 DNQSLQKIYQEIDRL 295 >gi|91773457|ref|YP_566149.1| von Willebrand factor, type A [Methanococcoides burtonii DSM 6242] gi|91712472|gb|ABE52399.1| hypothetical protein with von Willebrand factor type A domain and Invasin domain [Methanococcoides burtonii DSM 6242] Length = 892 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 61/370 (16%), Positives = 118/370 (31%), Gaps = 50/370 (13%) Query: 57 NNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHI--KESLSGYSAVFY 114 N L + + LS G ++ N F ++ ++ G S Sbjct: 487 NINVTLNNTAPSLGTLSLDGSNESNLINFTTDQYGRVITEFTSSNFVGNCTIIGLSDAIN 546 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 ++ I N IS + A + +S + + + + + ++ Sbjct: 547 DSLSIEIRNQPFISASIDAEPYVVTSGDIVNITTVITVEGELPVSRSAATSMLILDR--- 603 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + + I LV+D SGSM N P+ K K+ Sbjct: 604 ---SGSMDPDYYAGTALDIVLVLDRSGSMKFLGN-----APEQPLTDAKSAAKI------ 649 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGY-TTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPT 292 + ++ + +G++ + +T S + + +DS++ T Sbjct: 650 ---------FMENLLSNTEVGVVSFSSTSTVDRQPVSLNISGNKDLLHNAIDSMVADGGT 700 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 AM A +L + +K +I LTDG + Sbjct: 701 AIGDAMADANNLLIN-----------------GRPDAKKIMIVLTDGVATAGSDRDGSDA 743 Query: 353 ICDKAKENFIKIVTISINASPN-GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMV 410 I A N I+I +I + +S + +LK S + +YN + L V+ IS+ + Sbjct: 744 I-STANLNNIRIYSIGLGSSEYIDEPMLKRIASETGGSYYNAPSGSELQTVYNTISKEIS 802 Query: 411 HRKYSVILKG 420 + I G Sbjct: 803 DFDVTEIDYG 812 >gi|34558787|gb|AAQ75132.1| BatA protein [Alvinella pompejana epibiont 6C6] Length = 300 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 43/239 (17%), Positives = 92/239 (38%), Gaps = 43/239 (17%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + S + ++ + L +D+SGSM S ++ ++++ K Sbjct: 64 LASPFTYEAKELSTKKGR-DLILTIDVSGSMAQKGFSK----------EESEKSRYEVAK 112 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP- 291 F+ + + +G++ + + ++ + + + D I Sbjct: 113 EIAKRFIKN-RFSDN------IGIVIFGSFSFSASPLTYDLKALLEMFDLMSDVGIAGNN 165 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T A+ +A + L S + +S K II LTDG++N K + Sbjct: 166 TAIGDAIFEAIKNLESGEAKS------------------KVIILLTDGKHNFGKKSPKEG 207 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVFQNISQL 408 + +AK+ IKI T+ I + ++LL+ ++ + + N+ L VF+ I +L Sbjct: 208 VV--EAKKRGIKIYTVGI-GTDYDKKLLEKMAKETNAKSFF-AKNSKELEEVFKEIEEL 262 >gi|148975506|ref|ZP_01812377.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] gi|145964934|gb|EDK30185.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] Length = 357 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 51/285 (17%), Positives = 98/285 (34%), Gaps = 42/285 (14%) Query: 142 NTIFYNMDVMTSYDYRLQFIEHLL--NQRYNQKIVSFI---------PALLRIEMGERPI 190 F++ V + + L + I++++ L ++ E+ Sbjct: 41 KVPFFHELVEAIGEAPSEGASQLTPSWWQRTTLIITWVLVVCALAKPTILGEPQVREQLG 100 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + +VVDLSGSM Q KK +++ A K L F + ++ Sbjct: 101 RDVMVVVDLSGSMA---------EQDFTSKQGKKISRLQATKEVLADFAKT-------RK 144 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GLI + + + + + + ++ + T A+ A ++ K Sbjct: 145 GDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSGK 204 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + +K +I LTDG N+ S V I AK ++I I++ Sbjct: 205 Q------MSAEQAQNDIEREKVVIVLTDG--NDTGSFVEPIDAAKVAKAKGVRIHVIAMG 256 Query: 371 ASPN------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 +K S S + +N D L ++ I QL Sbjct: 257 DPQTVGEVALDMETIKRIASESGGEAFEALNRDELSTAYEQIGQL 301 >gi|329894014|ref|ZP_08270022.1| BatA [gamma proteobacterium IMCC3088] gi|328923357|gb|EGG30676.1| BatA [gamma proteobacterium IMCC3088] Length = 339 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 36/230 (15%), Positives = 75/230 (32%), Gaps = 55/230 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM + ++ ++ A+++ F++ +E Sbjct: 92 DLLLAVDISGSM----------RVEDMVIANQAVRRIDAVRDIGAEFIER-------REG 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + +R S+ + V+Q+++ T A+ A + L + Sbjct: 135 DRVGLILFGSRAYMQSPLSFDRDTVKQFLSEAQIGFAGSETAIGDALGLAVKRLRDKE-- 192 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +I LTDG++ + A +K+ TI I A Sbjct: 193 ----------------DGDRVVILLTDGQDTASSVDPLDAT--ALAANYGVKVYTIGIGA 234 Query: 372 SPN-----------------GQRLLKTCV-SSPEYHYNVVNADSLIHVFQ 403 + L S+ ++ + D L ++ Sbjct: 235 DEMLVPSLFGNRRVNPSAELDEETLSAMAESTGGRYFRARSPDELAKIYD 284 >gi|313207255|ref|YP_004046432.1| von willebrand factor type a [Riemerella anatipestifer DSM 15868] gi|312446571|gb|ADQ82926.1| von Willebrand factor type A [Riemerella anatipestifer DSM 15868] gi|315023479|gb|EFT36485.1| aerotolerance operon BatA [Riemerella anatipestifer RA-YM] gi|325335298|gb|ADZ11572.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Riemerella anatipestifer RA-GD] Length = 330 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 39/207 (18%), Positives = 68/207 (32%), Gaps = 43/207 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 I + + E I L +D+S SM D + ++ ALK Sbjct: 70 IARPRTFSISEDRDETKGMDIVLSIDVSLSMLAK---------------DLEPDRLTALK 114 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 F+ + +GL+ Y+ + + V + + + T Sbjct: 115 EIARTFIKQ-------RTTDRIGLVEYSGEALMRVPLTSDHRVVEEELMSFNPMDLEGGT 167 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + A L K +S K II +TDG N + ++ + Sbjct: 168 NIGDGLAVAVSHLRKSKAKS------------------KIIILMTDGVNTIDNA-MSPLT 208 Query: 353 ICDKAKENFIKIVTISINASPNGQRLL 379 + A+ N IK+ TI I NG L+ Sbjct: 209 AAELARNNDIKVYTIGIG--SNGLALM 233 >gi|229590954|ref|YP_002873073.1| hypothetical protein PFLU3509 [Pseudomonas fluorescens SBW25] gi|229362820|emb|CAY49730.1| putative exported protein [Pseudomonas fluorescens SBW25] Length = 362 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 79/227 (34%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM Q + ++++ +K+ L FL+ +E Sbjct: 91 DLLVAVDVSGSMD----------FPDMNWQGEDVSRLSLVKHLLGDFLE-------GREG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTWLDEARIGIAGKNTAIGDAIGLALKRLRQRPAQ 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + +I +TDG NN + + T A E +KI I I A Sbjct: 194 S------------------RVLILVTDGANNAGQIDPLTAAR--LAAEEGVKIYPIGIGA 233 Query: 372 SPN----------------GQRLLKTCV-SSPEYHYNVVNADSLIHV 401 P + LK ++ ++ + + L + Sbjct: 234 DPEQTGSLGILGVNPSLDLDEPALKAIAEATGGQYFRARDGEELQAI 280 >gi|197336671|ref|YP_002158318.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197313923|gb|ACH63372.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 321 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 45/233 (19%), Positives = 80/233 (34%), Gaps = 56/233 (24%) Query: 192 LIELVVDLSGSMH-CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVDLSGSM M + D ++ A+K + F+D ++ Sbjct: 84 DMMLVVDLSGSMAEEDMKTSNGDFVD----------RLTAVKQVVSDFIDQ-------RK 126 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + ++ VR+ + R + L+ + T + A + Sbjct: 127 GDRLGLVLFGDHAYLQTPLTFDRNTVREQLDRTVLRLVGQMTAMGEGLGLATKTFIESNA 186 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 Q+ II L+DG N + ++ AK+N KI T+ I Sbjct: 187 ------------------PQRTIILLSDGANTAGV--LEPLEAAQLAKDNHAKIYTVGIG 226 Query: 370 ----------------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 A + L + + ++ NAD L ++Q I Sbjct: 227 AGEMQVRGFFGKQTVNTARDLDEDTLTKIATMTGGQYFRARNADELAEIYQTI 279 >gi|312961300|ref|ZP_07775805.1| von Willebrand factor, type A [Pseudomonas fluorescens WH6] gi|311284958|gb|EFQ63534.1| von Willebrand factor, type A [Pseudomonas fluorescens WH6] Length = 362 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 79/227 (34%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM +D+ ++++ +K+ L FL +E Sbjct: 91 DLLVAVDVSGSMD----------FPDMHWRDEDVSRLSLVKHLLGDFLQQ-------REG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTWLDEARIGIAGKNTAIGDAIGLALKRLRLRPAQ 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + +I +TDG NN + + T A E +KI I I A Sbjct: 194 S------------------RVLILITDGANNAGQIDPLTAAR--LAAEEGVKIYPIGIGA 233 Query: 372 SPN----------------GQRLLKT-CVSSPEYHYNVVNADSLIHV 401 P + LK ++ ++ + + L + Sbjct: 234 DPEQTGSLGILGVNPSLDLDEPALKAIAAATGGQYFRARDGEELQAI 280 >gi|163848654|ref|YP_001636698.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526590|ref|YP_002571061.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669943|gb|ABY36309.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450469|gb|ACM54735.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 947 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 41/236 (17%), Positives = 76/236 (32%), Gaps = 46/236 (19%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL---LLFLDSIDL 244 RP I V+D SGSM +DP+ K+ K+A+ L D Sbjct: 407 RPDLAIVFVIDKSGSMDACHCADPD---RGAPITSSSERKIDIAKDAIVQAAALLGPQDT 463 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + G++ + + G V Q + PT+ + +A ++ Sbjct: 464 V---------GVVTFDGAASATFPATRG-ATVEQVMDAVSGVEPRGPTNIRAGLLRAEEM 513 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L R K +I LTDG + S + + + + +E I + Sbjct: 514 LQQVDARI------------------KHMILLTDG----WGSGGDQLDLAARLREQGITL 551 Query: 365 VTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLIHVF-----QNISQLMVHRKY 414 ++ A LK + +Y + + +F I +V + + Sbjct: 552 TVVA--AGSGSAAYLKQLAAEGGGRYYPAADMAEVPQIFVQETITAIGNYIVEQPF 605 >gi|266620637|ref|ZP_06113572.1| putative von Willebrand factor type A domain protein [Clostridium hathewayi DSM 13479] gi|288867752|gb|EFD00051.1| putative von Willebrand factor type A domain protein [Clostridium hathewayi DSM 13479] Length = 2963 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 40/231 (17%), Positives = 83/231 (35%), Gaps = 34/231 (14%) Query: 192 LIELVVDLSGSM-HCAMNSDPEDVNSA-PICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + LV+D SGSM ++ K ++ + A++ F S Sbjct: 561 DVILVIDSSGSMEGEKWSTAKTAAKGFIDNLYQNKDGVVSDDRIAIVDFDSSAKAYPGTN 620 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL---KPTDSTPAMKQAYQILT 306 G + +V+ I T + Y+ + + TD A++ A ++ Sbjct: 621 ----SGSETF-LKVDDKITIKNKTYSAKDYLKSYVLDSQMKDTGGTDYNKALQTAQSVIN 675 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-----NTIKICDKAKENF 361 + + S +I+F++DGE N + + + K + K + Sbjct: 676 NRRDSSRPA----------------YIVFMSDGEPNGYWDWLTYRYYDGQKYATELKSDG 719 Query: 362 IKIVTISIN-ASPNGQRLLKTCVSSPEYHY--NVVNADSLIHVFQNISQLM 409 + I ++ +N S N + + S P Y N+V L+ ++ I+ + Sbjct: 720 VTIYSLGLNIGSTNFNKFIVPLASDPTSTYAKNIVKTSDLVGIYDAIASSI 770 >gi|212635916|ref|YP_002312441.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212557400|gb|ACJ29854.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 333 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 70/235 (29%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + D K T L ++ Sbjct: 86 DLMLSVDLSGSMQIE-----------DMVIDGKVTDRFTLIQHVISQFIERRKGD----- 129 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V QY+ L+ + T A+ + + Sbjct: 130 -RIGLILFADHAYLQSPLTQDRRTVAQYLNEAEIGLVGRQTAIGEAIALGVKR-FDQVEN 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + +I LTDG NN + D A + I I T+ + A Sbjct: 188 SN-----------------RVLILLTDGSNNAGSISPEQAT--DIAAKRGITIYTVGVGA 228 Query: 372 SPNGQRLL------------------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +R L + + ++ N + L ++Q I +L Sbjct: 229 EVMERRTLFGKERVNPSMDLDETQLKQIAEKTGGSYFRARNTEELERIYQEIDKL 283 >gi|170739681|ref|YP_001768336.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168193955|gb|ACA15902.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 329 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 82/227 (36%), Gaps = 48/227 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +DLSGSM S + +++AA+K F+ + Sbjct: 102 IMLALDLSGSMERVDFSIDG----------RNVSRLAAVKQVGADFIRR-------RAGD 144 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + + S+ T V + L+ + T + A + L + R Sbjct: 145 RIGLVIFADQADVAASLSFDTAAVVHALEEAQIGLVGRSTGIGDGLGLALKRLDAATAR- 203 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--- 369 +K +I L+DG NN ++ + + A++ I++ TI++ Sbjct: 204 -----------------EKVVILLSDGANNAGQTTPHDVA--GLARDLGIRVHTIALGPR 244 Query: 370 -NASPNG-------QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + +G + L +S + V D L V I ++ Sbjct: 245 DLSDADGDPDVVDTEALRDVSATSGGRFFRVRTTDDLAAVADAIDEV 291 >gi|193214188|ref|YP_001995387.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193087665|gb|ACF12940.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 340 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 72/241 (29%), Gaps = 64/241 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +DLSGSM + + ++ A K+ F+ L Sbjct: 98 IDIVLAIDLSGSMLAE--------------DFEPKNRIEAAKSVATDFIHQ-RLSD---- 138 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDK 309 +GL+ ++ + + + +++ I T A+ A L Sbjct: 139 --RIGLVVFSGKSFTQCPLTLDYRLLTNFISELKAGTIEEDGTAIGTAIATATNRL---- 192 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS- 368 S K II LTDG+NN + T + A IKI T+ Sbjct: 193 --------------RESTAKSKVIILLTDGQNNAGEIEPVTAA--ELAAALGIKIYTVGA 236 Query: 369 ---------------------INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + S ++ + +SL + I + Sbjct: 237 GTRGYARYPIPDPLFGKRYVQMKVDVDDSTLTRIARISGGRYFRATDLESLKKTYHEIDE 296 Query: 408 L 408 L Sbjct: 297 L 297 >gi|315126124|ref|YP_004068127.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] gi|315014638|gb|ADT67976.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] Length = 327 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 77/233 (33%), Gaps = 53/233 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VDLSGSM + ++ +K L F++ ++ Sbjct: 88 DIMLAVDLSGSM----------TEQDMAYNGQYVDRLTMVKAVLSDFIEQ-------RQG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + ++ L+ + T A+ + + S K+ Sbjct: 131 DRLGLILFGDTAFLQTPLTRDVKTVSKMLSEAQIGLVGRATAIGDALGLSVKRFASKKES 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS--- 368 + + ++ LTDG+N N + A+E IK+ TI Sbjct: 191 N------------------RIVVLLTDGQNTAGNLNPEDALL--LAREEGIKVYTIGVGS 230 Query: 369 --------INASPNGQR----LLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 N G LLK + ++ + L ++ + +L Sbjct: 231 DNPRGFSLFNMGSGGSNLDEGLLKKIAEQTGGLYFRAKDVAGLQQIYAELDKL 283 >gi|88858061|ref|ZP_01132703.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] gi|88819678|gb|EAR29491.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] Length = 328 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 79/232 (34%), Gaps = 52/232 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VDLSGSM V Q + +++ +K L F+ ++ Sbjct: 88 DIMLAVDLSGSM----------VEQDMAYQGRYVDRLSMVKAVLKNFIAQ-------RQG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V + + L+ + T A+ A + + + Sbjct: 131 DRLGLILFGDTAFLQTPLTRDLNTVSKMLEEAQIGLVGRATAIGDALGLAVKRFSQKQDS 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + ++ LTDGEN + A+E IK+ T+ + + Sbjct: 191 N------------------RILVLLTDGENTAGNLAPEEALL--LAREEGIKVYTVGVGS 230 Query: 372 SPNGQ--------------RLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + LL+ + + ++ + SL ++Q + +L Sbjct: 231 QGGNRFNLFSMSGSSSLDESLLQKIATETGGLYFRATDVASLQQIYQELDKL 282 >gi|188578240|ref|YP_001915169.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|188522692|gb|ACD60637.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 335 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 78/238 (32%), Gaps = 54/238 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM+ + K ++ A K L FLD Sbjct: 96 REARQMMLAVDLSGSMN----------EPDMVLGGKVVDRLTAAKAVLSDFLDR------ 139 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GL+ + R + VR + + L + T A+ + + L Sbjct: 140 -RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRLRE 198 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ Q+ ++ LTDG N + +K + AK ++I TI Sbjct: 199 QKQG------------------QRVVVLLTDGVNTAGVLDP--LKAAELAKAEGVRIYTI 238 Query: 368 SINA-----------SPNGQR------LLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + G L K + + + + L ++ + +L Sbjct: 239 AFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYAELDRL 296 >gi|327541056|gb|EGF27607.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 497 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 68/453 (15%), Positives = 131/453 (28%), Gaps = 82/453 (18%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 RF + S + +++ A + LL F I + K + A +AA AG Sbjct: 71 VRFALTQRDDSRRGGITVLMAFVLPMLALLAAFCINLAQMQLVKTELAIATDAAARAGGR 130 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + + A + + + + F + + N+ R Sbjct: 131 AFSEEQTVEAAKAAARLTAAMNEVAGEPYQLNTDDSANEFE-----FGVSAQTD-GNTGR 184 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 T + + + ++ + D L + I F P M Sbjct: 185 FYFTKVPTSDVAANLVAVSSVRINGKRTDDSLLGPVPFIFPNTFS-IGDFSPVASATAMQ 243 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDV------------------------NSAPICQ- 221 I LV+D SGSM P+D N P Sbjct: 244 VDR--DISLVLDRSGSMDWKTYDWPDDADPWGEDSLISAEDAGIVDLEWKYRNGQPQYIR 301 Query: 222 -------------------------DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 T L A+ FL +D ++ + Sbjct: 302 RVSYNRGYDEYDLYDHAWEEVFGLGPAPNTPWEDLVLAVDAFLRVLDQTPQNEQV---SI 358 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 Y + + + VR V + + T M T + R + + Sbjct: 359 ASYNSHGTLDCWLLDDFDSVRAAVAQLGPN---GSTGIGNGMNSGKTAFTHENARPYAS- 414 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 K ++ +TDG N+N+ + NT+ + + I T++ + + Sbjct: 415 --------------KTMVVMTDG-NHNYGTQPNTVAQ-QLMSSSNLNIQTVTFGGGADQE 458 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + V+ HY+ + D L+ F+ I+ + Sbjct: 459 TMQEVAVTGLGRHYHADSGDELVSAFEEIANNL 491 >gi|257469959|ref|ZP_05634051.1| hypothetical protein FulcA4_11506 [Fusobacterium ulcerans ATCC 49185] gi|317064188|ref|ZP_07928673.1| BatA protein [Fusobacterium ulcerans ATCC 49185] gi|313689864|gb|EFS26699.1| BatA protein [Fusobacterium ulcerans ATCC 49185] Length = 319 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 82/238 (34%), Gaps = 61/238 (25%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ++ I + +DLS SM + +D ++ K L F+D Sbjct: 78 KKDGIDIAISLDLSQSM---------------LQEDFTPNRLEKAKEVLSEFIDK----- 117 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK--QAYQI 304 + D + LI + + ++ +++ + I T + M A Sbjct: 118 --RTDDRLALIVFGGDAYTKVPLTFDHNVIKEMTGKLTVDDITSNTRTAIGMGIGVALNR 175 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L + +S K II LTDGENN+ + + + D AKE IKI Sbjct: 176 LKDSEAKS------------------KVIILLTDGENNSGEMSPSAAA--DIAKELGIKI 215 Query: 365 VTISINA----------------SPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 TI I A + + +LK+ ++ ++ ++ +F I Sbjct: 216 YTIGIGAKEIKVPSFFGYTTVKNTELDENMLKSIAETTGGEYFRASDSKEFKEIFNKI 273 >gi|325273881|ref|ZP_08140055.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] gi|324100983|gb|EGB98655.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] Length = 311 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 68/183 (37%), Gaps = 37/183 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM +++ +++ +K L FL +E Sbjct: 44 DLLVAVDVSGSMD----------FPDMQWKNEDISRLDLVKALLGDFLQ-------DREG 86 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L + Sbjct: 87 DRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAQIGIAGKNTAIGDAIGLAVKRLRQRPAQ 146 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG NN + + T A + ++I TI I A Sbjct: 147 S------------------RVLVLITDGANNGGQIHPLTAAR--LAAQEGVRIYTIGIGA 186 Query: 372 SPN 374 +P Sbjct: 187 NPE 189 >gi|182414212|ref|YP_001819278.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841426|gb|ACB75678.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 377 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 44/264 (16%), Positives = 86/264 (32%), Gaps = 60/264 (22%) Query: 173 IVSFIPALLRIEMGERPI------FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 ++ I AL R + E + + L +DLSGSM D + Sbjct: 87 VILIIAALARPQKVEDKRDVHSQGYDLMLCIDLSGSMLSEDYERGGD----------RIN 136 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 ++ A+K + F++ + +G++ ++ R ++ + + R Sbjct: 137 RLQAIKPVIQAFIER-------RPSDRIGIVLFSGRAYTMAPLTFDHRWLGSQLERIKVG 189 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 LI T + LT ++ + RQG F++ LTDG NN Sbjct: 190 LIEDGTAIGDGLGVG---LTRLEQAQRESGGKRQGA---------FVVLLTDGANNRGSL 237 Query: 347 NVNTIKICDKAKENFIKIVTIS--------INA-SPNGQRL-------------LKTCVS 384 + AK I + TI G++L L+ Sbjct: 238 TPQQAA--ELAKARGIPVYTIGAGQDGIVPFPVFDDKGRKLGYRRIMSDLDEGALRDIAE 295 Query: 385 -SPEYHYNVVNADSLIHVFQNISQ 407 + + + + ++ F+ I + Sbjct: 296 MTGGHFFRAADVGTVESAFRAIDR 319 >gi|110598614|ref|ZP_01386881.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] gi|110339783|gb|EAT58291.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] Length = 336 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 76/247 (30%), Gaps = 59/247 (23%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 ++R E + L +D+S SM S ++++ A + F+ Sbjct: 88 VVRQTEAEARGIDVMLALDISESMLQKDGSG--------------KSRLDAAREVARKFV 133 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAM 298 + S +GL+ + + + + + + +I T A+ Sbjct: 134 --LRRSSD-----RIGLVVFRGKGYTQCPLTIDHDVLAMLIDHISPQVIQDEGTAIGSAI 186 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A S QK II +TDGENN T A Sbjct: 187 LIATNRF------------------KGSTSLQKVIILITDGENNTGDVGPATAAT--LAA 226 Query: 359 ENFIKIVTI-----------SINASPNGQRLLKTCV------SSPEYHYNVVNADSLIHV 401 +N I+I + +++A + + ++ ++ + L + Sbjct: 227 QNGIRIYVVNAGFKSGGSAGNLSAESSAHAAMDEASLRGIARTTGGGYFRAEDPSVLDNT 286 Query: 402 FQNISQL 408 + I +L Sbjct: 287 IKTIGRL 293 >gi|169826904|ref|YP_001697062.1| hypothetical protein Bsph_1324 [Lysinibacillus sphaericus C3-41] gi|168991392|gb|ACA38932.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41] Length = 825 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 82/236 (34%), Gaps = 60/236 (25%) Query: 165 LNQRYNQKIVSFIPALLRIEMGER-PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L + I + +P + I+ E+ P + +V+D SGSM Sbjct: 340 LGGYFKTPIETLLPVEMEIKGKEQLPSLGLVIVLDRSGSMSG------------------ 381 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT---TRVEKNIEPSWGTEKVRQYV 280 +K+ K A ++ + +G I + + + + V Sbjct: 382 --SKLELAKEAAARSVEMLRDEDT------LGFIAFDDRPWEIIETGPL----NNKEEAV 429 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T+ ++ +AY+ L K + +K II LTDG+ Sbjct: 430 DTILSVTPGGGTEIYGSLAKAYENLADMKLQ------------------RKHIILLTDGQ 471 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKTCVSSPEYHYNVVN 394 + + + ++ K+N I + T++I +A N L S YNV++ Sbjct: 472 SQPGNYD----DLIEQGKDNGITLSTVAIGQDADANLLEALSEMGS--GRFYNVID 521 >gi|149922008|ref|ZP_01910450.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] gi|149817173|gb|EDM76653.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] Length = 996 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 88/237 (37%), Gaps = 56/237 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+P + LV+D SGSM ++ +K A + L Sbjct: 524 EQPTLALILVIDKSGSMSSG-------------------DRLDLVKEAARA---TARTLD 561 Query: 247 HVKEDVYMGLIGYTT--RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 E +G+I + +V ++P+ ++ + R T++ PA+++AY Sbjct: 562 PSDE---IGVIAFDNSPQVLVRLQPAANRLRISSSIRRLSAG---GGTNAMPALREAYLQ 615 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L S K +I L+DGE + ++ +N + +++ I + Sbjct: 616 LA------------------GSKALVKHVILLSDGE--SPENGIN--ALLGDMRQSDITV 653 Query: 365 VTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 ++ + G+ L ++ + + +F ++ + ++ +++ +G Sbjct: 654 SSVGV-GDGAGKDFLIRVAERGRGRYFYSEDGTDVPRIFSREAREV--KRNALVERG 707 >gi|85716351|ref|ZP_01047324.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] gi|85696867|gb|EAQ34752.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] Length = 542 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 25/149 (16%), Positives = 50/149 (33%), Gaps = 21/149 (14%) Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T+ + + +Q L++ K +Q +I+ L+DG Sbjct: 400 NQIDAMTPSGNTNQSIGLAWGWQSLSTTNGPIAAP------GKESGYVYQDYIVLLSDGL 453 Query: 341 NNN------------FKSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSP 386 N + +C K K++ + I TI +N ++L+ C S Sbjct: 454 NTQNRWYSCPPSGPCPTIDARQALLCQKVKDSGVTIFTIQVNVGSKDPLSQVLQNCASD- 512 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + +A FQNI + + + Sbjct: 513 GNFQMITSATETADAFQNILTQISQLRLA 541 >gi|254514588|ref|ZP_05126649.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] gi|219676831|gb|EED33196.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] Length = 347 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/239 (15%), Positives = 78/239 (32%), Gaps = 59/239 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM ++ ++ A+K L S Sbjct: 93 DLMLAVDISGSM----------RVEDMQVGNRMARRIDAVKQ-LGSDFMSRRSGD----- 136 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + +R S+ + V++++ + T A+ A + L Sbjct: 137 -RLGLILFGSRAYLQSPLSFDIQTVQRFLLESQIGFAGQETAIGDAIGLAVKRLQERPAT 195 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 S + +I LTDG++ + + A + ++I TI I Sbjct: 196 S------------------RVLILLTDGQDTASTVDPLEAA--NLAADLGVRIYTIGIGA 235 Query: 370 --------------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +A + L+ S+ ++ + + L V++ + +L Sbjct: 236 DSLTLPGLLGSPLGARTVNPSADLDENSLIAIASSTGGQYFRARDPEELATVYRLLEKL 294 >gi|258654082|ref|YP_003203238.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258557307|gb|ACV80249.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 618 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 79/222 (35%), Gaps = 44/222 (19%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + +V+D+SGSM E +K+ K A L + Sbjct: 428 VLVVMDVSGSMASESGYGSE-------------SKLDLAKKAATSALGQLTDTD------ 468 Query: 253 YMGLIGYTTRVEKNIEPSW---GTEKVRQY----VTRDMDSLILKPTDSTPAMKQAYQIL 305 MGL +TT + + G + Q + L T A ++A + + Sbjct: 469 QMGLWAFTTDLPTPDTITADLVGVGPLAQTRQPIIDAISSLTPLNGTPLYAATREAAKAM 528 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI--KICDKAKENFIK 363 + K + ++ LTDG N ++++ + ++ A+E+ ++ Sbjct: 529 NAQKDPNSINA----------------VVVLTDGRNEYTDNDLDGLLRELNASAEEDGVR 572 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + TI+ + L + +S Y+ N S+ VF ++ Sbjct: 573 VFTIAYGPDADLATLQEISEASRAAAYDARNPTSIDKVFSDV 614 >gi|325286051|ref|YP_004261841.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321505|gb|ADY28970.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 332 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 48/292 (16%), Positives = 89/292 (30%), Gaps = 66/292 (22%) Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLL-NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 ++T + + S + F+ LL + + I + +D+ Sbjct: 40 SSTQGFTDTSILSKLKPILFVLRLLAMAAIITALARPQTKDVSTRTKTTKGIDIVMAIDV 99 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 S SM +D K ++ ALK F+ + + +GL+ Y Sbjct: 100 SSSMLA---------------RDLKPDRLTALKKVAADFIKK-------RPNDRIGLVAY 137 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 + V + + + T + A L K +S Sbjct: 138 AAESYTKTPITSDKSIVLSSLRQITHGQLEDGTAIGMGLATAVNRLKDSKSKS------- 190 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN----- 374 K II LTDG NN+ T D A E IK TI + + N Sbjct: 191 -----------KVIILLTDGVNNSGFIEPKTAA--DLAVEYKIKTYTIGLGTNGNALTPI 237 Query: 375 -----------------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 ++LL+ + + ++ + + L ++ I++L Sbjct: 238 AFNPDRTYRYGMRQVEIDEKLLEEIATVTGGQYFRATDNEKLSAIYNEINKL 289 >gi|228472814|ref|ZP_04057572.1| BatA protein [Capnocytophaga gingivalis ATCC 33624] gi|228275865|gb|EEK14631.1| BatA protein [Capnocytophaga gingivalis ATCC 33624] Length = 332 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 47/288 (16%), Positives = 90/288 (31%), Gaps = 69/288 (23%) Query: 148 MDVMTSYDYRLQFIEHLL-NQRYNQKIVS---FIPALLRIEMGERPIFLIELVVDLSGSM 203 + ++++ RL+ I ++L I++ + + I L +D+S SM Sbjct: 44 VRSVSTWRTRLRPILYILRGLSLVCLIIALARPRSSSEITKTKTTEGIDIILSIDMSSSM 103 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 D K ++ ALK F+ S +G++ Y+ Sbjct: 104 LAK---------------DLKPNRIEALKRVAAQFIQQ--RASD-----RIGIVVYSGES 141 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + + V Q + I T + A L K +S Sbjct: 142 YTKVPATTDKSIVLQALKEIRQGEIEDGTAIGMGLGTAINRLKDSKTKS----------- 190 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-------------- 369 K II +TDG NN + + + AKE I++ TI I Sbjct: 191 -------KVIILMTDGVNNTGVIDPLSAA--ELAKEYGIRVYTIGIGTNGKALSPVAYNP 241 Query: 370 -------NAS--PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + + ++ + + L ++ I +L Sbjct: 242 DGSFQYDMVPVEIDEKLLAEISKITGGKYFRATDNNKLAQIYTEIDKL 289 >gi|190892054|ref|YP_001978596.1| hypothetical protein RHECIAT_CH0002466 [Rhizobium etli CIAT 652] gi|190697333|gb|ACE91418.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 427 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 60/446 (13%), Positives = 131/446 (29%), Gaps = 94/446 (21%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I+ NF I+ AL ++ + G + + + +A +AA + + Sbjct: 9 ISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGS---IAEKSGA 65 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + N DA+ + + L+ + + Sbjct: 66 VAAAMAMNGNGTISLGKTDARNIFMSQVSGELA----------------EVHVDLGIDVT 109 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + N+ + + V T+ + + I E Sbjct: 110 KTANKLNSQVSFTATVPTT-------FMQIFGRDSIT-----ISGTATAEYQTAAFMDFY 157 Query: 195 LVVDLSGSM----------------HCAMNSDPEDVNSAPICQDKK---RTKMAALKNAL 235 +++D + SM CA D ++ K ++ ++ A Sbjct: 158 ILLDNTPSMGVGATPSDVSKLEAKVGCAFACHQMDKSTNNYTIAKSLGVAMRIDVVRQAT 217 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D+ + MG+ + T+ E + + +T+ + TD+ Sbjct: 218 QALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTT--ISGLTSDLTK-----VKSYTDAV 270 Query: 296 PAMKQAYQILTSDKKRSFFTNFFR-------QGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 M YQ SD+ +F + + G + +K + F++DG +++K + Sbjct: 271 DLMTIPYQNYNSDQITNFDSAMTQMNTIIDPAGDGTSNTSAEKILFFVSDGVGDSYKPST 330 Query: 349 NTIKI-------------CDKAKENFIKI---VTISINASPNG-------------QRLL 379 T K C K+ +KI T + N + Sbjct: 331 CTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQSEIPTKM 390 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNI 405 + C SP +++ V D + + + Sbjct: 391 QACA-SPGFYFEVSPTDGITDAMKAL 415 >gi|52548946|gb|AAU82795.1| conserved hypothetical protein [uncultured archaeon GZfos1C11] Length = 438 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 45/249 (18%), Positives = 93/249 (37%), Gaps = 25/249 (10%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 SY + +L V ++ + + + LV+D+SGSM + D Sbjct: 143 SYALSKDPVSEVLGY---YLSVGLNTGIIESDFQRKK-LNLALVLDISGSMGSSF--DEY 196 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW- 271 + T+ A + + LL H+++D +GL+ + T E S Sbjct: 197 YYDRFGNHVAVNDTEDAEKSKIEIAAAAIVALLDHLEDDDRLGLVLFNTGAELAEPVSLV 256 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 G + +++ ++ T + M+ A T + + +++ ++ Sbjct: 257 GAKNMQKLKGDVLEISATGGTRLSAGMQMA-------------TELYDEFLEVNQSEYEN 303 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT--ISINASPNGQRLLKTCVS-SPEY 388 IIFLTD N+ +++ ++ +A N + T I I N L++ Sbjct: 304 RIIFLTDAMPNSGQTSEESLLGMIEANANK-NVYTTFIGIGVDFN-TELVEYITKIRGAN 361 Query: 389 HYNVVNADS 397 +Y+V +A Sbjct: 362 YYSVHSATQ 370 >gi|90021389|ref|YP_527216.1| BatB protein [Saccharophagus degradans 2-40] gi|89950989|gb|ABD81004.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 341 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 79/233 (33%), Gaps = 58/233 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM + Q+++ ++A +K+ + F++ + Sbjct: 92 DLLVAVDISGSMD----------TKDMVVQNQQIPRIAVVKHIVGDFIER-------RVG 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + T ++ V+Q + T A+ + + L Sbjct: 135 DRLGLVLFGTSAYLQSPLTFDRTTVKQLLVESQIGFAGPNTAIGDAIGLSIKRLRDRPAE 194 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG+N + + D AK++ +K+ TI + A Sbjct: 195 N------------------RVVILLTDGQNTAGEVSPRQAA--DLAKQSGVKVYTIGVGA 234 Query: 372 S--------------------PNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQ 403 + + L ++ ++ + L ++Q Sbjct: 235 NEMIVSDGFFGNFQRKINPSRDLDEDTLTYIAETTGGRYFRAHSPQELNQIYQ 287 >gi|294653581|ref|NP_714598.2| von Willebrand factor type A domain-containing protein [Leptospira interrogans serovar Lai str. 56601] gi|293630705|gb|AAN51613.2| BatA [Leptospira interrogans serovar Lai str. 56601] Length = 312 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 40/252 (15%), Positives = 84/252 (33%), Gaps = 46/252 (18%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 I + +S + E+ I + +D+SGSM + + PE Sbjct: 55 PILRPIAISLMILSLSGPGKKITFLPDEKEGVDILIALDVSGSMSRSRDFLPE------- 107 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 T++ K L F++ + + +GL+ + + E + + Sbjct: 108 ------TRLGVSKKLLKRFIEK-------RNNDRLGLVVFAGAAYLQAPLTGDRESLSEI 154 Query: 280 VTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + + + T A+ + L + K RS K I+ +TD Sbjct: 155 LETIEEETVTEQGTAIGDAIILSTYRLRNSKARS------------------KVIVLITD 196 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR-----LLKTCVSSPEYHYNVV 393 G +N K + T D A++ KI ++ I L + ++ + Sbjct: 197 GVSNTGKIDPVTAT--DLAEQIGAKIYSVGIGKEDGSYEINFEILQELSANTGGRFFRAE 254 Query: 394 NADSLIHVFQNI 405 + + + V +I Sbjct: 255 DPEEMKAVLSSI 266 >gi|163801617|ref|ZP_02195515.1| hypothetical protein 1103602000597_AND4_09192 [Vibrio sp. AND4] gi|159174534|gb|EDP59336.1| hypothetical protein AND4_09192 [Vibrio sp. AND4] Length = 367 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 79/224 (35%), Gaps = 36/224 (16%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM + + +++ A K L F+ + ++ Sbjct: 107 DVMVVVDLSGSMAEQDFTSKAG---------ENISRLNAAKEVLSDFVKT-------RKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + + ++ + T AM A + K Sbjct: 151 DRLGLILFGDAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHLGDAMGLAI-KVFEQSKS 209 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ K I LTDG N+ S V I+ AK ++I I++ Sbjct: 210 RIGVEENKE----------KVAIVLTDG--NDTGSFVEPIEAAKVAKAKGVRIHVIAMGD 257 Query: 372 SPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G+ L + S + +N D L + +I +L Sbjct: 258 PQTLGEAALDMKTIRRIAKESGGKAFEAMNRDELAKAYDDIGRL 301 >gi|329850248|ref|ZP_08265093.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328840563|gb|EGF90134.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 575 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/155 (22%), Positives = 57/155 (36%), Gaps = 19/155 (12%) Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + R Y R + T+ T ++ ++L+ T F +G Sbjct: 432 ALTQDIAAARTYAARMAPA---GNTNVTIGVQWGMEVLSP-------TAPFSEGGAFTDK 481 Query: 328 PFQKFIIFLTDGEN-------NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 K++I LTDG N NN + N C AK I + T+ + L+ Sbjct: 482 AVLKYMIVLTDGINTQNRWTTNNSQINARLALACTNAKNLGITVFTVRVE--QGDSTTLQ 539 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 C S Y+YN+ NAD L I + + + + Sbjct: 540 NCASQTAYYYNLSNADQLPATMSKIMKSIRKVRLT 574 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 51/311 (16%), Positives = 95/311 (30%), Gaps = 57/311 (18%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 +++ Y +K A + N ++I A SV+ + +G + + + ++ A +A +A Sbjct: 1 MTKVSRYLRKFSADIRGNVTMIVAFSVIPIVAAVGGGLDFANIQAARAKLQDAVDAGAIA 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + AK+A + K+ G F NT + + Sbjct: 61 A-----TIDPTATPTQTTREAVAKKAFCGNIKQS---------GGLQNSFCNTTTLDTLG 106 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 T A +SNN + L + + + S + Sbjct: 107 ------TASATLSTATSNNIMTVTYSATAHVPTYLLGLVGIDTVDIDAVAKSGVSTSTAE 160 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS-I 242 + V+D +GSM KM LK++L L S + Sbjct: 161 ---------VAFVLDNTGSMSSN-------------------NKMTYLKSSLDAVLASML 192 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D + L+ + T+V V V D + PT S A Sbjct: 193 DSTGKNYAKTKVALVPFDTQVS--------LSNVAGMVDYAGDFSTVTPTYSCSGYSSAQ 244 Query: 303 QILTSDKKRSF 313 + S+ + Sbjct: 245 CQVISENASAM 255 >gi|223936327|ref|ZP_03628239.1| von Willebrand factor type A [bacterium Ellin514] gi|223894845|gb|EEF61294.1| von Willebrand factor type A [bacterium Ellin514] Length = 338 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 45/242 (18%), Positives = 86/242 (35%), Gaps = 60/242 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I + +D+SGSM + ++ T+ ++ L F+D ++ Sbjct: 89 DIVVALDMSGSMLAEDEGF--------VLNGQQATRFIIARDVLKKFVDK-------RQS 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKK 310 +GL+ + T+ + P+ E + + + R I T A+ + L K Sbjct: 134 DRIGLVVFGTQAYVAVPPTLDHEFLLKNLERLGIGSINGNQTAIGSALSTSMNRLRELKS 193 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 +S K II +TDG+NN K T + A+ IKI TI + Sbjct: 194 KS------------------KIIILMTDGQNNAGKVPPLTAA--EAARALGIKIYTIGVG 233 Query: 370 ------NA----SPNGQRLLKTCVSSPE-------------YHYNVVNADSLIHVFQNIS 406 A +GQ++ + + +Y + +L ++ +I Sbjct: 234 TKGVARMAVGTDPFSGQKIYQQVPVDIDEGTLTSISKMTNAKYYRADSTATLEKIYADID 293 Query: 407 QL 408 +L Sbjct: 294 RL 295 >gi|86145196|ref|ZP_01063527.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] gi|85836773|gb|EAQ54893.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] Length = 359 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 46/238 (19%), Positives = 86/238 (36%), Gaps = 29/238 (12%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 L ++ E+ + +VVDLSGSM Q K +++ A K L Sbjct: 88 TILGEPQVREQLGRDVMVVVDLSGSMA---------EQDFTSKQGDKISRLDATKEVLAD 138 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F + ++ +GLI + + + + + + ++ + T A Sbjct: 139 FAKT-------RKGDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDA 191 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A ++ +K+S + +K +I LTDG N+ S V I A Sbjct: 192 IGLATKVFEQSEKQSAAVQDSSIDANVK----EKVVIVLTDG--NDTGSFVEPIDAAKVA 245 Query: 358 KENFIKIVTISINASPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 K ++I I++ G+ L + S + +N D L + I +L Sbjct: 246 KAKGVRIHVIAMGDPQTVGEVALDMETIKRVAQESGGEAFEALNRDELTKAYAQIGEL 303 >gi|153806291|ref|ZP_01958959.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] gi|149130968|gb|EDM22174.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] Length = 327 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 46/309 (14%), Positives = 96/309 (31%), Gaps = 68/309 (22%) Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 I+ + T ++ ++ Y +Y + F+ ++ +++ Sbjct: 23 ILRQKKSEATLQISDARVYAHTPKSYK-----NYLLHVPFLLRVIALVLVILVLARPQTT 77 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + E I L +D+S SM D K ++ A K+ F++ Sbjct: 78 NKWQNSEIEGIDIMLAIDVSTSMLAE---------------DLKPNRLEAAKDVAAEFIN 122 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 +G+ + + + + LI T + Sbjct: 123 ----GRPNDN---IGITLFAGETFTQCPLTVDHAVLLDMIHNIKCGLIEDGTAVGMGIAN 175 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A L K +S K II LTDG NN K +++ + + AK Sbjct: 176 AVTRLKDSKAKS------------------KVIILLTDGTNN--KGDISPLTAAEIAKSF 215 Query: 361 FIKIVTISI----NAS-----------------PNGQRLLKTCVSSPEYHYNVVNADSLI 399 I++ TI + A + + L + ++ ++ + L Sbjct: 216 GIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNSKLK 275 Query: 400 HVFQNISQL 408 V++ I +L Sbjct: 276 EVYEEIDKL 284 >gi|218781310|ref|YP_002432628.1| hypothetical protein Dalk_3472 [Desulfatibacillum alkenivorans AK-01] gi|218762694|gb|ACL05160.1| conserved hypothetical protein [Desulfatibacillum alkenivorans AK-01] Length = 308 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 69/222 (31%), Gaps = 38/222 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + P I L +D S SM + ++ ++ A+K + F+ Sbjct: 82 KTPGVDIILCLDASESMAQPDFAIDG----------QRVNRLTAVKKVVHDFVKR----- 126 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GL+ + + + + + + T A+ A + + Sbjct: 127 --RDTDRIGLVVFGDYAFTQAPLTLDKGLLLNLIENLRIGMAGRKTAIGDALGVAGKRIK 184 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S K +I L+DGEN IKI T Sbjct: 185 DIPAMS------------------KVVILLSDGENTAGDMTPQGAAE--ALAALGIKIYT 224 Query: 367 ISINASPNG-QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + G + L + +Y+ N + L +++ I + Sbjct: 225 IGMGTEQAGSKELAQIAAIGQGKYYHASNTEQLDSIYKEIDK 266 >gi|163734461|ref|ZP_02141901.1| hypothetical protein RLO149_09454 [Roseobacter litoralis Och 149] gi|161392469|gb|EDQ16798.1| hypothetical protein RLO149_09454 [Roseobacter litoralis Och 149] Length = 327 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/280 (12%), Positives = 92/280 (32%), Gaps = 44/280 (15%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 ++ S V+ + L+ I IE + + L Sbjct: 40 QVISEAQLTPAKGAVIMRRSWLQVIAASLIWILLVIAISQPERLGAPIETSKSAR-DLIL 98 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 +D+SGSM D ++ ++A +++ + F++ +E M Sbjct: 99 AIDISGSM---------DTRDFTDASNENVQRLAGVRDVVRAFVE-------GREGDRMA 142 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + ++ + T + + + + + T A+ + + + + Sbjct: 143 LIVFGSKAYLQSPLTEDTGTIVELLDQTEVGMAGPHTAIGDAIGLSIRTFEASEIE---- 198 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN- 374 Q+ +I L+DG + + + + + A+ ++I TI++ Sbjct: 199 --------------QRLLILLSDGADTASRMSP--LNAAEIARGAGVEIFTIAVGDPDAT 242 Query: 375 -----GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 L+ + + ++ + +L ++ I +L Sbjct: 243 GENRVDVAALQDIANRTSGSYFFAADQAALDEIYARIDEL 282 >gi|45655623|ref|YP_003432.1| BatA [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|45602594|gb|AAS72069.1| BatA [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 320 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 40/252 (15%), Positives = 84/252 (33%), Gaps = 46/252 (18%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 I + +S + E+ I + +D+SGSM + + PE Sbjct: 63 PILRPIAISLMILSLSGPGKKITFLPDEKEGVDILIALDVSGSMSRSRDFLPE------- 115 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 T++ K L F++ + + +GL+ + + E + + Sbjct: 116 ------TRLGVSKKLLKRFIEK-------RNNDRLGLVVFAGAAYLQAPLTGDRESLSEI 162 Query: 280 VTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + + + T A+ + L + K RS K I+ +TD Sbjct: 163 LETIEEETVTEQGTAIGDAIILSTYRLRNSKARS------------------KVIVLITD 204 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR-----LLKTCVSSPEYHYNVV 393 G +N K + T D A++ KI ++ I L + ++ + Sbjct: 205 GVSNTGKIDPVTAT--DLAEQIGAKIYSVGIGKEDGSYEINFEILQELSANTGGRFFRAE 262 Query: 394 NADSLIHVFQNI 405 + + + V +I Sbjct: 263 DPEEMKAVLSSI 274 >gi|254420933|ref|ZP_05034657.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] gi|196187110|gb|EDX82086.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] Length = 646 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 58/216 (26%), Gaps = 55/216 (25%) Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 G YT SW S I+ T+ + A +T+ + Sbjct: 432 RAGSQAYTD--AAPSASSWVGRSYASGGNACPASQIVPLTNVKKTLTDAVDGMTAVGSTA 489 Query: 313 FFTNFFRQGVKIP----------------SLPFQKFIIFLTDGENNNF------------ 344 + S K ++ +TDGE N Sbjct: 490 GHIGLAWGWYLVSPNFGLWSGLGAPAAYDSSKTLKAVVLMTDGEFNTPYFRGVIASDAGN 549 Query: 345 ----------------KSNVNTIKICDKAKENFIKIVTISINAS---------PNGQRLL 379 S ++C+ K + + T+ + + L+ Sbjct: 550 GSGGADTHINQPATNGSSFEQAYRLCENMKAADVIVYTVGFDIGAARNMTGPIDSAGELM 609 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 C ++P+ + ++ L F++I + + + S Sbjct: 610 ARCATNPDRAFQASSSTDLSDAFRDIGRDITRLRIS 645 >gi|296124353|ref|YP_003632131.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296016693|gb|ADG69932.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 390 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 55/396 (13%), Positives = 128/396 (32%), Gaps = 69/396 (17%) Query: 39 FLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFI 98 F + V + + +A +A+ AG + +I+ A + + Sbjct: 33 FTVDVAYMQLVRTELRAATDASAKAGMEALRRTQDTEAAIDAAIATAAANKVGGRSLTLT 92 Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 + I+ L F N + N+ ++ T + N + + + L Sbjct: 93 ADQIEFGL-----AFRNVDNSVSFNAGQLPYTAVRVNSAMTES---------SAAGAVPL 138 Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN----SDPEDV 214 F + + S + A +E I +D S SM + S P Sbjct: 139 -FFGSIFGTGQFEPTRSAVSASTEVE--------ICFAIDRSHSMCFDLTGVDWSYPPGT 189 Query: 215 NSAPICQDKKR----TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK----- 265 P ++ A+L A+ F+ + ++ + +++ + Sbjct: 190 PRNPDPVAFPPHPTLSRWASLSRAMQTFVSITASQEPKP---RVAMVTWASKITQSNYEG 246 Query: 266 ------------NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 ++ + + Q + + ++L T+ + +A +IL + K Sbjct: 247 KLTKTNSPEVFVDVPLTTNLADLNQAIKGRSEKVMLGATNMAAGIDEARKILNATKSTRP 306 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 + + + II +TDG + N + A I I ++S+ Sbjct: 307 YAH--------------RIIILMTDG---LWNQGRNPLLAAQDAANEGIVIHSVSLL-PR 348 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 +G + ++ +Y N+ +L F +I++ + Sbjct: 349 SGDITPQVSSTTGGVNYPATNSAALEAAFADIARTL 384 >gi|261867447|ref|YP_003255369.1| TadG [Aggregatibacter actinomycetemcomitans D11S-1] gi|261412779|gb|ACX82150.1| TadG [Aggregatibacter actinomycetemcomitans D11S-1] Length = 545 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 42/298 (14%), Positives = 94/298 (31%), Gaps = 35/298 (11%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA- 60 +L ++ K+ + +E ++II AL LL + F + K + A + A Sbjct: 6 NLTAKLFSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAA 65 Query: 61 --ILAGASKMVSNLSRLGDRFESISNHAKRALIDDA-------------KRFIKNHIKES 105 ++A ++ N + +++S D + ++ +K Sbjct: 66 LLLIAEDNQYRKNKDHSDVKRQNVSQQEIEREGRDFSSAKVQAQWKKRNQELVQGLVKLY 125 Query: 106 LSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 L + + ++ + + N T ++ + + +F L Sbjct: 126 LRSDDSKGQKNSSPVTIKEPFLAECLEEKTQPKNKNGTAK-SIACVVQGSVQRKFW--LP 182 Query: 166 NQRYNQKIVSFIPALLRIEMGERPI---------FLIELVVDLSGSMHCAMNSDPEDVNS 216 + + I G+ + +V DLS SM A+N+ + Sbjct: 183 WGQTLVSSSQLYDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSRSMMWAINA----TGN 238 Query: 217 APICQDKKRTKMAALKNALLL---FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 P + ++ AL+ A+ L V MG + + + E + Sbjct: 239 NPPEVNYPNRRIDALREAVEGIEKILLPAQNKGDVSPYNRMGFVSFAAGTRQRDELTN 296 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 54/163 (33%), Gaps = 26/163 (15%) Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 V K +W + R + + T T + ++ K +F Sbjct: 372 DNVNKKTTQAWFDKNNRAVASALKQIIPRGGTAVTSGIFIGTNLMMEKNK-----DFEAM 426 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTI----KICDKAKEN--------------FI 362 KI ++ ++ L+DGE+N + +C + KE I Sbjct: 427 PNKIG-TNTRRILMILSDGEDNIPSKDTLVKLMEAGLCTRVKEKIDGLQDSNYPKVETRI 485 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 V N Q + K CV + +Y+V + +L F+ I Sbjct: 486 AFVAFGFNPPQKQQEVWKKCV--GDQYYSVSSKQALFDAFKQI 526 >gi|167946540|ref|ZP_02533614.1| BatB protein, putative [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 345 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 39/241 (16%), Positives = 80/241 (33%), Gaps = 58/241 (24%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + L VD S SM ++ ++++ LK + F+D Sbjct: 94 AGYDLMLAVDTSRSM----------TAEDFTVHGREVSRLSVLKGIMGKFVD-------G 136 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +GLI + ++ + Q + + +L T + + L Sbjct: 137 RVGDRIGLIIFGDTSYVLSPLTFDRNAIHQLLDGIVPTLAGGGTAIGDGIGLGIKKLRER 196 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + S + +I +TDG+N + +K AK+ I+I TI Sbjct: 197 PEGS------------------RVLILVTDGKNET--GTIPPLKAAQLAKQEGIRIYTIG 236 Query: 369 INASPNGQRLL---------------------KTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + ++ N RLL + ++ ++ + L V+Q I + Sbjct: 237 VGSTKNRVRLLSPDLRTYEIATGLAIDEETLQQIAETTGGAYFRANDTAGLEKVYQRIDE 296 Query: 408 L 408 L Sbjct: 297 L 297 >gi|152985991|ref|YP_001347440.1| hypothetical protein PSPA7_2067 [Pseudomonas aeruginosa PA7] gi|150961149|gb|ABR83174.1| hypothetical protein PSPA7_2067 [Pseudomonas aeruginosa PA7] Length = 337 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 43/250 (17%), Positives = 84/250 (33%), Gaps = 60/250 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM QD + +++ +K F++ + Sbjct: 91 DLLLAVDVSGSMD----------YRDMRWQDDEISRLELVKKLFGDFIE-------GRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLALKRLRQRPAE 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG N + + T A E +KI TI I A Sbjct: 194 S------------------RVLVLITDGANTGGQISPQTAAR--LAAEERVKIYTIGIGA 233 Query: 372 SPN------------GQRL----LKTCV-SSPEYHYNVVNADSL------IHVFQNISQL 408 P G L L+ ++ ++ ++ L + + ++Q Sbjct: 234 DPQQGGVIGLFGLNPGLDLDEPVLRGIAETTGGEYFRARSSAELESISATLDRLEPVAQQ 293 Query: 409 MVHRKYSVIL 418 + +V L Sbjct: 294 TTRARPAVAL 303 >gi|327190622|gb|EGE57710.1| hypothetical protein RHECNPAF_409007 [Rhizobium etli CNPAF512] Length = 427 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 60/446 (13%), Positives = 131/446 (29%), Gaps = 94/446 (21%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I+ NF I+ AL ++ + G + + + +A +AA + + Sbjct: 9 ISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGS---IAEKSGA 65 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + N DA+ + + L+ + + Sbjct: 66 VAAAMAMNGNGTISLGKTDARDIFMSQVSGELA----------------EVHVDLGIDVT 109 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + N+ + + V T+ + + I E Sbjct: 110 KTANKLNSQVSFTATVPTT-------FMRIFGRDSIT-----ISGTATAEYQTAAFMDFY 157 Query: 195 LVVDLSGSM----------------HCAMNSDPEDVNSAPICQDKK---RTKMAALKNAL 235 +++D + SM CA D ++ K ++ ++ A Sbjct: 158 ILLDNTPSMGVGATPSDVSKLEAKVGCAFACHQMDKSTNNYTIAKSLGVAMRIDVVRQAT 217 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D+ + MG+ + T+ E + + +T+ + TD+ Sbjct: 218 QALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTT--ISGLTSDLTKVKNY-----TDAV 270 Query: 296 PAMKQAYQILTSDKKRSFFTNFFR-------QGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 M YQ SD+ +F + + G + +K + F++DG +++K + Sbjct: 271 DLMTIPYQNYNSDQITNFDSAMTQMNTIIDLAGDGTSNTSAEKILFFVSDGVGDSYKPST 330 Query: 349 NTIKI-------------CDKAKENFIKI---VTISINASPNG-------------QRLL 379 T K C K+ +KI T + N + Sbjct: 331 CTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQSEIPTKM 390 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNI 405 + C SP +++ V D + + + Sbjct: 391 QACA-SPGFYFEVSPTDGITDAMKAL 415 >gi|119504633|ref|ZP_01626712.1| BatB protein, putative [marine gamma proteobacterium HTCC2080] gi|119459655|gb|EAW40751.1| BatB protein, putative [marine gamma proteobacterium HTCC2080] Length = 332 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 76/222 (34%), Gaps = 47/222 (21%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L +DLSGSM + +++ A+K A+ S Sbjct: 91 DLLLAIDLSGSM----------QIEDMQIGNSLVSRITAVK-AIAADFASRRTGD----- 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ + V+Q++ + T A+ A + L Sbjct: 135 -RVGLILFGTRAYVQAPLTFDVKTVKQFIEEAQLGFAGEDTAIGDALGLAVKRLRERPAD 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 S + +I LTDG++ + A E +KI TI I Sbjct: 194 S------------------RVLILLTDGQDTASTVDPMEAA--ALASEMNVKIYTIGISR 233 Query: 370 ----NASPNG---QRLLKTCV-SSPEYHYNVVNADSLIHVFQ 403 +++ +G + LL ++ ++ L ++Q Sbjct: 234 RLGTSSNSSGEVDEALLTAIAQATGGRYFRARTPKELQDIYQ 275 >gi|32472883|ref|NP_865877.1| signal peptide [Rhodopirellula baltica SH 1] gi|32444120|emb|CAD73562.1| hypothetical protein-signal peptide and transmembrane prediction [Rhodopirellula baltica SH 1] Length = 434 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 68/453 (15%), Positives = 131/453 (28%), Gaps = 82/453 (18%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 RF + S + +++ A + LL F I + K + A +AA AG Sbjct: 8 VRFALTQRDESRRGGITVLMAFVLPMLALLAAFCINLAQMQLVKTELAIATDAAARAGGR 67 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + + A + + + + F + + N+ R Sbjct: 68 AFSEEQTVEAAKAAARLTAAMNEVAGEPYQLNTDDSANEFE-----FGVSAQTD-GNTGR 121 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 T + + + ++ + D L + I F P M Sbjct: 122 FYFTKVPTSDVAANLVAVSSVRINGKRTDDSLLGPVPFIFPNTFS-IGDFSPVASATAMQ 180 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDV------------------------NSAPICQ- 221 I LV+D SGSM P+D N P Sbjct: 181 VDR--DISLVLDRSGSMDWKTYDWPDDADPWGEDSLISAEDAGIVDLEWKYRNGQPQYIR 238 Query: 222 -------------------------DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 T L A+ FL +D ++ + Sbjct: 239 RVSYNRGYDEYDLYDHAWEEVFGLGPAPNTPWEDLVLAVDAFLRVLDQTPQNEQV---SI 295 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 Y + + + VR V + + T M T + R + + Sbjct: 296 ASYNSHGTLDCWLLDDFDSVRAAVAQLAPN---GSTGIGNGMNSGKTAFTHENARPYAS- 351 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 K ++ +TDG N+N+ + NT+ + + I T++ + + Sbjct: 352 --------------KTMVVMTDG-NHNYGTQPNTVAQ-QLMSSSNLNIQTVTFGGGADQE 395 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + V+ HY+ + D L+ F+ I+ + Sbjct: 396 TMQEVAVTGLGRHYHADSGDELVSAFEEIANNL 428 >gi|222528098|ref|YP_002571980.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222454945|gb|ACM59207.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 902 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 49/247 (19%), Positives = 86/247 (34%), Gaps = 46/247 (18%) Query: 165 LNQRYNQKIVSFIPALLRIEMGER-PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L N + +P + I+ E+ + LV+D SGSM +D Sbjct: 379 LGNYSNSVLEKMLPVKMEIKNKEKEKNIDVVLVLDHSGSMA--------------DTEDA 424 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K+ K+A ++ ++ V G+I + + G ++ V Sbjct: 425 GIPKLEIAKSASAKMIEHLESSDGV------GVIAFDHNYYWAYKF--GKISKKEDVIES 476 Query: 284 MDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + S+ + T P + +A + L K +S I+ LTDG Sbjct: 477 ISSIEVGGGTAIIPPLSEAVKTLKKSKAKSKL------------------IVLLTDGMGE 518 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + ++AK N IKI TI + N L + Y V N L+ VF Sbjct: 519 QGGYEIPA----NEAKRNNIKITTIGVGKYVNATVLSWIASFTSGRFYLVSNPSELVDVF 574 Query: 403 QNISQLM 409 ++++ Sbjct: 575 LKETKII 581 >gi|91792882|ref|YP_562533.1| von Willebrand factor, type A [Shewanella denitrificans OS217] gi|91714884|gb|ABE54810.1| von Willebrand factor, type A [Shewanella denitrificans OS217] Length = 330 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 77/235 (32%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + K + + ++N L F++ + Sbjct: 85 DLMLAVDLSGSM----------QIEDMVINGKTVDRFSLIQNVLGEFIER-------RNG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + ++ L+ K T A+ A + + Sbjct: 128 DRLGLILFADHAYLQAPLTQDRRSIATFLADAQIGLVGKQTAIGEAIALAVKRFDQVSES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + ++ LTDG NN + + A + + I T+ + A Sbjct: 188 N------------------RVLVLLTDGSNNAGNIEPDVAA--EIAAKRNVTIYTVGVGA 227 Query: 372 S-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + L+ + + Y++ N++ L ++Q I QL Sbjct: 228 ELMERRTIFGKERVNPSMDLDEAQLQRLATMTNGYYFRAKNSEDLAQIYQKIDQL 282 >gi|26988754|ref|NP_744179.1| von Willebrand factor type A domain-containing protein [Pseudomonas putida KT2440] gi|24983548|gb|AAN67643.1|AE016394_4 von Willebrand factor type A domain protein [Pseudomonas putida KT2440] gi|313499848|gb|ADR61214.1| Von Willebrand factor type A domain-containing protein [Pseudomonas putida BIRD-1] Length = 358 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 68/183 (37%), Gaps = 37/183 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM Q++ +++ +K + FL +E Sbjct: 91 DLLVAVDVSGSMD----------FPDMQWQNEDISRLDLVKALMGDFLQ-------DREG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTFLIEAQIGIAGKNTAIGDAIGLAVKRLRERPAQ 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG NN + + T A + ++I TI I A Sbjct: 194 S------------------RVLVLITDGANNGGQIHPLTAAR--LAAQEGVRIYTIGIGA 233 Query: 372 SPN 374 +P Sbjct: 234 NPE 236 >gi|88704964|ref|ZP_01102676.1| conserved hypothetical protein [Congregibacter litoralis KT71] gi|88700659|gb|EAQ97766.1| conserved hypothetical protein [Congregibacter litoralis KT71] Length = 344 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 76/234 (32%), Gaps = 59/234 (25%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM ++ ++ A+K L S Sbjct: 85 DLMLAVDISGSM----------RVEDMQVGNRMARRIDAVKQ-LGSDFMSRRSGD----- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + +R S+ + V++++ + T A+ A + L Sbjct: 129 -RLGLILFGSRAYLQSPLSFDIQTVQRFLLEAQIGFAGQETAIGDAIGLAVKRLQERPAS 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 S + ++ LTDG++ + + A + ++I TI I Sbjct: 188 S------------------RVLVLLTDGQDTASTVDPLEAA--NLAADLGVRIYTIGIGA 227 Query: 370 --------------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +A + L++ S+ ++ + + L V++ Sbjct: 228 DSLTLPGLLGSPLGARTVNPSADLDESTLIEIARSTGGQYFRARDPEELATVYR 281 >gi|77359908|ref|YP_339483.1| von Willebrand factor type A [Pseudoalteromonas haloplanktis TAC125] gi|76874819|emb|CAI86040.1| conserved protein of unknown function; putative Von Willebrand factor type A domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 328 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 80/244 (32%), Gaps = 42/244 (17%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + +N +++ RL E ++ + I M I L Sbjct: 36 PSFAKHNLTSQSVEAHAR---RLTPFEWVIWLLLVIAAANPTWLDDPISMPNEGR-DIML 91 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VDLSGSM + ++ +K L F++ ++ +G Sbjct: 92 AVDLSGSM----------TEQDMAYNGQYVDRLTMVKAVLTDFIEQ-------RQGDRLG 134 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + + + V + ++ L+ + T A+ + + + K + Sbjct: 135 LILFGDTAFLQTPLTRDVKTVSKMLSEAQIGLVGRATAIGDALGLSVKRFANKDKSN--- 191 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-NASPN 374 + ++ LTDG+N + A++ IK+ TI + + +P Sbjct: 192 ---------------RIVVLLTDGQNTAGNLKPEEALL--LARDAGIKVYTIGVGSDNPR 234 Query: 375 GQRL 378 G L Sbjct: 235 GFSL 238 >gi|257052324|ref|YP_003130157.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691087|gb|ACV11424.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 592 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 63/170 (37%), Gaps = 14/170 (8%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV--NSAPICQD 222 L + + + + L ER + +V+D+SGSM + D N + + Sbjct: 163 LGESTGRYFTVGLNSTLDTSTFERKRLDVVIVLDISGSMGSQFDQYYYDRFGNRHTVEEG 222 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP----SWGTEKVRQ 278 R+KMA K+AL+ + + +G++ + + + +R Sbjct: 223 DSRSKMAVAKDALVALTEQLHPDD------RVGVVLFNNEPTVAKPLRDVETTDMDAIRG 276 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 ++ D+++ T+ M +A +L RQ V ++P Sbjct: 277 HIREDIEA--GGGTNIADGMAEAADMLGEYADSDPTEAETRQIVITDAMP 324 >gi|222147837|ref|YP_002548794.1| hypothetical protein Avi_1104 [Agrobacterium vitis S4] gi|221734825|gb|ACM35788.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 483 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 58/166 (34%), Gaps = 33/166 (19%) Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 T + D T M + L+ + ++ S +K Sbjct: 315 TSSFSTLQSAIADMTSEGSTRLDAGMLAGWYTLSPKWRSAWGGGTAPADY---SEKVKKV 371 Query: 333 IIFLTDGENN----------------------------NFKSNVNTIKICDKAKENFIKI 364 I+F+TDGE N N + + CD K N I+I Sbjct: 372 IVFMTDGEMNVKFGSTDPAKSSTEKLDWICDKNRTKSCNDTATNALLTTCDSIKSNNIEI 431 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 IS ++ + Q L+TC S +Y+++ + + V+ IS+ ++ Sbjct: 432 YAISYSSEADVQN-LQTCSSGTKYYFSASTTN-IKDVYTAISKNII 475 >gi|91201135|emb|CAJ74194.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 333 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/184 (19%), Positives = 61/184 (33%), Gaps = 45/184 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ I + VD S SM D K ++ K + L + Sbjct: 85 EKKGIDIMIAVDTSRSMLAD---------------DVKPNRLEVAKREIEDLLKIL---- 125 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQIL 305 E +GLI + R + R ++ ++ + + T A+ Y+ + Sbjct: 126 ---EGDRVGLIAFAGRAFTYCPLTSDYSAFRLFLNDLNVNIIPVGGTAIAEAI---YKGI 179 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + K +I +TDGEN + +K KAKE I I Sbjct: 180 DAFGENENNH---------------KAMIIITDGEN----HETDPLKAASKAKEKGIVIY 220 Query: 366 TISI 369 T+ + Sbjct: 221 TVGV 224 >gi|294054129|ref|YP_003547787.1| hypothetical protein Caka_0592 [Coraliomargarita akajimensis DSM 45221] gi|293613462|gb|ADE53617.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 339 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 49/231 (21%), Positives = 80/231 (34%), Gaps = 29/231 (12%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I+++VD+S SM S K T+M K + F+ + Sbjct: 82 IAIQMLVDVSSSMD----------MSVKNFDGKSTTRMEVAKEMVERFIAGDGEDLQGRP 131 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD--MDSLILKPTDSTPAMKQAYQILTSD 308 +GLI + + ++G + + Q V + T A+ A L + Sbjct: 132 HDLIGLITFARYADTRSPLTFGHDALLQIVRHLTIQERPNEDGTAYGDALALAAARLKNP 191 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 ++ Q I S K II LTDGENN+ AK KI IS Sbjct: 192 QELRHGKRPDAQAEAIES----KVIILLTDGENNSGSHLPIEAA--GLAKAWDCKIYAIS 245 Query: 369 INA-----------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + SP + L + + + +SL+ V++ I +L Sbjct: 246 LGESLDAENPLDALSPAERVLEHISIETGGVFRQAHDFESLLSVYEEIDRL 296 >gi|84498072|ref|ZP_00996869.1| hypothetical protein JNB_18333 [Janibacter sp. HTCC2649] gi|84381572|gb|EAP97455.1| hypothetical protein JNB_18333 [Janibacter sp. HTCC2649] Length = 656 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 77/215 (35%), Gaps = 32/215 (14%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 L++D SGSM TK+ A K AL + ++ + + Sbjct: 41 LMLDASGSMKAK--------------DPSGLTKIEAAKRALTGVVGAL------PDTAQV 80 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 GL Y +V+ +P+ Q V K T + A+K + Sbjct: 81 GLRVYGAKVDGKGKPTPAACADTQLVHPIATLDKPKLTSTIAAIKA--------LGETPI 132 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 + + +K ++ I+ ++DGE + I A ++I T+ + Sbjct: 133 AHSLTEALKDLGTSGKRNIVLVSDGEESCVPDPCPAITKLTAA-GVDLQIDTVGFGVNTK 191 Query: 375 GQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQ 407 + L+ C+++ +Y+ +A +L +SQ Sbjct: 192 ARAQLQ-CIAAAGKGTYYDAKDASALTTSLSKLSQ 225 >gi|255693880|ref|ZP_05417555.1| BatA protein [Bacteroides finegoldii DSM 17565] gi|260620309|gb|EEX43180.1| BatA protein [Bacteroides finegoldii DSM 17565] Length = 327 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 45/276 (16%), Positives = 81/276 (29%), Gaps = 63/276 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y F+ L +++ + + E I L VD+S SM Sbjct: 51 YLLHAPFLLRLFALTLVILVLARPQTTNKWQNSEIEGIDIMLAVDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K+ F++ +G+ + + Sbjct: 104 --------DLKPNRLEAAKDVAAEFIN----GRPNDN---IGITLFAGESFTQCPLTVDH 148 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + LI T + A L K +S K I Sbjct: 149 AVLLDMIHNIKCGLIEDGTAVGMGIANAVTRLKDSKAKS------------------KVI 190 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------- 372 I LTDG NN + T + AK I++ TI + A Sbjct: 191 ILLTDGTNNKGDISPMTAA--EIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMPVE 248 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ ++ + L V++ I +L Sbjct: 249 IDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDKL 284 >gi|110677910|ref|YP_680917.1| hypothetical protein RD1_0526 [Roseobacter denitrificans OCh 114] gi|109454026|gb|ABG30231.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 327 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/280 (12%), Positives = 93/280 (33%), Gaps = 44/280 (15%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 ++ + V+ ++ L+ I IE + + L Sbjct: 40 QVIAEAKLTPAKGAVIMRRNWLQIIAASLIWILLVIAISQPERLGAPIETSKSAR-DLIL 98 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 +D+SGSM D ++ ++A +++ + F++ +E M Sbjct: 99 AIDISGSM---------DTRDFTDASNENLQRLAGVRDVVRAFVE-------GREGDRMA 142 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + ++ + T + + + + + T A+ + + + + Sbjct: 143 LIVFGSKAYLQSPLTEDTGTIVELLDQTEVGMAGPHTAIGDAIGLSIRTFEASEIE---- 198 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN- 374 Q+ +I L+DG + + + + + A+ ++I TI++ Sbjct: 199 --------------QRLLILLSDGADTASRMSP--LNAAEIARGAGVEIFTIAVGDPDGT 242 Query: 375 -----GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 L+ + + ++ + +L ++ I +L Sbjct: 243 GENRVDVAALQDIANRTSGSYFFAADQAALDEIYARIDEL 282 >gi|253568262|ref|ZP_04845673.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298385671|ref|ZP_06995229.1| BatA protein [Bacteroides sp. 1_1_14] gi|251842335|gb|EES70415.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298261812|gb|EFI04678.1| BatA protein [Bacteroides sp. 1_1_14] Length = 327 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 44/276 (15%), Positives = 86/276 (31%), Gaps = 63/276 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y FI ++ +++ + + + E I L +D+S SM Sbjct: 51 YLLHAPFILRVIALALIIVVLARPQSTNKWQNSEIEGIDIMLAIDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K+ F++ +G+ + + Sbjct: 104 --------DLKPNRLEAAKDVAAEFIN----GRPNDN---IGITLFAGETFTQCPLTVDH 148 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + LI T + A L K +S K I Sbjct: 149 AVLLDMIHNIKCGLIEDGTAVGMGVANAVTRLKDSKAKS------------------KVI 190 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------- 372 I LTDG NN K +++ + + AK I++ TI + A Sbjct: 191 ILLTDGTNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYINMPVE 248 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ ++ + L V++ I +L Sbjct: 249 IDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDKL 284 >gi|253584083|ref|ZP_04861281.1| BatA protein [Fusobacterium varium ATCC 27725] gi|251834655|gb|EES63218.1| BatA protein [Fusobacterium varium ATCC 27725] Length = 319 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 44/238 (18%), Positives = 82/238 (34%), Gaps = 61/238 (25%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ++ I + +DLS SM + +D ++ K L F+D Sbjct: 78 KKDGIDIAISLDLSQSM---------------LQEDFTPNRLEKAKEVLDEFIDK----- 117 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK--QAYQI 304 + + + LI + + ++ +++ + I T + M A Sbjct: 118 --RGNDRLSLIVFGGDAYTKVPLTFDHNVIKEMTRKLTVDDITSNTRTAIGMGIGVALNR 175 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L + +S K II LTDGENN+ + + + D AKE IKI Sbjct: 176 LKDSEAKS------------------KVIILLTDGENNSGEMSPSAAA--DIAKELGIKI 215 Query: 365 VTISINA----------------SPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 TI I A + + +LK+ ++ ++ ++ +F I Sbjct: 216 YTIGIGAKEIKVPSFFGYKTVKNTELDENMLKSIAETTGGEYFRASDSKEFKEIFNKI 273 >gi|29346317|ref|NP_809820.1| aerotolerance protein BatA [Bacteroides thetaiotaomicron VPI-5482] gi|29338212|gb|AAO76014.1| BatA [Bacteroides thetaiotaomicron VPI-5482] Length = 327 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 44/276 (15%), Positives = 86/276 (31%), Gaps = 63/276 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y FI ++ +++ + + + E I L +D+S SM Sbjct: 51 YLLHAPFILRVIALALIIVVLARPQSTNKWQNSEIEGIDIMLAIDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K+ F++ +G+ + + Sbjct: 104 --------DLKPNRLEAAKDVAAEFIN----GRPNDN---IGITLFAGETFTQCPLTVDH 148 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + LI T + A L K +S K I Sbjct: 149 AVLLDMIHNIKCGLIEDGTAVGMGVANAVTRLKDSKAKS------------------KVI 190 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------- 372 I LTDG NN K +++ + + AK I++ TI + A Sbjct: 191 ILLTDGTNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYINMPVE 248 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ ++ + L V++ I +L Sbjct: 249 IDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDKL 284 >gi|84385834|ref|ZP_00988864.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] gi|84379150|gb|EAP96003.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] Length = 359 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 85/238 (35%), Gaps = 29/238 (12%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 L ++ E+ + +VVDLSGSM + K +++ A K L Sbjct: 88 TILGEPQVREQLGRDVMVVVDLSGSMA---------EQDFTSKRGDKISRLDATKEVLAD 138 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F + ++ +GLI + + + + + + ++ + T A Sbjct: 139 FATT-------RKGDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDA 191 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A ++ +K+S +K +I LTDG N+ S V I A Sbjct: 192 IGLAIKVFEQSEKQSAAVQDSSVDAN----EKEKVVIVLTDG--NDTGSFVEPIDAAKVA 245 Query: 358 KENFIKIVTISINASPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 K ++I I++ G+ L + S + +N D L + I +L Sbjct: 246 KAKGVRIHVIAMGDPQTVGEVALDMETIKRVAQESGGEAFEALNRDELTKAYAQIGEL 303 >gi|163761157|ref|ZP_02168234.1| hypothetical protein HPDFL43_13595 [Hoeflea phototrophica DFL-43] gi|162281708|gb|EDQ32002.1| hypothetical protein HPDFL43_13595 [Hoeflea phototrophica DFL-43] Length = 444 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 61/439 (13%), Positives = 125/439 (28%), Gaps = 61/439 (13%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + S NF ++ L +++ + + G + + K + + +A +L ++ Sbjct: 10 LRSRDGNFGLLAGLVMVALVWVAGLAVDFSNALRVKTTAQDIVDATVLRATRDIIEEGKT 69 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH--- 131 L + S R D F + +S ++ I + S + S + Sbjct: 70 LAEAELS-----ARKYFDAELAFSSG-VGLEVSTFTLTQGVDGIVKLGVSGKTSTSLLKA 123 Query: 132 ----MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 +D++ + ++++ ++D I S + AL + G Sbjct: 124 VGREEIPVSVDAAAHVGGGSVEIAIAFDVTNSMGFGTTWGEATSVIASALNAL-KANSGS 182 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI-----CQDKKRTK----------MAALK 232 + I ++ M A +P D + C D + TK M Sbjct: 183 MALTFIPFTDRVNVGMGRANLLNPGDQTAVKKGGWGGCVDVRATKKKNKGETEYFMPDSA 242 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 D + GY G VT + L T Sbjct: 243 PEKGDRFTKFDNGTPAAHK-----SGYKLACNPQSII--GPTSNVSDVTSQLGKLTKGGT 295 Query: 293 DSTP-AMKQAYQILTSDKK------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF- 344 + L+ + K S +K + TDG N + Sbjct: 296 GRFDLGFAWLWYALSPNWKGFWSGGAPADNGVNLADYPTASTNTRKIAVLATDGLTNAYV 355 Query: 345 ----------------KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS-PE 387 N + IC I++ + +N + + + C S+ Sbjct: 356 YEYGKTNLAGWNTGSKDHFENVVAICKSMAAQKIEVHVMHVNGNDKAEPYFRECASATGG 415 Query: 388 YHYNVVNADSLIHVFQNIS 406 +Y V + +L+ I+ Sbjct: 416 GYYKVASKQTLVDALTGIT 434 >gi|85374104|ref|YP_458166.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] gi|84787187|gb|ABC63369.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] Length = 623 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 36/249 (14%), Positives = 75/249 (30%), Gaps = 42/249 (16%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL--FLDSIDLLSHVKEDVYM 254 VDLS ++ +N+ P + + + + + KE + Sbjct: 386 VDLSANLDLDINTVPVASDQDTQWRPRYPDMIYVRSKEADDKGSFSPAPVYDTKKEFIQT 445 Query: 255 GLIGYTTRVEKNIEPSWGTEK-VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G ++ + T + Y+ T M ++L S Sbjct: 446 GNWWFSGCPAPAQKLKAMTSGELDSYLDSL---TPHGATYHDGGMIWGGRLL------SQ 496 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--------------SNVNT--------- 350 + F + P + +IFLTDG+ + T Sbjct: 497 YGLFAAENSSKPGRTTSRHLIFLTDGQTEPYDLAYGSYGIDPIDERRWTQTSSLTLAQTV 556 Query: 351 ----IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + C++ K+ + ++ + N + +KTC S ++ NA L F I+ Sbjct: 557 EERFLFACNEVKKLGATVWVVAFGTAANDK--MKTCAGS-GRYFEAANASQLNDAFSTIA 613 Query: 407 QLMVHRKYS 415 + + + Sbjct: 614 KSTGDLRIA 622 Score = 53.0 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 39/300 (13%), Positives = 92/300 (30%), Gaps = 65/300 (21%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ + N + A ++ L + G + + + ++ ++ A ++ +LA + + Sbjct: 2 RRLASDRSGNTLALIAAGLLPLLAMAGSGVDMSRAYLAESRLQQACDSGVLAARKALGTE 61 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 ++ L D I F+N+ Q+ Sbjct: 62 IATLTD------------------------IPTDAGTRGQEFFNSNFQD----------- 86 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 + T +++ DY + + + F +++E R F Sbjct: 87 -------GNYGTQNRTFNMVLENDYSVSGTATVDVPTSVMTVFGFTKIPVKVECQARISF 139 Query: 192 L---IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + +V+D++GSM +K+ +LK + F D ++ Sbjct: 140 SDVDVMMVLDVTGSM-------------KHTNSGDTLSKIDSLKATVRNFYDQMEGAKSA 186 Query: 249 KEDVYMGLIGY--TTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAYQI 304 + G + Y V ++ W Y +R + T + AY+ Sbjct: 187 GTRIRYGFVPYASNVNVGHLLKDEWVVNSWA-YQSRAISGTTTVEAGTKTRE--NWAYKS 243 >gi|148548919|ref|YP_001269021.1| von Willebrand factor, type A [Pseudomonas putida F1] gi|148512977|gb|ABQ79837.1| von Willebrand factor, type A [Pseudomonas putida F1] Length = 358 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 68/183 (37%), Gaps = 37/183 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM +++ +++ +K + FL +E Sbjct: 91 DLLVAVDVSGSMD----------FPDMQWKNEDISRLDLVKALMGDFLQ-------DREG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTFLIEAQIGIAGKNTAIGDAIGLAVKRLRERPAQ 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG NN + + T A + ++I TI I A Sbjct: 194 S------------------RVLVLITDGANNGGQIHPLTAAR--LAAQEGVRIYTIGIGA 233 Query: 372 SPN 374 +P Sbjct: 234 NPE 236 >gi|167032571|ref|YP_001667802.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166859059|gb|ABY97466.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 358 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 68/183 (37%), Gaps = 37/183 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM +++ +++ +K + FL +E Sbjct: 91 DLLVAVDVSGSMD----------FPDMQWKNEDISRLDLVKALMGDFLQ-------DREG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAKIGIAGKNTAIGDAIGLAVKRLRQRPAQ 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG NN + + T A + ++I TI I A Sbjct: 194 S------------------RVLVLITDGANNGGQIHPLTAAR--LAAQEGVRIYTIGIGA 233 Query: 372 SPN 374 +P Sbjct: 234 NPE 236 >gi|260461186|ref|ZP_05809435.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259033220|gb|EEW34482.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 523 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 60/170 (35%), Gaps = 40/170 (23%) Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + M T+ + + ++L+ + + K P+ K ++ LT Sbjct: 359 RTAAAQMQEWNGSGTNVSEGLSWGMRVLSPAPPYTDGAPW-----KTPNTS--KIVVLLT 411 Query: 338 DGENNNFKSNVN--------------------------------TIKICDKAKENFIKIV 365 DGEN + ++ T+ +CDK K ++I Sbjct: 412 DGENVVYGASAEPEKSDYTSYGYLSSGRFGTSNQTDAARSVDRWTLDVCDKLKAQQVQIY 471 Query: 366 TISINAS-PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 TI++ + + L C ++P +Y V + L +VFQ I+ + Sbjct: 472 TITLQSDTAANRTLYGKCATNPADYYAVNDPSKLPNVFQTIAGKFTTLQL 521 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 50/264 (18%), Positives = 86/264 (32%), Gaps = 69/264 (26%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K SE NF+++FAL+ + L +GF V K +++++ +AA L+ Sbjct: 3 INKFWRSESGNFALLFALAAPAILAAVGFAADVSSVMRAKVNLQNSLDAATLSS------ 56 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +H+ + + F N+ N ++ Sbjct: 57 -----------------------------SHLSDDEAARRLAFDGYFQANVANHPELT-- 85 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 A L + + S D L F L NQ I + Sbjct: 86 -NAKLTLSVDKGFNYVKTKAIASADVNLYF-AFLFG--DNQHIEVDAGGVEATNN----- 136 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LV+D +GSM A K+ AL++A + LD++D Sbjct: 137 LEVVLVLDNTGSMAGA--------------------KIKALRDATKVLLDNLDGAKSPDR 176 Query: 251 DVYMGLIGYTTRVE---KNIEPSW 271 V ++ + T V +PSW Sbjct: 177 KVTAAIVPFVTAVNINGDKFDPSW 200 >gi|254292617|ref|YP_003058640.1| hypothetical protein Hbal_0241 [Hirschia baltica ATCC 49814] gi|254041148|gb|ACT57943.1| hypothetical protein Hbal_0241 [Hirschia baltica ATCC 49814] Length = 514 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 73/510 (14%), Positives = 152/510 (29%), Gaps = 111/510 (21%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + + K+ + + A + +FAL + L +IGF I K +++A ++A+LA Sbjct: 1 MRSIQKTLKQFLNATNAGVAPMFALFLTVILFIIGFTIDFRRMDSAKMHLQAATDSAVLA 60 Query: 64 GA------SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE 117 A S V R D + S++ L+ + F N I+ + N Sbjct: 61 AARAYLTSSVQVKETKRQEDSQKIASDYLTANLLSSSNNFENNQIQLVFKEDGEIVGNAS 120 Query: 118 IQN-IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + ++ + + L ++ ++++ D ++ ++F Sbjct: 121 TKIKLIFGGLFGKSDVVLPALAAATVGDSRKLEIVLVLDTSGSMSSQNRMKQLRTASINF 180 Query: 177 IPALLRIEMGERPI--------FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + ++ + ER + + + +D G+ + P NS + T Sbjct: 181 VNSVFDNAVYERTVQVGVVPWNATVNINMDRPGT--WDASPGPAIHNSNYGNGTNQVTSF 238 Query: 229 AALKNAL----------------------LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 L +L I ++ G + T V Sbjct: 239 QDFTENLYPPGFSDFGSYSDSDIDDDFGSSGWLGCITATKDERKISSSGNVTPLTDV-PP 297 Query: 267 IEPSWGTEKVRQY--------------------VTRDMDSLILKPTDSTPAMKQAYQ--I 304 + W KV + + + T + + Y+ Sbjct: 298 SKMKWPARKVAGWDPNSDCPSPMLAMSQSRPQIIKKLNQLNPSGNTHADIGLMWGYRMFS 357 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS------------------ 346 ++ F N + S +K +I LTDGEN S Sbjct: 358 QQANWNNFFGYNSDTKPDSFHSTKSRKIMIMLTDGENTATNSEGYSYYGWCTYTNHYNKW 417 Query: 347 --------------------------NVNTIKICDKAKENFIKIVTISINA----SPNGQ 376 N + C+ + +++ TI+++ Sbjct: 418 GRYTGSTKDCEVPKGINKDEISNNDLNSLMLDACEVIRSKDVELFTIALDLHSYYDSTAI 477 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 LL+ C S + YN+ + L FQ ++ Sbjct: 478 ALLRECAGSDSHAYNI-KGNELDETFQELA 506 >gi|54303502|ref|YP_133495.1| hypothetical protein PBPRB1845 [Photobacterium profundum SS9] gi|46916932|emb|CAG23695.1| conserved hypothetical protein [Photobacterium profundum SS9] Length = 321 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 50/268 (18%), Positives = 94/268 (35%), Gaps = 55/268 (20%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + + LL + + IE+ + LVVDLSGSM + Sbjct: 52 KVMMSLLWICLLGALARPVWYGDPIEVQPDHR-DMLLVVDLSGSMS---------IEDMI 101 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 I + ++AA+K+ L F++ ++ +GL+ + ++ V+Q Sbjct: 102 IKNGESIDRLAAVKDVLAEFIEQ-------RKGDRLGLVLFAQHAYLQTPLTFDRNTVKQ 154 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + R + LI + T + A + + S Q+ II L+D Sbjct: 155 QLERTVLGLIGQSTAIGEGLGIATKTFIN------------------SEAPQRVIILLSD 196 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------NGQRLLK 380 G N + ++ A E+ + I T+ + A + + L K Sbjct: 197 GANTAGV--IEPLEAAKLAAESNVTIYTVGVGAEEMIQKSFFGNRKVNPSQDLDERMLTK 254 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ N L H++Q I QL Sbjct: 255 IADMTGGQYFRARNPQELEHIYQLIDQL 282 >gi|254512360|ref|ZP_05124427.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] gi|221536071|gb|EEE39059.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] Length = 668 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 18/72 (25%), Positives = 35/72 (48%), Gaps = 1/72 (1%) Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 K N + +C KA+E + I +I+ A + +++LK C P +Y + VF Sbjct: 597 SKKNEQVVSLCGKAEEKEVLIFSIAFEAPSSVKQMLKDCAVKPARYYEATGT-QIERVFD 655 Query: 404 NISQLMVHRKYS 415 +IS + + + + Sbjct: 656 SISTSIQNLRLT 667 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 35/271 (12%), Positives = 94/271 (34%), Gaps = 22/271 (8%) Query: 3 LLSRFRF------YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 LL+R + ++ S + +I+ +M + GF + ++ + ++ ++ A Sbjct: 9 LLNRIYRENRSSRHLRRFAVSTDGSMTILTLFLIMIVFTVAGFAVDLMRYDRERVRLQYA 68 Query: 57 NNAAILAGAS--------KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 + A+LA A +V++ I + K +D + G Sbjct: 69 LDRAVLAAADLDQELCPRVVVNDYISKEGFDPGIIDEIK---VDPETCLNTDSSDSDGDG 125 Query: 109 --YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 S + + SS + + T V +++ L Sbjct: 126 TDSSDASGSDSDPSDTASSGTESGSDGTSSGGDTAGTSTTTNAVELQGKRKVEASAQLNI 185 Query: 167 QRYNQK---IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + + K + + + I LV+D+SGSM A ++ + + + Sbjct: 186 ETHFMKWSGVDTINSTAVSAAEESIGNVEISLVLDVSGSMEGAKLTNLQKAAKDFVKEML 245 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +++ +L +++ + + + + ++ + + Sbjct: 246 EKSADDSLSISIIPYSEQVGVPDYMMDKINT 276 >gi|127513358|ref|YP_001094555.1| von Willebrand factor, type A [Shewanella loihica PV-4] gi|126638653|gb|ABO24296.1| von Willebrand factor, type A [Shewanella loihica PV-4] Length = 339 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 44/237 (18%), Positives = 77/237 (32%), Gaps = 59/237 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + K + A ++ + F++ ++ Sbjct: 91 DLMLAVDLSGSM----------QIEDMVLNGKAVDRFAMVQQVMSEFIER-------RKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++T L+ K T A+ A + K+ Sbjct: 134 DKLGLILFADHAYLQAPLTQDRRSVAQFLTEAQIGLVGKQTAIGEAIALAVKRFDKAKQS 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN+ D A + + I TI + A Sbjct: 194 N------------------RVLILLTDGSNNSGSITPEQAA--DIAAKRGVTIYTIGVGA 233 Query: 372 --------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 LL + ++ N+D L ++Q I +L Sbjct: 234 EVMERRTLFGKERVNPSMDLDEAQLTLLAQ--KTKGRYFRARNSDELEQIYQEIDKL 288 >gi|149371021|ref|ZP_01890616.1| aerotolerance-related membrane protein [unidentified eubacterium SCB49] gi|149355807|gb|EDM44365.1| aerotolerance-related membrane protein [unidentified eubacterium SCB49] Length = 334 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 48/244 (19%), Positives = 81/244 (33%), Gaps = 66/244 (27%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 I + +D+S SM D K ++ ALK F+ + Sbjct: 90 KGIDIVMAIDVSASMLAR---------------DLKPDRLQALKQVAARFI-------NG 127 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR-DMDSLILKPTDSTPAMKQAYQILTS 307 + + +GL+ Y + V + + +S+I T + A L Sbjct: 128 RPNDRIGLVEYAGESYTKTPLTSDKTVVLSSLNSIEYNSIIEGGTAIGMGLATAVNRL-- 185 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 S K II LTDGENN+ + + A E IK+ TI Sbjct: 186 ----------------KESTAKSKVIILLTDGENNSGFIDPKIAS--ELAVEFGIKVYTI 227 Query: 368 SI----NAS------PNGQ------------RLLKTCV-SSPEYHYNVVNADSLIHVFQN 404 + AS PNG+ LLK ++ ++ + L +++ Sbjct: 228 GLGTNGMASSPIGILPNGRFQYGNQPVKIDETLLKEIAKTTGGQYFRATSNTKLNEIYEE 287 Query: 405 ISQL 408 I++L Sbjct: 288 INKL 291 >gi|77552603|gb|ABA95400.1| von Willebrand factor type A domain containing protein [Oryza sativa Japonica Group] Length = 574 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 40/240 (16%), Positives = 86/240 (35%), Gaps = 37/240 (15%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + +R + V+D+SGSM+ + ++ +++ LK A+ + + Sbjct: 96 VAPEKRAPIDLVAVLDVSGSMNKEEFVRGKHMS----------SRLDLLKIAMKYIIKLV 145 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQA 301 + ++ + V + + R+ + +D L TD PA+K+A Sbjct: 146 RDAD------RLAIVSFNHAVVSEYGLTRNSADSRKKLENLVDKLKASGNTDFRPALKKA 199 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQK----FIIFLTDGENNNFKSNVNTIKICDKA 357 + + ++ Q + +K FI+ L+DG + S +N K+ A Sbjct: 200 VEDMNIQNIKNSSAYNNFQILDGRGKEEKKKRVGFILLLSDGVDQFQYSRINWEKV---A 256 Query: 358 KENFI------------KIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQN 404 K + + T +AS + L + S + + N D++ F Sbjct: 257 KSTDVDHSEVGAMLRKYAVHTFGFSASHDPVPLRQISALSYGLYSFVCKNLDNITEAFAR 316 >gi|254459074|ref|ZP_05072497.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084345|gb|EDZ61634.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 279 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 44/229 (19%), Positives = 85/229 (37%), Gaps = 34/229 (14%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++ R I L +D SGSM+ + ++V+ + TK+ + Sbjct: 45 KLSPNNRHGKDIVLAIDASGSMNSSGFDFEDEVSDGKRLSRFEITKI------IASEFIQ 98 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + +V G++ Y ++ E V Q + + + T A+ Sbjct: 99 KRISDNV------GVVLYGDFAFIASPITYEKEIVTQMLGYLTQGMAGQNTAIGEAIAMG 152 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K ++ K I+ L+DGE+N+ + + AKE Sbjct: 153 VRSFKHSKAKT------------------KVIVLLSDGEHNSGSVSPKEAT--ELAKEQG 192 Query: 362 IKIVTISI-NASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 IKI TI++ N + LL+T S ++ +A L +++ I +L Sbjct: 193 IKIYTIAMGNKGEADEALLETIAKDSNGEFFSASSAKELKNIYDEIDKL 241 >gi|118094354|ref|XP_422360.2| PREDICTED: similar to calcium-activated chloride channel [Gallus gallus] Length = 928 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 41/256 (16%), Positives = 91/256 (35%), Gaps = 59/256 (23%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 N + F ++ +R + LV+D+SGSM ++ Sbjct: 287 VNSLVPPFETTFELLQTQDRA---VSLVLDVSGSM-------------------NTNNRI 324 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSL 287 L+ A +FL I + +G++ + + + T RQ + +++ + Sbjct: 325 TNLRTAAEVFLIQIIEI-----GSRVGIVTFESSAYEKSPLLQITSVATRQRLVQNLPTT 379 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +++ +I+T+ ++ + I+ LTDGE++ Sbjct: 380 AGGGTKICAGIEKGLEIITNAIGTTYGSE----------------IVLLTDGEDS----- 418 Query: 348 VNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---ADSLIHVFQ 403 T+ +C +K KE+ I TI++ + L + + V+ L+ F Sbjct: 419 --TMSLCREKVKESGAIIHTIAL-GPSAAKELEEFSNITGGLQLYAVDVDVPSKLVEAFS 475 Query: 404 NISQ---LMVHRKYSV 416 I+ + + + Sbjct: 476 EITTGSGDISEQSIQL 491 >gi|257063307|ref|YP_003142979.1| hypothetical protein Shel_05710 [Slackia heliotrinireducens DSM 20476] gi|256790960|gb|ACV21630.1| hypothetical protein Shel_05710 [Slackia heliotrinireducens DSM 20476] Length = 1514 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 38/243 (15%), Positives = 88/243 (36%), Gaps = 29/243 (11%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED--VYM 254 +D SG++ C + + D+ TK L+N++ F ++ LS + Sbjct: 718 IDNSGTLKCRFQFVQNSCSPVYMKADEASTKSEVLQNSIARFAATLGYLSPGSQIAMTRF 777 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK-------------QA 301 + ++ + + T +V + ++ + + + + + Sbjct: 778 SVDTFSNAECALLNWTNDTGEVTAAMNQEYGNPLAEGGRANQTLDGLRVYNYGITGSTHT 837 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Y+ + S + P +++I TDG++N+ ++ D K N Sbjct: 838 YRGIESYIENMTNGASGGYVPNAPQGNNSRYLIIFTDGKDNSGNL-QKSMDDTDALKNNG 896 Query: 362 IKIVTISINAS-------PNGQRLLKTCVSS----PEYHYNV--VNADSLIHVFQNISQL 408 I+T+ + ++ + LK SS +Y Y + + L+ VFQ+I+ Sbjct: 897 YTIITVLMQSAGMTSEDVEHSTTFLKRLASSNASGEKYFYTAMYNDPEGLVKVFQDIAHE 956 Query: 409 MVH 411 + Sbjct: 957 IAK 959 >gi|189465623|ref|ZP_03014408.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] gi|189437897|gb|EDV06882.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] Length = 327 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 84/276 (30%), Gaps = 63/276 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y + F+ ++ +++ + E I + +D+S SM Sbjct: 51 YLLHVPFMLRIIALALIIVVLARPQTTNSWQNSEIEGIDIMMAIDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K+ F++ +G+ + + Sbjct: 104 --------DLKPNRLEAAKDVAAEFIN----GRPNDN---IGITLFAGESFTQCPLTVDH 148 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + +I T + A L K +S K I Sbjct: 149 AVLLNLFQGIKCGIIEDGTAVGMGIANAVTRLKDSKAKS------------------KVI 190 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------- 372 I LTDG NN K +++ + + AK I++ TI + A Sbjct: 191 ILLTDGTNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVE 248 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ ++ + L V++ I +L Sbjct: 249 IDEKTLTQIAATTEGNYFRATSNSKLKEVYEEIDKL 284 >gi|83647467|ref|YP_435902.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83635510|gb|ABC31477.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 345 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 36/242 (14%), Positives = 79/242 (32%), Gaps = 55/242 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + L VD+S SM + + + T++ +K+ + F+ Sbjct: 88 PMDYEARDLLLAVDISPSM----------QETDLQLKGNQATRLDVVKSVVTDFIQ---- 133 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 V++ +GLI + + ++ V + + + T A+ + Sbjct: 134 ---VRQGDRLGLILFGAQPYIQAPLTYDLVTVGELLNEATLGIAGNATAIGDAIGLGIKR 190 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L S + ++ LTDG N + + A + IKI Sbjct: 191 LRERPADS------------------RVLVLLTDGANTGGEVSPEQAA--KLAADAGIKI 230 Query: 365 VTISINAS-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNIS 406 T+ + A + LL++ + ++ N L ++++I+ Sbjct: 231 YTVGVGADEIIRRGIFGYRKENPSADLDETLLQSIADETDGQYFRARNTGELELIYESIN 290 Query: 407 QL 408 QL Sbjct: 291 QL 292 >gi|78357411|ref|YP_388860.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219816|gb|ABB39165.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 402 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 43/290 (14%), Positives = 87/290 (30%), Gaps = 81/290 (27%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D SGSM + ++ + + ++K L+ F + + + Sbjct: 117 LEVVFVIDNSGSMKGSPINETNAAATRLVDLIMPEGMATSVKIGLVPFRGKVR-IPADVD 175 Query: 251 DVYMGL----------------------IGYTTRVE------KNIEPSWGT----EKVRQ 278 + G Y R+ +I + G + Q Sbjct: 176 GLPSGCRNADGSLNEDGLLDEYKKPEYRYPYNDRLRVTPYSCSSIPLTQGLTADRATITQ 235 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + R T + +K A +LT + F + +K II LTD Sbjct: 236 AIGRQDARGDSSGTVISEGLKWARHVLT--PEAPFTEGSSAK-------DMRKVIILLTD 286 Query: 339 GENNNFKSNVNTIKI--------------------CDK--------------AKENFIKI 364 G+ + N C+ AK+ I+I Sbjct: 287 GDTEDGNCGGNYSVYYRPNNYWTNAYYGMMDMDSHCEDGGVLNNAMLSEAALAKDAGIEI 346 Query: 365 VTISINASPN-GQRLLKTCVSS----PEYHYNVVNADSLIHVFQNISQLM 409 I +S + L++ SS +++++ + + VF+ I + + Sbjct: 347 FAIRYGSSDAVDRNLMRAVASSKEGTDDHYFDAPSPYDIDDVFKLIGRQL 396 >gi|307353371|ref|YP_003894422.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307156604|gb|ADN35984.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 317 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 43/245 (17%), Positives = 87/245 (35%), Gaps = 60/245 (24%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + I +D SGSM A D + ++ A K A+ ++ +DL Sbjct: 84 QTKEGVNIVFALDTSGSMEAA---------------DYQPDRITAAKEAIGTLINQLDLK 128 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 Y G+I + + S ++V + + M + T + A + Sbjct: 129 D------YAGIITFDSGASTAAYLSPDKQRVIEKL--GMIAASDDSTAIGDGLALAVDMS 180 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S R + +I L+DGE+N + T + AKE+ +++ Sbjct: 181 KSIPNR------------------KSVVILLSDGESNAGYVSPETAA--EFAKESGVQVF 220 Query: 366 TISINASPN----------------GQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 T+++ +S + L+ S+ Y+ V+ +L +++ + Sbjct: 221 TVAMGSSEKVLVGYDWANNPQYATVDEETLEYIADSTGGGFYSSVDEKTLGNIYSQLDDA 280 Query: 409 MVHRK 413 +VH K Sbjct: 281 IVHEK 285 >gi|37676326|ref|NP_936722.1| hypothetical protein VVA0666 [Vibrio vulnificus YJ016] gi|37200868|dbj|BAC96692.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 362 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 43/224 (19%), Positives = 76/224 (33%), Gaps = 35/224 (15%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +++ A K L F + ++ Sbjct: 107 DVMVVVDLSGSMA---------EQDFTSASGANISRLDATKEVLAEFAKT-------RQG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + ++ + T A+ A + + Sbjct: 151 DRLGLILFGDAAFVQTPFTADQKVWLALLNQTDVAMAGQSTHLGDAIGLAI-KVFEQSES 209 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + S P QK I LTDG N+ S V I AK ++I I++ Sbjct: 210 NQAA---------SSKPRQKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRIHVIAMGD 258 Query: 372 SPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G+ L + S + +N D L + +I +L Sbjct: 259 PSTVGESALDLQTIERIASESGGKAFQALNRDELARAYDDIGKL 302 >gi|103487755|ref|YP_617316.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] gi|98977832|gb|ABF53983.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] Length = 666 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 22/117 (18%), Positives = 38/117 (32%), Gaps = 29/117 (24%) Query: 325 PSLPFQKFIIFLTDGE--------------------------NNNFKSNVNTIKICDKAK 358 P + I+F+TDG + + N ++C A+ Sbjct: 552 NDRPISRHIVFMTDGAMAPNMGNLTFQGYEFLMHRVGGTSDSDLRDRHNNRFTQLCRAAR 611 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + I I +S L C SS + NA L FQ I++ + + + Sbjct: 612 QRGITIWVVSFGV--GSNDSLNNCASS-GQAFEADNAAELNEQFQAIARQISKLRLA 665 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 37/273 (13%), Positives = 91/273 (33%), Gaps = 62/273 (22%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 +SR K I+ ++ N ++ A +++ + +G + + + + ++ A +A +LA Sbjct: 6 ISRLCAGTKSLISDQRGNAFMLTAAAIIPVIGFVGSAVDIGRAYMTQLRLQQACDAGVLA 65 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G + + S + A +A + F K G + + +++ N + Sbjct: 66 G-RRAMGGASY---------DEAAQAEANKMFNFNFPEAKY---GATGILFSSRALNASD 112 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + + + + + L Sbjct: 113 VEGQASAVLPTE-------------------------LMFMFGKEEFRLSADCTAKL--- 144 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 E + LV+D++GSM ++ ALK+A + F D++ Sbjct: 145 ---EISNVDVMLVLDVTGSMA-------------QTNAGDSVNRITALKDATMDFFDTLT 188 Query: 244 LLSHVKEDVYMGLIGYTT-----RVEKNIEPSW 271 + G++ Y++ ++ P+W Sbjct: 189 NADVGDGRLRFGVVPYSSTANVGQILLAKNPAW 221 >gi|52548788|gb|AAU82637.1| cell surface protein [uncultured archaeon GZfos18H11] Length = 1359 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 45/249 (18%), Positives = 93/249 (37%), Gaps = 25/249 (10%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 SY + +L V ++ + + + LV+D+SGSM + D Sbjct: 917 SYALSKDPVSEVLGY---YLSVGLNSGIIESDFQRKK-LNLALVLDISGSMGSSF--DEY 970 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW- 271 + T+ A + + LL H+++D +GL+ + T E S Sbjct: 971 YYDRFGNHVAVNDTEDAEKSKIEIAAAAIVALLDHLEDDDRLGLVLFNTGAELAEPVSLV 1030 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 G + +++ ++ T + M+ A T + + +++ ++ Sbjct: 1031 GAKNMQKLKGDVLEISATGGTRLSAGMQMA-------------TELYDEFLEVNQSEYEN 1077 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT--ISINASPNGQRLLKTCVS-SPEY 388 IIFLTD N+ +++ ++ +A N + T I I N L++ Sbjct: 1078 RIIFLTDAMPNSGQTSEESLLGMIEANANK-NVYTTFIGIGVDFN-TELVEYITKIRGAN 1135 Query: 389 HYNVVNADS 397 +Y+V +A Sbjct: 1136 YYSVHSATQ 1144 >gi|331694297|ref|YP_004330536.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326948986|gb|AEA22683.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 332 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 58/193 (30%), Gaps = 30/193 (15%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 R + L D+SGSM D T++ A K A F+ Sbjct: 81 PQPRREGTVVLAFDVSGSMAA---------------TDIAPTRLEAAKAAARGFVQRQPA 125 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 V +G++ + +P+ V + R T ++ A Sbjct: 126 A------VRIGIVAFGATGLVTQQPTSDRASVVAAIDRL---SPQGGTALGGGLQTALGA 176 Query: 305 LTSDKKRSFFTNFFRQGVKIPSL---PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + ++ ++ LTDGEN + + +++ D A Sbjct: 177 IVGKPVVVPGSDPGGGPEPSGPDLGYHGSAAVVLLTDGENT---AQPDPLQVADIASTAG 233 Query: 362 IKIVTISINASPN 374 +K+ I + + Sbjct: 234 VKVYPIGLGSPAG 246 >gi|120554865|ref|YP_959216.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120324714|gb|ABM19029.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 339 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 80/235 (34%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVD+S SM + Q + ++ A+K L F+D +E Sbjct: 90 DLMLVVDISPSMD----------EQDMVLQGRSINRLQAVKRVLDDFIDQ-------REG 132 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T ++ E VR + + + T A+ + + L + Sbjct: 133 DRLGLILFGTEPYVQAPLTFDRETVRTLLFEAGLGMAGRATAIGDAIGLSVKRLRERPQE 192 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ +I LTDG N + + + + A+ +++ TI I A Sbjct: 193 ------------------QRVVILLTDGANTAGQVSPDKAT--EIAQAAGVRLYTIGIGA 232 Query: 372 ---------------SPNGQR---LLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 L + + ++ + L ++ +I+QL Sbjct: 233 DTMIQRGLLGSRRVNPSRDLDEELLTRMAEQTGGRYFRARSLPELEMIYDSINQL 287 >gi|150007595|ref|YP_001302338.1| hypothetical protein BDI_0948 [Parabacteroides distasonis ATCC 8503] gi|255013876|ref|ZP_05286002.1| hypothetical protein B2_08207 [Bacteroides sp. 2_1_7] gi|256839782|ref|ZP_05545291.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|149936019|gb|ABR42716.1| conserved hypothetical protein BatA [Parabacteroides distasonis ATCC 8503] gi|256738712|gb|EEU52037.1| conserved hypothetical protein [Parabacteroides sp. D13] Length = 328 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 48/284 (16%), Positives = 85/284 (29%), Gaps = 64/284 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y + F+ ++ I++ + I L +D+SGSM Sbjct: 52 YLRHVPFVLRMVAVALLIVILARPQSTNSWSNSSTEGIDIMLAMDISGSMLA-------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 QD K ++ A K+ F++ +GL+ ++ + Sbjct: 104 -------QDLKPNRLEAAKDVAASFIN----GRPNDN---IGLVVFSAESFTQCPLTTDH 149 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + +I T + A + +S K I Sbjct: 150 TVLLNLFKDIQSGMIQDGTAIGLGLANAVSRIKDSHAKS------------------KVI 191 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------P 373 I LTDG NN + T + AK +++ TI + A Sbjct: 192 ILLTDGSNNAGEIAPVTAA--EIAKTFGVRVYTIGVGTKGMAPYPFQTAFGVQYQNIPVE 249 Query: 374 NGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + LK S + ++ + SL ++ I Q M K SV Sbjct: 250 IDEATLKQIASTTGGQYFRATDNASLKEIYSEIDQ-MEKTKISV 292 >gi|119470787|ref|ZP_01613398.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] gi|119446014|gb|EAW27293.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] Length = 328 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 78/234 (33%), Gaps = 54/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VDLSGSM + ++ +K L F++ + Sbjct: 88 DIMLAVDLSGSM----------TEQDMAYNGQYVDRLTMVKAVLSDFIEQ-------RTG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + L+ + T A+ + + S Sbjct: 131 DRLGLILFGDTAFLQTPLTRDLKTVTKMLNEAQIGLVGRATAIGDALGLSVKRFASKDDS 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-N 370 + + ++ LTDG+N N + + A+E IK+ TI + + Sbjct: 191 N------------------RIVVLLTDGQNTAGNLNPDDALL--LAREEGIKVYTIGVGS 230 Query: 371 ASPNG---------------QRLLKTCVSS-PEYHYNVVNADSLIHVFQNISQL 408 +P G +RLLK ++ + L ++ + +L Sbjct: 231 DNPRGFSLFNMGGSGGSNLDERLLKNIADDTGGLYFRAKDVAGLKQIYAELDKL 284 >gi|224539999|ref|ZP_03680538.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] gi|224518389|gb|EEF87494.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] Length = 327 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 84/276 (30%), Gaps = 63/276 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y + F+ ++ +++ + E I + +D+S SM Sbjct: 51 YLLHVPFMLRIIALALIIVVLARPQTTNSWQNSEIEGIDIMMAIDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K+ F++ +G+ + + Sbjct: 104 --------DLKPNRLEAAKDVAAEFIN----GRPNDN---IGITLFAGESFTQCPLTVDH 148 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + +I T + A L K +S K I Sbjct: 149 AVLLNLFQGIKCGIIEDGTAVGMGIANAVTRLKDSKAKS------------------KVI 190 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------- 372 I LTDG NN K +++ + + AK I++ TI + A Sbjct: 191 ILLTDGTNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVE 248 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ ++ + L V++ I +L Sbjct: 249 IDEKTLTQIAATTEGNYFRATSNSKLKEVYEEIDKL 284 >gi|71278376|ref|YP_269691.1| von Willebrand factor type A domain-containing protein [Colwellia psychrerythraea 34H] gi|71144116|gb|AAZ24589.1| von Willebrand factor type A domain protein [Colwellia psychrerythraea 34H] Length = 364 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 46/241 (19%), Positives = 81/241 (33%), Gaps = 38/241 (15%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDP----EDVNSAPICQDKKRTKMAALKNALLLFLDS 241 E+ + + VDLSGSM + P E N A D TK + ++ Sbjct: 89 QEKSARDLMIAVDLSGSMAVEDFTLPIATNELTNRAKNDTDSSATKSSTNDTGKGEKVNR 148 Query: 242 IDLLSHV-------KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + + HV +E +GLI + + + + + + T Sbjct: 149 LVAVKHVLNAFVKSREHDRLGLILFGDAPYLQAPFTDDIATWQALLNESDIGMAGQSTAF 208 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ A + S + +I LTDG N+ S V ++ Sbjct: 209 GDAIGLAISVFQQ------------------SDTQNRVLIVLTDG--NDTASKVPPVEAA 248 Query: 355 DKAKENFIKIVTISINASPN------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQ 407 A IKI TI+I +L+ + + +N++ L+ V+ I + Sbjct: 249 KVAAARDIKIYTIAIGDPSAVGEEKVDLEVLQAMAEITQGKSFQALNSEELLKVYAEIDR 308 Query: 408 L 408 L Sbjct: 309 L 309 >gi|153871328|ref|ZP_02000529.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152072210|gb|EDN69475.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 280 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 81/220 (36%), Gaps = 53/220 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L++D+S SM + +A K A F+ DL Sbjct: 90 SVFLLIDVSYSMDG--------------------SALAEAKQAAQEFVRKSDLA-----H 124 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ + + + + + + R + T+ T + AY L + Sbjct: 125 TAIGLIEFGSKAKIISGLTQNAKHLYKAINRLKTN---GSTNMTEGLTTAYLKLKN---- 177 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +FII LTDG N+ K NT +I + + I+++TI Sbjct: 178 ---------------VDDPRFIILLTDGLPNHPK---NTQQIAQEICADGIELITIG--T 217 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + L++ + + A +++ F I+Q++ Sbjct: 218 GDADKTYLQSLACYDQNSFFA-KAGTMVSTFSRIAQVLTE 256 >gi|262381906|ref|ZP_06075044.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|298375541|ref|ZP_06985498.1| BatA protein [Bacteroides sp. 3_1_19] gi|301310439|ref|ZP_07216378.1| BatA protein [Bacteroides sp. 20_3] gi|262297083|gb|EEY85013.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|298268041|gb|EFI09697.1| BatA protein [Bacteroides sp. 3_1_19] gi|300832013|gb|EFK62644.1| BatA protein [Bacteroides sp. 20_3] Length = 328 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 48/284 (16%), Positives = 85/284 (29%), Gaps = 64/284 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y + F+ ++ I++ + I L +D+SGSM Sbjct: 52 YLRHVPFVLRMVAVALLIVILARPQSTNSWSNSSTEGIDIMLAMDISGSMLA-------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 QD K ++ A K+ F++ +GL+ ++ + Sbjct: 104 -------QDLKPNRLEAAKDVAASFIN----GRPNDN---IGLVVFSAESFTQCPLTTDH 149 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + +I T + A + +S K I Sbjct: 150 TVLLNLFKDIQSGMIQDGTAIGLGLANAVSRIKDSHAKS------------------KVI 191 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------P 373 I LTDG NN + T + AK +++ TI + A Sbjct: 192 ILLTDGSNNAGEIAPVTAA--EIAKTFGVRVYTIGVGTKGMAPYPFQTAFGVQYQNIPVE 249 Query: 374 NGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + LK S + ++ + SL ++ I Q M K SV Sbjct: 250 IDEATLKQIASTTGGQYFRATDNASLKEIYSEIDQ-MEKTKISV 292 >gi|329963581|ref|ZP_08301060.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528570|gb|EGF55541.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 327 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 44/276 (15%), Positives = 84/276 (30%), Gaps = 63/276 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y + F ++ +++ + E I L +D+S SM Sbjct: 51 YLLHVPFALRIIALVLIILVLARPQTTDSWQNSEIEGIDIMLAMDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K+ F++ +G+ + + Sbjct: 104 --------DLKPNRLEAAKDVAAEFIN----GRPNDN---IGITLFAGESFTQCPLTVDH 148 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + LI T + A L K +S K I Sbjct: 149 AVLLNLLKDMKCGLIEDGTAIGMGIANAVTRLKDSKAKS------------------KVI 190 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------- 372 I LTDG NN K +++ + + AK I+I TI + A Sbjct: 191 ILLTDGVNN--KGDISPLTAAEIAKSFGIRIYTIGVGTNGMAPYPYPVGGTVQYVNMPVE 248 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ ++ + L V++ I +L Sbjct: 249 IDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDKL 284 >gi|218131126|ref|ZP_03459930.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|317476996|ref|ZP_07936238.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|217986646|gb|EEC52980.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|316906789|gb|EFV28501.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 327 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 44/276 (15%), Positives = 84/276 (30%), Gaps = 63/276 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y + F ++ I++ + E I L +D+S SM Sbjct: 51 YLLHVPFALRIIALALVILILARPQTTNSWQNSEIEGIDIMLAIDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K+ F++ +G+ + + Sbjct: 104 --------DLKPNRLEAAKDVAAEFIN----GRPNDN---IGITLFAGESFTQCPLTVDH 148 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + LI T + A L K +S K I Sbjct: 149 AVLLNLIKDVKCGLIEDGTAVGMGIANAVTRLKDSKAKS------------------KVI 190 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------- 372 I LTDG NN K +++ + + AK I++ TI + A Sbjct: 191 ILLTDGTNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVE 248 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ ++ + L V++ I +L Sbjct: 249 IDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDKL 284 >gi|119358220|ref|YP_912864.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] gi|119355569|gb|ABL66440.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] Length = 344 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 74/219 (33%), Gaps = 44/219 (20%) Query: 171 QKIVSFIPALLRIEM-GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 P +++ + E + LV+D+S SMH ++++ Sbjct: 82 SVFAMAQPRIVQRQTVAETRGIDLLLVLDVSRSMHQQ--------------DFNGQSRLE 127 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 A+K + +GL+ ++ + + + + +I Sbjct: 128 AVKG-VGKQFVLSRSAD------RIGLVVFSGKGYTPCPLTLDHLTLGTVLDNISSEVIQ 180 Query: 290 K-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + T A+ A L + + R QK II LTDG+NN + Sbjct: 181 EEGTAIGTAILIAVNRLRASESR------------------QKAIILLTDGQNNAGDIDP 222 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 T A ++ IKI TI+ A + +++ S P Sbjct: 223 LTAA--GFALQDGIKIYTIAATAQDA-RPFVRSAESLPA 258 >gi|126662671|ref|ZP_01733670.1| batA protein [Flavobacteria bacterium BAL38] gi|126626050|gb|EAZ96739.1| batA protein [Flavobacteria bacterium BAL38] Length = 334 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 52/312 (16%), Positives = 97/312 (31%), Gaps = 72/312 (23%) Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 + +++S T ++ N F Y RL + ++ + V Sbjct: 27 IKRNQLSATLKMSSITPFKGNRTFLAKAKPFLYVLRLLALSAIIIAMARPRSVDVTS--- 83 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + I + +D+S SM D K ++ ALK F+ Sbjct: 84 --KSKTTRGIDIVMAIDVSSSMLA---------------NDLKPNRLEALKKVAATFVQ- 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV--TRDMDSLILKPTDSTPAMK 299 D ++ +GL+ Y + + Q + DS+I T + Sbjct: 126 -DRIND-----RIGLVVYAGESYTRTPVTSDKTIILQSLKSVEFDDSIIADGTGIGVGLA 179 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + K +S + II LTDG NN+ + T AKE Sbjct: 180 TAINRIKDSKAKS------------------RIIILLTDGVNNSGTIDPRTAA--SIAKE 219 Query: 360 NFIKIVTISI----------------------NASPNGQRLLKTCV-SSPEYHYNVVNAD 396 IK+ TI I ++L++ ++ ++ + Sbjct: 220 YGIKVYTIGIGTNGKAMFPVAKDANGKLVFKMMPVEIDEKLMQEIAKNTDAKYFRATSNK 279 Query: 397 SLIHVFQNISQL 408 L ++ I++L Sbjct: 280 KLQAIYDEINKL 291 >gi|331090683|ref|ZP_08339532.1| hypothetical protein HMPREF9477_00175 [Lachnospiraceae bacterium 2_1_46FAA] gi|330400097|gb|EGG79748.1| hypothetical protein HMPREF9477_00175 [Lachnospiraceae bacterium 2_1_46FAA] Length = 3699 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 40/221 (18%), Positives = 80/221 (36%), Gaps = 49/221 (22%) Query: 221 QDKKRTKMAALKNALLLFLDSI----DLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEK 275 + ++KM ALK A+ F D ++ + M ++ +++ ++ + Sbjct: 217 KPNSQSKMGALKIAVNQFAQETAKRNDSITDAAKQHRMSIVTFSSESYIRQSLKAYNSNT 276 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 V ++ T + M++A + L + ++++ QK +IF Sbjct: 277 VSEFERTINGLNANGATYANLGMEKAKESLKNVREKA-----------------QKVVIF 319 Query: 336 LTDGE--NNNFKSNV--NTIKICDKAKENFIKIVTIS-------------INASPNGQRL 378 TDG + F + NTI+ K++ KI +I NA +G Sbjct: 320 FTDGTPGRSGFDDDTANNTIQAAKSLKDDLTKIYSIGVFDQANPDNTSSSFNAYMHGVSS 379 Query: 379 ----------LKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 L + Y+ +AD L +F+ I + M Sbjct: 380 NYPNATKWTELGERAENSNYYKAAQDADELNKIFEEIFEEM 420 >gi|218887819|ref|YP_002437140.1| von Willebrand factor A [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218758773|gb|ACL09672.1| von Willebrand factor type A [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 406 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 55/439 (12%), Positives = 128/439 (29%), Gaps = 90/439 (20%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM-----VSNLSRLG 76 +++ A+ + L L G I + N ++ A +AA LAG+ ++ + G Sbjct: 1 MAMLMAVLLPVVLGLAGLGIDSGMLYLAHNRLQGAVDAAALAGSLELPYDPQLDKGLVKG 60 Query: 77 DRFESISNHAKRALIDDAKRFIKN---HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++ + A++ + +K + + I + ++ + + Sbjct: 61 AVNQYMAANYPAAVLKGVTPGTEERSVTVKAEATVDTIFMGALGIGSSTVRAQATAGYNN 120 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS----FIPALLRIEMGERP 189 + +NT + + + L+ + V + I G Sbjct: 121 LEVVFVIDNTGSMKGTAIQQANAAATQLAELIMPDGMETSVKVGLVPFRGKVHIPAGVDG 180 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + D + + + + P K + D + V+ Sbjct: 181 LADGCRNADGTLAPSWILEEYKQTKYRYPTGSSLNVPK------------GTCDSIPRVQ 228 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + + + + T + +K +LT + Sbjct: 229 ------------------ALTSNRTTIVSAIAKQDALGDASGTVISEGIKWGRHVLTPEA 270 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI------------------ 351 + + + +K +I LTDG+ + K N Sbjct: 271 PFTQGS---------SNKDMRKVMIVLTDGDTEDGKCGGNYALNYTPNAYWTNAYYGMFD 321 Query: 352 --KICDK--------------AKENFIKIVTISI-NASPNGQRLLKTCVSS----PEYHY 390 C+ AK+ I+I I ++ L+K SS +++Y Sbjct: 322 MNTHCENGGKLNAAMLSEAQIAKDKGIEIFAIRYGDSDSTDISLMKAIASSKAGTDDHYY 381 Query: 391 NVVNADSLIHVFQNISQLM 409 N +A L +F+ I + + Sbjct: 382 NAPSAYDLEEIFKKIGRQL 400 >gi|170727371|ref|YP_001761397.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169812718|gb|ACA87302.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 330 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 79/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + K + +++ + F++ ++ Sbjct: 84 DLMVAVDLSGSM----------QIEDMVLDGKAVNRFIMVQSVVSDFIER-------RKG 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ + + + Sbjct: 127 DKLGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALSVKRFDLVDES 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + ++ LTDG NN+ + D A + IKI +I + A Sbjct: 187 N------------------RILVLLTDGSNNSGSISPEQAA--DIAAKRGIKIYSIGVGA 226 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++L ++ ++ NA L ++Q I +L Sbjct: 227 DVMERRTLFGKERVNPSMDLDEEQLTSLAQTTGGRYFRARNAQELEQIYQEIDKL 281 >gi|302870768|ref|YP_003839404.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] gi|302573627|gb|ADL41418.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] Length = 900 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 50/247 (20%), Positives = 86/247 (34%), Gaps = 46/247 (18%) Query: 165 LNQRYNQKIVSFIPALLRIEMGER-PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L N + +P + I+ E+ + LV+D SGSM +D Sbjct: 379 LGNYSNSVLEKMLPVKMEIKNKEKEKNIDVVLVLDHSGSMA--------------DTEDA 424 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 +K+ K+A ++ ++ V G+I + E S K + + Sbjct: 425 GISKLEIAKSASAKMIEHLESSDGV------GVIAFDHNYYWAYEFSKLVRK-KDVIESI 477 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T P + +A + L K +S I+ LTDG Sbjct: 478 SSIEVGGGTAIIPPLSEAVKTLKKSKAKSKL------------------IVLLTDGMGEQ 519 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVF 402 + ++AK N IKI TI + N +L S + Y V N L+ VF Sbjct: 520 GGYEIPA----NEAKRNNIKITTIGVGKFVN-LPVLSWIASFTSGRFYLVSNPYELVDVF 574 Query: 403 QNISQLM 409 ++++ Sbjct: 575 LKETKII 581 >gi|162454786|ref|YP_001617153.1| hypothetical protein sce6504 [Sorangium cellulosum 'So ce 56'] gi|161165368|emb|CAN96673.1| hypothetical protein sce6504 [Sorangium cellulosum 'So ce 56'] Length = 381 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 71/193 (36%), Gaps = 29/193 (15%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC-QDKKRTKMAALKNALLL 237 ++LR + + I + +DLSGSM +++ D+ P + K+ T++ K L Sbjct: 111 SVLREQRSDDKGIDIVVALDLSGSMRAILDARASDLPGQPKLPRGKRLTRLDTAKLVLQD 170 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 F+ + +G++ + P+ + Q V++ ++I T Sbjct: 171 FISR-------RRTDRLGVVVFGKAAYVLSPPTLDYHLLTQMVSQMTLNVIDGSATAIGD 223 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ A L +S K +I LTDG++N + Sbjct: 224 ALGTAVARLRRSDAQS------------------KVVILLTDGDSNAGAISPEYAT--HL 263 Query: 357 AKENFIKIVTISI 369 A K+ TI I Sbjct: 264 ATSLGAKVYTIQI 276 >gi|119775307|ref|YP_928047.1| von Willebrand factor type A domain-containing protein [Shewanella amazonensis SB2B] gi|119767807|gb|ABM00378.1| von Willebrand factor type A domain protein [Shewanella amazonensis SB2B] Length = 327 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 81/235 (34%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + +K + +++ + F++ + Sbjct: 84 DLMVAVDLSGSM----------QIEDMVLDNKTVDRFTLVQHVVSDFIER-------RVG 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V QY+ L+ K T ++ A + + ++ Sbjct: 127 DRIGLILFGDHAYLQSPMTQDRRSVAQYLREAQIGLVGKQTAIGESIALAVKRFENLEES 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + ++ LTDG NN + + A E + I TI + A Sbjct: 187 N------------------RVLVLLTDGTNNAGSISPDKAA--AIAAERKVTIYTIGVGA 226 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++L + ++ ++ +++ L ++Q I +L Sbjct: 227 EMMERRSFFGRDRVNPSMDLDEEQLQRIANATQGKYFRARSSEDLAAIYQEIDKL 281 >gi|320158501|ref|YP_004190879.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933813|gb|ADV88676.1| BatA [Vibrio vulnificus MO6-24/O] Length = 362 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 44/224 (19%), Positives = 76/224 (33%), Gaps = 35/224 (15%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +++ A K L F + ++ Sbjct: 107 DVMVVVDLSGSMA---------EQDFTSASGANISRLDATKEVLAEFAKT-------RQG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + ++ + T A+ A + + Sbjct: 151 DRLGLILFGDAAFVQTPFTADQKVWLALLNQTDVAMAGQSTHLGDAIGLAI-KVFEQSEP 209 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S S P QK I LTDG N+ S V I AK ++I I++ Sbjct: 210 SQAA---------SSKPRQKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRIHVIAMGD 258 Query: 372 SPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G+ L + S + +N D L + +I +L Sbjct: 259 PSTVGESALDLQTIERIASESGGKAFQALNRDELASAYDDIGKL 302 >gi|163745746|ref|ZP_02153106.1| hypothetical protein OIHEL45_09145 [Oceanibulbus indolifex HEL-45] gi|161382564|gb|EDQ06973.1| hypothetical protein OIHEL45_09145 [Oceanibulbus indolifex HEL-45] Length = 554 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 46/115 (40%), Gaps = 9/115 (7%) Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Y + R + ++++ F+ +I + D + + + +IC A Sbjct: 447 YYYGYNMYGRRYNSSWYWYQRAT---DFRDYIQWTVD----SVEKDRRLRQICGVANAAG 499 Query: 362 IKIVTISINAS-PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + I +I ++ N LLK C SS ++++V + F I+ + + + Sbjct: 500 VVIYSIGMDVDNTNSLNLLKDCASSESHYFDVEGL-EIQTAFDMIAASISMLRLT 553 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 44/269 (16%), Positives = 83/269 (30%), Gaps = 64/269 (23%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M L+SR + K E ++ + + G + + + + + ++ + A Sbjct: 1 MMLISRIAHFRK----DEGGGMLVLMLIVFFGITIFGGLAVDLANHERTRTTFQTHLDNA 56 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +LA AS LS+ D E + ++ A +E++ Sbjct: 57 VLAAAS-----LSQDLDAEEVVRSYLTSA----------------------GLDPSEVEI 89 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 +I + ++ F I S Sbjct: 90 ETREEKI------------GGILVGRTVEASLPAGLNTYFFRFFDIDTLGMTISSEATER 137 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + I LV+D+SGSM I D+ KM LK A F++ Sbjct: 138 VED-------IEISLVLDVSGSMG-------------DITSDRSGIKMDLLKRAAGDFVE 177 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 +I L + V + ++ Y+T+V Sbjct: 178 TI-LSDAEEGRVSISIVPYSTKVNPGSAL 205 >gi|91216721|ref|ZP_01253686.1| batA protein [Psychroflexus torquis ATCC 700755] gi|91185190|gb|EAS71568.1| batA protein [Psychroflexus torquis ATCC 700755] Length = 334 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 54/297 (18%), Positives = 91/297 (30%), Gaps = 68/297 (22%) Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS-FIPALLRIEMGERPIFL 192 N ++T + M + L FI +L ++ + ++ + Sbjct: 35 QNASIKMSSTQGFKMSTLAKLR-PLLFILKMLALVLLTIAMARPRTVDVTTKVKKTEGID 93 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I + VD+S SM D + ++ ALK + F++ + + Sbjct: 94 IIMAVDISASMLAR---------------DLEPNRLEALKKVAINFIE-------GRPND 131 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ Y + + + S I T + + L K Sbjct: 132 RIGLVIYAGESYTKTPLTTDKSIIFNAINDLEYSQNIEGGTAIGMGLATSVNKLKDSKAE 191 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 S K II LTDGENN + T A E IK TI + Sbjct: 192 S------------------KVIILLTDGENNAGFIDPKTATQ--LATEYDIKTYTIGVGS 231 Query: 370 --NA------SPNGQ------------RLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 A NGQ LLKT S+ ++ + +++ I Sbjct: 232 NGMALSPVGIKANGQFEYRNIEVKIDEALLKTIAESNGGKYFRATDNQKFEAIYEEI 288 >gi|260061451|ref|YP_003194531.1| aerotolerance operon BatA [Robiginitalea biformata HTCC2501] gi|88785583|gb|EAR16752.1| BatA (Bacteroides aerotolerance operon) [Robiginitalea biformata HTCC2501] Length = 333 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 73/263 (27%), Gaps = 65/263 (24%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + I + +D+S SM D + ++ Sbjct: 70 VITAMARPQTQDISTRTKTTKGIDIVMAIDVSSSMLAR---------------DLRPNRL 114 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +ALK F+ + + +GL+ Y + V + + Sbjct: 115 SALKEVAAEFIRK-------RPNDRIGLVAYAGESYTKTPITSDKSIVLGALREITYGQL 167 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + + L S K II LTDG NN Sbjct: 168 NDGTAIGMGLATSVNRL------------------KESTAISKVIILLTDGVNNAGFIEP 209 Query: 349 NTIKICDKAKENFIKIVTISINASPN----------------------GQRLLKTCVSS- 385 T D A E IK TI + + N + LL ++ Sbjct: 210 QTAA--DLALEYGIKTYTIGLGTNGNALSPIGYNPDGSFRYGMRQVEIDEELLTDIATAT 267 Query: 386 PEYHYNVVNADSLIHVFQNISQL 408 ++ + + L +++ I++L Sbjct: 268 GGEYFRATDNEKLEAIYEEINKL 290 >gi|332307030|ref|YP_004434881.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] gi|332174359|gb|AEE23613.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] Length = 338 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 80/235 (34%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM ++ ++ +K+ L F+ + Sbjct: 88 DLMIAVDLSGSM----------KIDDMQVNGRQVDRLQMIKSVLHDFIQR-------RVG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ E V Q + + L+ + T A+ A + S K+ Sbjct: 131 DRLGLIFFADTAYLQAPLTYDRETVSQLLGESLIGLVGEQTAIGDAIGLAIKRFQSKKES 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + K +I LTDG+N + + A N + + TI + A Sbjct: 191 N------------------KVLILLTDGQNTAGNISPQQAN--ELAINNGVTLYTIGVGA 230 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + L + S+ ++ +A+SL ++ + +L Sbjct: 231 DQMMVQSIFGSRQVNPSQELDESMLTQLAESTGGRYFRARDAESLKAIYDKLDEL 285 >gi|325678004|ref|ZP_08157643.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324110284|gb|EGC04461.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 812 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 83/221 (37%), Gaps = 52/221 (23%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + D SGSM +DP+ K A+ F+DS+ L Sbjct: 243 LAAAICYDCSGSMS---GNDPKGYR----------------KLAIDNFIDSMTLTD---- 279 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 LI + + E S E+++ V T+ +++ A + L + Sbjct: 280 --KTALISFEDEAKLVSEFSDNKEELKGLVNPYF----GGGTNVRASVEMAIEQLNTV-- 331 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGE---NNNFKSNVNTIKICDKAKENFIKIVTI 367 + + II L+DG+ N N +N I KA +N IKI TI Sbjct: 332 ---------------QHWYTRHIILLSDGDVNININLANNTVDDLI-KKAVDNNIKIHTI 375 Query: 368 SINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQ 407 + + + Q+ LK C + ++ A+ L +++++S+ Sbjct: 376 GLGSGADNQK-LKDCAEYTGGQYFTAETAEKLDAIYKDLSK 415 >gi|327403932|ref|YP_004344770.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327319440|gb|AEA43932.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 341 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 40/239 (16%), Positives = 75/239 (31%), Gaps = 64/239 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D SGSM QD ++ K F+DS ++ Sbjct: 102 IDIILSIDASGSMLA---------------QDFDPNRLEVAKRVAKKFVDS-------RK 139 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ Y + + +++ ++ + T + A L SD Sbjct: 140 GDRVGLVVYEGEAYTACPATLDYKLLKEQISAIEPGHLEPGTAIGSGLGVAVTRLRSDS- 198 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 K II LTDG + + +++ + AK ++ TI + Sbjct: 199 -----------------LISKVIILLTDG---SSNTGPEPLEVAELAKAKKCRVYTIGVG 238 Query: 370 ---NASPN----------------GQRLLKTCVSSP-EYHYNVVNADSLIHVFQNISQL 408 A + +LK S+ ++ + SL ++ I +L Sbjct: 239 ADGMAPTPVNTPFGVVYQNLPVEIDEGVLKEIASATNGKYFRAQDEKSLEKIYAEIDKL 297 >gi|226326038|ref|ZP_03801556.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] gi|225205580|gb|EEG87934.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] Length = 275 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 69/193 (35%), Gaps = 28/193 (14%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + + LK A F + S E + L+ + + + K Y Sbjct: 69 TTNDANDRFYYLKQAATNFTTQLAQSSPNSE---IALVTFNKTATEQFDFK-NVGKDSAY 124 Query: 280 VTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +T ++++ T + +AY+IL +D+ S +++++ LTD Sbjct: 125 ITETINAMETSGGTHQNEGLDRAYKILNNDQNTSN---------------LKRYVVLLTD 169 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASP------NGQRLLKTCVSSPEYHYNV 392 G N + T I +K K K++T+ + + L+ + YN Sbjct: 170 GCPNGVTYDQITTSI-NKIKSTNTKLITVGVGLDETNTGLKAAKDYLQA-NADDNMAYNA 227 Query: 393 VNADSLIHVFQNI 405 +A L +F I Sbjct: 228 NDASHLNTIFTQI 240 >gi|237716505|ref|ZP_04546986.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262408103|ref|ZP_06084651.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|294645097|ref|ZP_06722823.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294809498|ref|ZP_06768201.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|298484179|ref|ZP_07002345.1| BatA protein [Bacteroides sp. D22] gi|229444152|gb|EEO49943.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262354911|gb|EEZ04003.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292639603|gb|EFF57895.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294443316|gb|EFG12080.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|295084189|emb|CBK65712.1| von Willebrand factor type A domain. [Bacteroides xylanisolvens XB1A] gi|298269683|gb|EFI11278.1| BatA protein [Bacteroides sp. D22] Length = 327 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 43/276 (15%), Positives = 82/276 (29%), Gaps = 63/276 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y + F+ + +++ + + E I L +D+S SM Sbjct: 51 YLLHVPFLLRCIALVLVILVLARPQTTNKWQNSEIEGIDIMLAIDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K+ F++ +G+ + + Sbjct: 104 --------DLKPNRLEAAKDVAAEFIN----GRPNDN---IGITLFAGETFTQCPLTVDH 148 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + LI T + A L K +S K I Sbjct: 149 AVLLDMIHNIKCGLITDGTAVGMGIANAVTRLKDSKAKS------------------KVI 190 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------- 372 I LTDG NN + T + AK I++ TI + A Sbjct: 191 ILLTDGTNNKGDISPMTAA--EIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMPVE 248 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ ++ + L V++ I +L Sbjct: 249 IDEKTLTEIAGTTDGNYFRATSNSKLKEVYEEIDKL 284 >gi|325927915|ref|ZP_08189139.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] gi|325541755|gb|EGD13273.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] Length = 338 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 78/238 (32%), Gaps = 54/238 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM + K ++ A K L FLD Sbjct: 96 REARQMMLAVDLSGSMS----------EPDMVLGGKVVDRLTAAKAVLSDFLDR------ 139 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GL+ + R + VR ++ + L + T A+ + + L Sbjct: 140 -RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLSDSVVGLAGRETAIGDAIALSVKRLRE 198 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ Q+ ++ LTDG N N +K + AK +++ TI Sbjct: 199 QKQG------------------QRVVVLLTDGVNTAGALNP--LKAAELAKAEGVRVHTI 238 Query: 368 SINA-----------SPNGQR------LLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + G L K + + + + L ++ + +L Sbjct: 239 AFGGSGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYAELDRL 296 >gi|154486447|ref|ZP_02027854.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] gi|154084310|gb|EDN83355.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] Length = 882 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 46/275 (16%), Positives = 88/275 (32%), Gaps = 49/275 (17%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 V A+ + LV+D+SGSM P+ + + ++ A Sbjct: 171 TVNVDVKGAVNSTTVTTTQPIDFTLVLDVSGSMD------------DPMSKTDRTRRLDA 218 Query: 231 LKNALLLFLDSIDLLSH--VKEDVYMGLIGY----TTRVEKNIEPSWGTEKVRQYVTRDM 284 LK A+ FLD + E V++GL+ + T ++ ++ S G + ++ Sbjct: 219 LKEAVKAFLDEAANTNTEAGSELVHVGLVKFAGDKTDKIGDDMYRSGGYTYNYSQIVSNL 278 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + + +K A + F G +K +IF DG + Sbjct: 279 TADMNGLKNKVSKLKAA---GATRADNGFNRAVKVMGSASARTDAKKVVIFFADGSPTSS 335 Query: 345 KS-----NVNTIKICDKAKENFIKIVTISINASP-------NGQRLLKTCVS-------- 384 ++ + K+ + +I I AS N + + S Sbjct: 336 SGFEGKVANKAVEAAKELKDGGAAVYSIGIFASANPSSLSSNENQFMHAVSSNFPKATKY 395 Query: 385 --------SPEYHYNVVNADSLIHVFQNISQLMVH 411 Y+ + NA L +F I + Sbjct: 396 NQLGEGNIEAGYYKSATNASELNTIFDEIEKSETT 430 >gi|332519334|ref|ZP_08395801.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332045182|gb|EGI81375.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 334 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 50/264 (18%), Positives = 79/264 (29%), Gaps = 66/264 (25%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 I + + I + +D+S SM D K ++ Sbjct: 70 IITAIARPRTVDVSTKTKTTRGIDIVMAIDVSASMLAK---------------DLKPNRL 114 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSL 287 ALKN F+ + + +GL+ Y + V + + +++ Sbjct: 115 EALKNVASEFIK-------GRPNDRIGLVEYAGESYTKTPITSDKSIVLRSLQEIRYNNI 167 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 I T + A L K +S K II LTDG NN+ N Sbjct: 168 IEGGTAIGMGLATAVNRLKDSKAKS------------------KVIILLTDGVNNSGSIN 209 Query: 348 VNTIKICDKAKENFIKIVTIS----------INASPNGQ------------RLLKTCVS- 384 + A E IK TI I NGQ LLK Sbjct: 210 PKIAS--ELAVEFGIKTYTIGLGTNGMALSPIAIKQNGQFQYGRVKVEIDETLLKEIAQV 267 Query: 385 SPEYHYNVVNADSLIHVFQNISQL 408 + ++ N L ++ I++L Sbjct: 268 TGGKYFRATNNKKLAEIYDEINKL 291 >gi|149026142|gb|EDL82385.1| rCG29121 [Rattus norvegicus] Length = 905 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 54/367 (14%), Positives = 126/367 (34%), Gaps = 67/367 (18%) Query: 46 WHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKES 105 + +KN +E+ + + G + +V +K L + FI N S Sbjct: 175 YLSRKNVVEATRCSTDITG-TNVVHECQGGSCVTRKCRRDSKTGLPEPKCTFIPNK---S 230 Query: 106 LSGYSAVFYNTEIQNIVN--SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH 163 + +++ + + ++V + + T N + N +++ ++ + Sbjct: 231 QTARASIMFLQSLDSVVEFCTEKTHNTEAPNLQNKICNGRSTWDVIKESADFQHAPPMRG 290 Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + P + + + + LV+D SGSM Sbjct: 291 ---------TEAPPPPTFSLLKSRQRV--VCLVLDKSGSMSS----------------GD 323 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT-RVEKNIEPSWGTEKVRQYVTR 282 T++ + A L+L I ++++ +GL+ + + + +N + ++ Sbjct: 324 PITRLTLMNQAAELYLIQI-----LEKESLVGLVTFDSIAIVQNNLIRMINDSSYLEISA 378 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T +++ ++ +TS + + + I+ LTDGE++ Sbjct: 379 KLPQEAAGGTSICNGLRKGFETITSSDQSTCGSE----------------IVLLTDGEDD 422 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNVVNADSLI 399 S C ++ K + I TI++ + L L ++ Y + LI Sbjct: 423 QISS-------CFEEVKHSGAVIHTIAL-GPDAARELETLSD-MTGGRRFYASEGINGLI 473 Query: 400 HVFQNIS 406 F IS Sbjct: 474 DAFSGIS 480 >gi|116487355|ref|NP_001070824.1| chloride channel calcium activated 4-like [Rattus norvegicus] gi|116013527|dbj|BAF34587.1| calcium-activated chloride channel [Rattus norvegicus] Length = 905 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 54/367 (14%), Positives = 126/367 (34%), Gaps = 67/367 (18%) Query: 46 WHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKES 105 + +KN +E+ + + G + +V +K L + FI N S Sbjct: 175 YLSRKNVVEATRCSTDITG-TNVVHECQGGSCVTRKCRRDSKTGLPEPKCTFIPNK---S 230 Query: 106 LSGYSAVFYNTEIQNIVN--SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH 163 + +++ + + ++V + + T N + N +++ ++ + Sbjct: 231 QTARASIMFLQSLDSVVEFCTEKTHNTEAPNLQNKICNGRSTWDVIKESADFQHAPPMRG 290 Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + P + + + + LV+D SGSM Sbjct: 291 ---------TEAPPPPTFSLLKSRQRV--VCLVLDKSGSMSS----------------GD 323 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT-RVEKNIEPSWGTEKVRQYVTR 282 T++ + A L+L I ++++ +GL+ + + + +N + ++ Sbjct: 324 PITRLTLMNQAAELYLIQI-----LEKESLVGLVTFDSIAIVQNNLIRMINDSSYLEISA 378 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T +++ ++ +TS + + + I+ LTDGE++ Sbjct: 379 KLPQEAAGGTSICNGLRKGFETITSSDQSTCGSE----------------IVLLTDGEDD 422 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNVVNADSLI 399 S C ++ K + I TI++ + L L ++ Y + LI Sbjct: 423 QISS-------CFEEVKHSGAVIHTIAL-GPDAARELETLSD-MTGGRRFYASEGINGLI 473 Query: 400 HVFQNIS 406 F IS Sbjct: 474 DAFSGIS 480 >gi|104780882|ref|YP_607380.1| hypothetical protein PSEEN1727 [Pseudomonas entomophila L48] gi|95109869|emb|CAK14574.1| conserved hypothetical protein; Willebrand factor type A domain protein [Pseudomonas entomophila L48] Length = 358 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 67/183 (36%), Gaps = 37/183 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM Q + +++ +K + FL ++ Sbjct: 91 DLLVAVDVSGSMD----------FPDMQWQGDEISRLDLVKALMGDFLQ-------DRQG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAQIGIAGKNTAIGDAIGLAVKRLRQRPAQ 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + +I +TDG NN + + T A + ++I TI I A Sbjct: 194 S------------------RVLILITDGANNGGQIHPLTAAR--LAAQEGVRIYTIGIGA 233 Query: 372 SPN 374 +P Sbjct: 234 NPE 236 >gi|85705211|ref|ZP_01036310.1| hypothetical protein ROS217_17122 [Roseovarius sp. 217] gi|85670084|gb|EAQ24946.1| hypothetical protein ROS217_17122 [Roseovarius sp. 217] Length = 580 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 13/76 (17%), Positives = 28/76 (36%), Gaps = 3/76 (3%) Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEYHYNVVNADSLI 399 + N +C K + + +I NG + L C SSP +++ + + Sbjct: 505 SGSMKNGLMQNVCKATKTEGVVVYSIGFEVPVNGTAENQLSACASSPAHYFRA-SGTDIK 563 Query: 400 HVFQNISQLMVHRKYS 415 F I+ + + + Sbjct: 564 SAFSAIAANVKQLRLT 579 >gi|310657503|ref|YP_003935224.1| hypothetical protein CLOST_0189 [Clostridium sticklandii DSM 519] gi|308824281|emb|CBH20319.1| exported protein of unknown function [Clostridium sticklandii] Length = 466 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 66/172 (38%), Gaps = 21/172 (12%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + G++ + I LV+D SGSM M+ ++++ L+N +D Sbjct: 188 QTKAGKKVVAAITLVLDESGSMANNMSGGRV--------TSSNPSRISILRNRAKALIDQ 239 Query: 242 IDLLSHVKEDVYMGLIGYTTRVE----KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 L + +Y+G+I Y+ K+ + GT V + T++ A Sbjct: 240 FSGLGN----IYVGIIPYSDDAYISGTKSFVLANGTN-VNTIKNKIDSLTAQGMTNTGDA 294 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 M+ +Y + K+ N + ++I L+DG+ F + Sbjct: 295 MRVSY---YATKQFKDSPNSIDNTLPT-DTKVIPYMILLSDGDPTVFSATTR 342 >gi|269968855|ref|ZP_06182838.1| hypothetical protein VMC_42680 [Vibrio alginolyticus 40B] gi|269826535|gb|EEZ80886.1| hypothetical protein VMC_42680 [Vibrio alginolyticus 40B] Length = 356 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 78/224 (34%), Gaps = 33/224 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K +++ A K L F+ S ++ Sbjct: 107 DVMVVVDLSGSMA---------EPDFTSRTGEKISRLDAAKEVLSEFVQS-------RKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + + + ++ + T A+ A + + Sbjct: 151 DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAI-KVFEQSDK 209 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S Q +K I LTDG N+ S V I AK +++ I++ Sbjct: 210 SSGALEQDQNR-------EKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRVHVIAMGD 260 Query: 372 SPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G+ L + S + +N D L + I +L Sbjct: 261 PETIGETALDMETIHRIAKESGGEAFEALNRDELSAAYDEIGKL 304 >gi|289178041|gb|ADC85287.1| Fibronectin-binding protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 2710 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 45/246 (18%), Positives = 84/246 (34%), Gaps = 43/246 (17%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 ++ ++ + MD Y Y + + + V ++ R+ + D Sbjct: 151 TSGSMAWGMDGDDEYAYDPVYAADI--TTSKRYYVRVSGSMTRVYSSANGWYY-----DA 203 Query: 200 SGSMH------CAMNSDPEDVNSAPICQDKKR-TKMAALKNALLLFLDSI----DLLSHV 248 GS H A +SD + + T+M ALK A+ F+D +S Sbjct: 204 GGSRHYVTPKTSAADSDAAHTQFYSRRRLTTQDTRMYALKQAVNGFIDQTIAANAKVSDP 263 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +GL+ Y + V + ++ V S T + M+ A +L + Sbjct: 264 NKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVDDLKAS---GATRADLGMQTANTVLGNA 320 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE---NNNFKSNVNTIKI--CDKAKENFIK 363 K +IF TDG+ +N F+++V I K N Sbjct: 321 -----------------RADASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGAS 363 Query: 364 IVTISI 369 + ++ I Sbjct: 364 VYSVGI 369 >gi|159043014|ref|YP_001531808.1| von Willebrand factor type A [Dinoroseobacter shibae DFL 12] gi|157910774|gb|ABV92207.1| von Willebrand factor type A [Dinoroseobacter shibae DFL 12] Length = 320 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 65/178 (36%), Gaps = 37/178 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +V+DLSGSM V D+ T++ A+K A+ Sbjct: 92 DLAIVLDLSGSM----------VRDDFNLDDRAVTRLEAVK-AVGADFARRRAGD----- 135 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + L+ + + ++ TE V + + + + T + + A + L++ Sbjct: 136 -RLALVVFGSEAYFASPFTFDTESVARRIEEATIGISGRATSISDGLGLALKRLSTSTAT 194 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 S + +I L+DG NN +N + + A +++ TI++ Sbjct: 195 S------------------RVVILLSDGINNAGATNPRGVA--ELAARYGVRVHTIAL 232 >gi|32474857|ref|NP_867851.1| chloride channel [Rhodopirellula baltica SH 1] gi|32445397|emb|CAD75398.1| conserved hypothetical protein-putative chloride channel [Rhodopirellula baltica SH 1] Length = 900 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 72/236 (30%), Gaps = 54/236 (22%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + E+P + LV+D SGSM K+ K+A ++ + Sbjct: 455 EKEREKPSLAMMLVIDKSGSMGGQ--------------------KIELAKDAAQAAVELL 494 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 +G+I + E T T+ PAM AY Sbjct: 495 GPKDA------IGVIAFDGDSYTVSELR-STSDRGAISDAISTIEASGGTNMYPAMADAY 547 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + L + K +I +TDG S + + + I Sbjct: 548 EALLGATAKL------------------KHVILMTDG----VSSPGDFQGVAGDMSASRI 585 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + T+++ + + LL+ +Y + S+ VF ++ V S I Sbjct: 586 TLSTVAL-GQGSSEDLLEELAQIGGGRYYFCDDPQSVPQVF---AKETVEASKSAI 637 >gi|78049050|ref|YP_365225.1| hypothetical protein XCV3494 [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037480|emb|CAJ25225.1| putative membrane protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 451 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 78/238 (32%), Gaps = 54/238 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM + K ++ A K L FLD Sbjct: 209 REARQMMLAVDLSGSMS----------EPDMVLGGKVVDRLTAAKAVLSDFLDR------ 252 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GL+ + R + VR ++ + L + T A+ + + L Sbjct: 253 -RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLSDSVVGLAGRETAIGDAIALSVKRLRE 311 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ Q+ ++ LTDG N N +K + AK +++ TI Sbjct: 312 QKQG------------------QRVVVLLTDGVNTAGVLNP--LKAAELAKAEGVRVHTI 351 Query: 368 SINA-----------SPNGQR------LLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + G L K + + + + L ++ + +L Sbjct: 352 AFGGSGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYAELDRL 409 >gi|309790845|ref|ZP_07685389.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308227132|gb|EFO80816.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 885 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 32/242 (13%), Positives = 77/242 (31%), Gaps = 50/242 (20%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + +RP + V+D SGSM K+ K AL+ + + Sbjct: 403 DREQRPDLALVFVIDRSGSMAEPAG---------------NVQKLDIAKEALVQAIRMLY 447 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 +G++ + ++ + + G + + + T+ + + Sbjct: 448 GED------RVGIVTFDSQAYTTMPITQGVGE-EEVLQAIASVTADGGTNIGAGLSAGQR 500 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 +LT + + K +I LTDG + + + + + + I Sbjct: 501 MLTGVEAKI------------------KHMILLTDG----WGEGNDQLAVVEAMRAQGIT 538 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADS----LIH-VFQNISQLMVHRKYSVIL 418 + ++ S + L + +Y + L+ +L+V R++ + Sbjct: 539 LSVVA-AGSDTAEELKTLATAGGGRYYAAAIMQAVPQILVDETITVAGKLIVERRFVPVA 597 Query: 419 KG 420 G Sbjct: 598 TG 599 >gi|183601829|ref|ZP_02963198.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|241190320|ref|YP_002967714.1| hypothetical protein Balac_0261 [Bifidobacterium animalis subsp. lactis Bl-04] gi|241195726|ref|YP_002969281.1| hypothetical protein Balat_0261 [Bifidobacterium animalis subsp. lactis DSM 10140] gi|183218714|gb|EDT89356.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|240248712|gb|ACS45652.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis Bl-04] gi|240250280|gb|ACS47219.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis DSM 10140] gi|295793307|gb|ADG32842.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis V9] Length = 2696 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 45/246 (18%), Positives = 84/246 (34%), Gaps = 43/246 (17%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 ++ ++ + MD Y Y + + + V ++ R+ + D Sbjct: 137 TSGSMAWGMDGDDEYAYDPVYAADI--TTSKRYYVRVSGSMTRVYSSANGWYY-----DA 189 Query: 200 SGSMH------CAMNSDPEDVNSAPICQDKKR-TKMAALKNALLLFLDSI----DLLSHV 248 GS H A +SD + + T+M ALK A+ F+D +S Sbjct: 190 GGSRHYVTPKTSAADSDAAHTQFYSRRRLTTQDTRMYALKQAVNGFIDQTIAANAKVSDP 249 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +GL+ Y + V + ++ V S T + M+ A +L + Sbjct: 250 NKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVDDLKAS---GATRADLGMQTANTVLGNA 306 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE---NNNFKSNVNTIKI--CDKAKENFIK 363 K +IF TDG+ +N F+++V I K N Sbjct: 307 -----------------RADASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGAS 349 Query: 364 IVTISI 369 + ++ I Sbjct: 350 VYSVGI 355 >gi|107102622|ref|ZP_01366540.1| hypothetical protein PaerPA_01003686 [Pseudomonas aeruginosa PACS2] Length = 340 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 54/224 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM QD + +++ +K F++ + Sbjct: 91 DLLLAVDVSGSMD----------YRDMRWQDDEISRLELIKKLFGDFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRLRQRPAE 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG N + A E +KI TI I A Sbjct: 194 S------------------RVLVLITDGANTGGQIAPQIAAQ--LAAEQQVKIYTIGIGA 233 Query: 372 SPN------------GQRL----LKTCV-SSPEYHYNVVNADSL 398 P G L L+ S+ ++ ++ L Sbjct: 234 DPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAEL 277 >gi|91225506|ref|ZP_01260628.1| hypothetical protein V12G01_09265 [Vibrio alginolyticus 12G01] gi|91189869|gb|EAS76142.1| hypothetical protein V12G01_09265 [Vibrio alginolyticus 12G01] Length = 356 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 79/224 (35%), Gaps = 33/224 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K +++ A K L F+ S ++ Sbjct: 107 DVMVVVDLSGSMA---------EPDFTSRTGEKISRLDAAKEVLSEFVQS-------RKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + + + ++ + T A+ A + Sbjct: 151 DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAI--------K 202 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 F + +G +K I LTDG N+ S V I AK +++ I++ Sbjct: 203 VFEQSDQSRGALEQDQNREKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRVHVIAMGD 260 Query: 372 SPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G+ L + S + +N D L + I +L Sbjct: 261 PETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGKL 304 >gi|15598269|ref|NP_251763.1| hypothetical protein PA3073 [Pseudomonas aeruginosa PAO1] gi|9949180|gb|AAG06461.1|AE004731_9 hypothetical protein PA3073 [Pseudomonas aeruginosa PAO1] Length = 340 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 54/224 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM QD + +++ +K F++ + Sbjct: 91 DLLLAVDVSGSMD----------YRDMRWQDDEISRLELIKKLFGDFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRLRQRPAE 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG N + A E +KI TI I A Sbjct: 194 S------------------RVLVLITDGANTGGQIAPQIAAQ--LAAEQQVKIYTIGIGA 233 Query: 372 SPN------------GQRL----LKTCV-SSPEYHYNVVNADSL 398 P G L L+ S+ ++ ++ L Sbjct: 234 DPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAEL 277 >gi|254241773|ref|ZP_04935095.1| hypothetical protein PA2G_02484 [Pseudomonas aeruginosa 2192] gi|126195151|gb|EAZ59214.1| hypothetical protein PA2G_02484 [Pseudomonas aeruginosa 2192] Length = 340 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 54/224 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM QD + +++ +K F++ + Sbjct: 91 DLLLAVDVSGSMD----------YRDMRWQDDEISRLELIKKLFGDFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRLRQRPAE 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG N + A E +KI TI I A Sbjct: 194 S------------------RVLVLITDGANTGGQIAPQIAAQ--LAAEQQVKIYTIGIGA 233 Query: 372 SPN------------GQRL----LKTCV-SSPEYHYNVVNADSL 398 P G L L+ S+ ++ ++ L Sbjct: 234 DPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAEL 277 >gi|269926840|ref|YP_003323463.1| von Willebrand factor type A [Thermobaculum terrenum ATCC BAA-798] gi|269790500|gb|ACZ42641.1| von Willebrand factor type A [Thermobaculum terrenum ATCC BAA-798] Length = 918 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 50/298 (16%), Positives = 99/298 (33%), Gaps = 47/298 (15%) Query: 110 SAVFYNTEIQNIVNSSRISMTHMAN---NRLDSSNNTIFYNMDVMTSYDYRLQF-IEHLL 165 V N ++I ++ + N N L + T+ + + + L Sbjct: 319 DTVDSNDIPKDISTLAKYDAVVLVNVPANSLQDAGKTLQVYVHDLGKGLVAIGGDRAFAL 378 Query: 166 NQRYNQKIVSFIPALLRI-EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 +N + +P +I E P + + +D SGSM + + P Sbjct: 379 GGYFNTPLEQTLPVDSQIRNPDEEPQVAVVMAIDKSGSMAACHCEGSKLLEQYPGGIP-- 436 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR---VEKNIEPSWGTEKVRQYVT 281 K+ K + + S + L G++ + T V + + + + V Sbjct: 437 --KVDIAKESA---ILSSETLGPNDIF---GVVAFDTAPRWVVRPEPVT-DKSSIAEKVA 487 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S T+ + +A L K ++ K +I LTDG + Sbjct: 488 GIQGS---GGTNIYGGLAEAIDSLIKVKAKN------------------KHVILLTDGWS 526 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSL 398 N + + KA+ + I I T+S A+ +LL++ YN ++ + Sbjct: 527 NVGNYDE----LISKARRHGITISTVS--AAGGSAQLLRSIAEKGGGTFYNTRDSADI 578 >gi|219682744|ref|YP_002469127.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] gi|219620394|gb|ACL28551.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] Length = 2582 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 45/246 (18%), Positives = 84/246 (34%), Gaps = 43/246 (17%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 ++ ++ + MD Y Y + + + V ++ R+ + D Sbjct: 151 TSGSMAWGMDGDDEYAYDPVYAADI--TTSKRYYVRVSGSMTRVYSSANGWYY-----DA 203 Query: 200 SGSMH------CAMNSDPEDVNSAPICQDKKR-TKMAALKNALLLFLDSI----DLLSHV 248 GS H A +SD + + T+M ALK A+ F+D +S Sbjct: 204 GGSRHYVTPKTSAADSDAAHTQFYSRRRLTTQDTRMYALKQAVNGFIDQTIAANAKVSDP 263 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +GL+ Y + V + ++ V S T + M+ A +L + Sbjct: 264 NKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVDDLKAS---GATRADLGMQTANTVLGNA 320 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE---NNNFKSNVNTIKI--CDKAKENFIK 363 K +IF TDG+ +N F+++V I K N Sbjct: 321 -----------------RADASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGAS 363 Query: 364 IVTISI 369 + ++ I Sbjct: 364 VYSVGI 369 >gi|163754426|ref|ZP_02161548.1| BatA (Bacteroides aerotolerance operon) [Kordia algicida OT-1] gi|161325367|gb|EDP96694.1| BatA (Bacteroides aerotolerance operon) [Kordia algicida OT-1] Length = 335 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 42/278 (15%), Positives = 80/278 (28%), Gaps = 68/278 (24%) Query: 157 RLQFIEHLLNQ-RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + F+ +L + + + I + +D+S SM Sbjct: 57 PILFVLRMLAIMALITALARPQTKEVSTRIKTNKGIDIVMAIDVSASMLSK--------- 107 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 D + ++ ALK F++ + +GL+ Y + Sbjct: 108 ------DLRPNRLTALKKVAAEFIE-------GRPSDRIGLVVYAGESFTKTPITTDKSI 154 Query: 276 VRQYV--TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 ++ + + ++ T + A L K +S K I Sbjct: 155 IQNALKDIKYKHGELIGGTAIGMGLATAVNRLKDSKAKS------------------KVI 196 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----------------------NA 371 I LTDG NN + A E IK TI I Sbjct: 197 ILLTDGVNNAGFIEPQIAS--ELAVEYGIKTYTIGIGTNGMASTPVALNPDGTILFRNMQ 254 Query: 372 SPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 ++LL+ ++ ++ N L ++ I++L Sbjct: 255 VEIDEKLLQQIAKTTGGKYFRATNTKKLAEIYDEINKL 292 >gi|146337717|ref|YP_001202765.1| hypothetical protein BRADO0586 [Bradyrhizobium sp. ORS278] gi|146190523|emb|CAL74522.1| hypothetical protein BRADO0586 [Bradyrhizobium sp. ORS278] Length = 418 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 59/394 (14%), Positives = 118/394 (29%), Gaps = 77/394 (19%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R Y +K ++ N +++FA++ + L +G I + + ++ A + A+LA Sbjct: 1 MDRLLRYLRKFGHDQRGNIAVLFAIACVPVLAFVGAGIDYSMANKLRTKLQMAIDEAVLA 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G + + L + +I+ + +G +A T N Sbjct: 61 GVAAGKAALDSGATQAAAIAMAQAAS-------------SSYFTGNTAKIDATPTINFT- 106 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + T+ + + + N S Sbjct: 107 ---------------TMGRTLSGTGSATSVMNTSFMRLVGFPTMTLNASSASSATM---- 147 Query: 184 EMGERPIFLIELVVDLSGSMH--------CAMNSDPEDVNSAPICQDKKRTKMAALKN-- 233 +P + L+VD+S SM M + + + + ALKN Sbjct: 148 ----QPYLNVYLLVDISSSMLLPATQAGITQMRNGTGCALACHETTNGTDSYSYALKNNV 203 Query: 234 ---------ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + L ++ + K V +GL + ++ + V + Sbjct: 204 LLRYQVVNQGVQNLLTYLNSSAVYKNYVKVGLWSFDNQLTQLSS----LTSSFSSVAANF 259 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + L D+ A + L S T P QK +I TDG N+ Sbjct: 260 PAPGLAYNDAAAA--TPFDSLIGSFVSSVGTAGDGSTSATP----QKLVIIATDGVNDPT 313 Query: 345 KSNVN-----------TIKICDKAKENFIKIVTI 367 ++ + C+ K N + + I Sbjct: 314 RAWTSQTSLRSQVRVFNTAFCNTFKSNGVTVAII 347 >gi|220922748|ref|YP_002498050.1| hypothetical protein Mnod_2796 [Methylobacterium nodulans ORS 2060] gi|219947355|gb|ACL57747.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 135 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 17/74 (22%), Positives = 30/74 (40%), Gaps = 5/74 (6%) Query: 347 NVNTIKICDKAKENFIKIVTISINAS-----PNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + T + AK I + TI + G LL C SS + +A++LI Sbjct: 61 DALTTQAYTNAKAAGISVYTIGFSTPSDSIDDKGLSLLSNCASSSSQAFVANDANTLISA 120 Query: 402 FQNISQLMVHRKYS 415 F I++ + + + Sbjct: 121 FNQIAKSVGSLRLT 134 >gi|284029570|ref|YP_003379501.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283808863|gb|ADB30702.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 654 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 45/241 (18%), Positives = 85/241 (35%), Gaps = 42/241 (17%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 ++F+ A GE + +V+D SGSM T+M A K Sbjct: 18 LTFVNASTAAAAGELS--PVMVVLDSSGSMTARDAGGSG-------------TRMDAAKR 62 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 A+ + + +GL + S G EKV + P + Sbjct: 63 AVGSM------VDGLPAGAQVGLA-----IYGAGTGSSGAEKVAG-CKDVRVVQPVGPVN 110 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 PA+K+A + ++P Q+ I+ ++DGE+ Sbjct: 111 -KPALKRAVTATKASGYTPIGQALRTAAAQLPK-EGQRSIVLVSDGEDTCAPPQP----- 163 Query: 354 CDKAK---ENFIK--IVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNIS 406 C+ AK + + + TI + L C++ + +++ +ADSL+ V ++ Sbjct: 164 CEVAKELSKQGVDLHVHTIGFRVDAKARAQL-ACIAQNTGGTYHDASDADSLLGVLGRVT 222 Query: 407 Q 407 + Sbjct: 223 E 223 >gi|237737388|ref|ZP_04567869.1| BatA protein [Fusobacterium mortiferum ATCC 9817] gi|229421250|gb|EEO36297.1| BatA protein [Fusobacterium mortiferum ATCC 9817] Length = 319 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 79/238 (33%), Gaps = 61/238 (25%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ++ I + +DLS SM + +D K ++ K L F+D ++ Sbjct: 78 KKEGIDIVVALDLSQSM---------------LQRDFKPNRLETAKKLLEEFID--KRIN 120 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQI 304 + L+ + + ++ V+ ++ I T + + Sbjct: 121 D-----RISLVVFGGDAYTKVPLTFDHNVVKDITSKLTTDDITSNNRTAIGMGLGVSLNR 175 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L + +S K II +TDGENN+ + + + AKE IKI Sbjct: 176 LKDSEAKS------------------KVIILMTDGENNSGEMSPMGAS--EIAKELGIKI 215 Query: 365 VTISINA----------------SPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 TI I A + + LLK S + ++ + +F I Sbjct: 216 YTIGIGAREIQIRVPFGHTTVKNTELDENLLKNIASTTGGEYFRAGSEKEFQEIFNRI 273 >gi|307292639|ref|ZP_07572485.1| hypothetical protein SphchDRAFT_0111 [Sphingobium chlorophenolicum L-1] gi|306880705|gb|EFN11921.1| hypothetical protein SphchDRAFT_0111 [Sphingobium chlorophenolicum L-1] Length = 540 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 33/158 (20%), Positives = 52/158 (32%), Gaps = 35/158 (22%) Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A ++L+ D S + P +++IF+TDG + SN Sbjct: 387 PNGGTWLDVGLLWAARLLSRDGLWSTENDELYHTY-----PVSRYVIFMTDGYMSIGSSN 441 Query: 348 -------------------------VNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + C K KI TIS A L C Sbjct: 442 YAAYAQEDYWRRVAAAGASKNDNHYARMLMTCTAIKNMDTKIYTISFGAGSTLDSNLINC 501 Query: 383 VSS-----PEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 SS PE+ Y ++ L VF++I + + + S Sbjct: 502 SSSTNTTNPEFAYKADSSSDLNRVFRDIGENIGSLRLS 539 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 42/313 (13%), Positives = 89/313 (28%), Gaps = 76/313 (24%) Query: 4 LSRFRFYFKKGIASEKAN-FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 + R F + ++ N +I+ A + ++ G + + + K ++ A +A +L Sbjct: 5 IKRALFILMRLYRNQAGNTLAIVAAAMLPLAGMVGG-ALDISRGYLAKTRLQQACDAGVL 63 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 AG M S+ G +S+ + ++ + + + ++ +L+ Sbjct: 64 AGRKVMGSS----GVLSDSVRDEVRKYV---SFNYPSGYLGSTLATTDI----------- 105 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 N SN+ I ++ + L + S Sbjct: 106 ------------NPTLGSNDQIALSLTTAIP-----TAVMRLFGRNNMSITASCT----- 143 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP----------------------IC 220 + I LV+D +GSM C + D ++ + Sbjct: 144 -ARNDYSNIDIVLVLDTTGSMACKPERNDSDCSTWAGSRYVTQWVAGLGRDATFVPEEMN 202 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT-TRVEKNIEPSWGTEKVRQY 279 ++M L+ AL + + + T K W Q Sbjct: 203 SGVNVSRMQGLRTALANLQSQMATIE----------TQFNMTEESKRKRVRWAIVPFSQM 252 Query: 280 VTRDMDSLILKPT 292 V T Sbjct: 253 VNAGFSQGSAGTT 265 >gi|260901770|ref|ZP_05910165.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ4037] gi|308108909|gb|EFO46449.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ4037] gi|328470487|gb|EGF41398.1| protein BatA [Vibrio parahaemolyticus 10329] Length = 356 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 79/224 (35%), Gaps = 33/224 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K +++ A K L F+ S ++ Sbjct: 107 DVMVVVDLSGSMA---------EPDFTSRTGEKISRLDAAKEVLTEFVQS-------RKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + + + ++ + T A+ A + Sbjct: 151 DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAI--------K 202 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 F + +G +K I LTDG N+ S V I AK +++ I++ Sbjct: 203 VFEQSDKSRGALEQDQNREKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRVHVIAMGD 260 Query: 372 SPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G+ L + S + +N D L + I +L Sbjct: 261 PETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGKL 304 >gi|153836342|ref|ZP_01989009.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|149750244|gb|EDM60989.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] Length = 356 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 79/224 (35%), Gaps = 33/224 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K +++ A K L F+ S ++ Sbjct: 107 DVMVVVDLSGSMA---------EPDFTSRTGEKISRLDAAKEVLTEFVQS-------RKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + + + ++ + T A+ A + Sbjct: 151 DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAI--------K 202 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 F + +G +K I LTDG N+ S V I AK +++ I++ Sbjct: 203 VFEQSDKSRGALEQDQNREKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRVHVIAMGD 260 Query: 372 SPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G+ L + S + +N D L + I +L Sbjct: 261 PETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGKL 304 >gi|28900543|ref|NP_800198.1| hypothetical protein VPA0688 [Vibrio parahaemolyticus RIMD 2210633] gi|260365425|ref|ZP_05777962.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus K5030] gi|260877490|ref|ZP_05889845.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AN-5034] gi|260894838|ref|ZP_05903334.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus Peru-466] gi|28808923|dbj|BAC62031.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308085296|gb|EFO34991.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus Peru-466] gi|308090935|gb|EFO40630.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AN-5034] gi|308114289|gb|EFO51829.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus K5030] Length = 356 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 79/224 (35%), Gaps = 33/224 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K +++ A K L F+ S ++ Sbjct: 107 DVMVVVDLSGSMA---------EPDFTSRTGEKISRLDAAKEVLTEFVQS-------RKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + + + ++ + T A+ A + Sbjct: 151 DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAI--------K 202 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 F + +G +K I LTDG N+ S V I AK +++ I++ Sbjct: 203 VFEQSDKSRGALEQDQNREKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRVHVIAMGD 260 Query: 372 SPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G+ L + S + +N D L + I +L Sbjct: 261 PETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGKL 304 >gi|330959358|gb|EGH59618.1| von Willebrand factor, type A [Pseudomonas syringae pv. maculicola str. ES4326] Length = 353 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 41/250 (16%), Positives = 86/250 (34%), Gaps = 60/250 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM ++ + +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMD----------YPDMQWKNDEVSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGTQAFLQAPLTYDRRTVRVWLDEAKIGIAGKNTAIGDAIGLALKRLRMRPAN 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG NN + + T A + +KI T+ I + Sbjct: 194 S------------------RVLVLVTDGANNGGQIDPITAAR--LAADEGVKIYTVGIGS 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSL------IHVFQNISQL 408 P+ L LK S S ++ + L + + ++Q Sbjct: 234 DPDKNALQGVLGLNPSLDLDEPTLKDIASLSGGQYFRARDGAELDKIRTALDALEPVAQQ 293 Query: 409 MVHRKYSVIL 418 + + +L Sbjct: 294 PTQARPAQVL 303 >gi|299534564|ref|ZP_07047896.1| hypothetical protein BFZC1_01007 [Lysinibacillus fusiformis ZC1] gi|298729937|gb|EFI70480.1| hypothetical protein BFZC1_01007 [Lysinibacillus fusiformis ZC1] Length = 864 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 43/236 (18%), Positives = 82/236 (34%), Gaps = 60/236 (25%) Query: 165 LNQRYNQKIVSFIPALLRIEMGER-PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L + I + +P + I+ E+ P + +V+D SGSM Sbjct: 380 LGGYFKTPIETLLPVEMEIKGKEQLPSLGLVIVLDRSGSM-------------------- 419 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT---TRVEKNIEPSWGTEKVRQYV 280 + +K+ K A ++ + +G I + + + S + V Sbjct: 420 QGSKLELAKEAAARSVEMLRDEDT------LGFIAFDDRPWEIIETGPLS----SKEEAV 469 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T+ ++ +AY+ L K + +K II LTDG+ Sbjct: 470 DTILSVTPGGGTEIYSSLAKAYENLADLKLQ------------------RKHIILLTDGQ 511 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKTCVSSPEYHYNVVN 394 N + + KE+ I + T++I +A N L S Y+V++ Sbjct: 512 ----SQAGNYEDLITEGKEDGITLSTVAIGQDADANLLEALSDMGS--GRFYDVID 561 >gi|254481548|ref|ZP_05094792.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214038176|gb|EEB78839.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 345 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 64/182 (35%), Gaps = 37/182 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM D+ +++ A+K F++ +E Sbjct: 92 DLMLAVDISGSM----------KIEDMEVSDELVSRIRAVKQVGSRFIEQ-------REG 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + S+ T V++++ + T A+ A + L Sbjct: 135 DRLGLILFGSNAYVQSPLSFDTATVKRFLLEAQIGFAGQDTAIGDAIGLAVKRLKERPAE 194 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I L+DG++ V + A + I+I TI I A Sbjct: 195 N------------------RVLILLSDGKDTASS--VQPLNAAKLAADLGIRIYTIGIGA 234 Query: 372 SP 373 Sbjct: 235 DS 236 >gi|58580793|ref|YP_199809.1| hypothetical protein XOO1170 [Xanthomonas oryzae pv. oryzae KACC10331] gi|58425387|gb|AAW74424.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 335 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 78/238 (32%), Gaps = 54/238 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM+ + K ++ A K L FLD Sbjct: 96 REARQMMLAVDLSGSMN----------EPDMVLGGKVVDRLTAAKAVLSDFLDR------ 139 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GL+ + R + VR + + L + T A+ + + L Sbjct: 140 -RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRLRE 198 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ Q+ ++ LTDG N + +K + AK ++I TI Sbjct: 199 QKQG------------------QRVVVLLTDGVNTAGVLDP--LKAAELAKAEGVRIHTI 238 Query: 368 SINA-----------SPNGQR------LLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + G L K + + + + L ++ + +L Sbjct: 239 AFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYAELDRL 296 >gi|254472518|ref|ZP_05085918.1| conserved hypothetical protein [Pseudovibrio sp. JE062] gi|211958801|gb|EEA94001.1| conserved hypothetical protein [Pseudovibrio sp. JE062] Length = 479 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 22/111 (19%), Positives = 43/111 (38%), Gaps = 6/111 (5%) Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-----NTIKICDKAK 358 L +K + K+ + +K I E + S +C K Sbjct: 363 KLIKNKSDVQNSQDPMDSGKLDADDHKK-IASKVKWEYDYSSSLSGVPFKTASNLCKNMK 421 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + I I T+ + G+++++ C S+ E Y+ N +LI F +I+ + Sbjct: 422 KEDIVIYTVFFGSDYKGKKIMEECASNSETFYHATNQSALIQAFSSIANDI 472 >gi|166713250|ref|ZP_02244457.1| hypothetical protein Xoryp_17865 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 335 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 78/238 (32%), Gaps = 54/238 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM+ + K ++ A K L FLD Sbjct: 96 REARQMMLAVDLSGSMN----------EPDMVLGGKVVDRLTAAKAVLSDFLDR------ 139 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GL+ + R + VR + + L + T A+ + + L Sbjct: 140 -RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRLRE 198 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ Q+ ++ LTDG N + +K + AK ++I TI Sbjct: 199 QKQG------------------QRVVVLLTDGVNTAGVLDP--LKAAELAKAEGVRIHTI 238 Query: 368 SINA-----------SPNGQR------LLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + G L K + + + + L ++ + +L Sbjct: 239 AFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYAELDRL 296 >gi|254229828|ref|ZP_04923234.1| von Willebrand factor, type A [Vibrio sp. Ex25] gi|262395606|ref|YP_003287459.1| protein BatA [Vibrio sp. Ex25] gi|151937664|gb|EDN56516.1| von Willebrand factor, type A [Vibrio sp. Ex25] gi|262339200|gb|ACY52994.1| protein BatA [Vibrio sp. Ex25] Length = 356 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 79/224 (35%), Gaps = 33/224 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K +++ A K L F+ S ++ Sbjct: 107 DVMVVVDLSGSMA---------EPDFTSRTGEKISRLDAAKEVLTEFVQS-------RKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + + + ++ + T A+ A + Sbjct: 151 DRLGLVLFGDAAFVQTPFTVDQKVWLELLNQTDVAMAGQSTHLGDAIGLAI--------K 202 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 F + +G +K I LTDG N+ S V I AK +++ I++ Sbjct: 203 VFEQSDKSRGALEQDQNREKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRVHVIAMGD 260 Query: 372 SPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G+ L + S + +N D L + I +L Sbjct: 261 PETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGKL 304 >gi|225621320|ref|YP_002722578.1| von Willebrand factor type A (vWA) domain-containing protein [Brachyspira hyodysenteriae WA1] gi|225216140|gb|ACN84874.1| von Willebrand factor type A (vWA) domain containing protein [Brachyspira hyodysenteriae WA1] Length = 289 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 84/240 (35%), Gaps = 66/240 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I LVVD+S SM D T++ A K ++ F+ + Sbjct: 49 ISLVVDVSPSMMAE---------------DMIPTRLEASKKTMIDFIKK-------RNFD 86 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + L+ + R ++ + + + + ++ T + A +L S K+ + Sbjct: 87 KISLVSFALRASVLSPATFDYTSLEEEI-KKIEIDEEGSTSIGLGIATAVDMLRSVKEDN 145 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS---- 368 +K II LTDGENN+ + + + A IKI TI Sbjct: 146 -----------------EKIIILLTDGENNSGEIDPKLAS--EIASNFNIKIYTIGIGDA 186 Query: 369 --------INASPNGQRLLKT------------CVSSPEYHYNVVNADSLIHVFQNISQL 408 + G+R ++ ++ ++N NA +L +V+ I +L Sbjct: 187 NGSHAWVTYDDPNYGKRRIRADFTLNEESLIDIAATTGGKYFNAKNASALDNVYNTIDRL 246 >gi|90424817|ref|YP_533187.1| hypothetical protein RPC_3326 [Rhodopseudomonas palustris BisB18] gi|90106831|gb|ABD88868.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 479 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 51/155 (32%), Gaps = 20/155 (12%) Query: 273 TEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 T + ++ L T+ + A+ L + + + Sbjct: 332 TSTAESTLKGKINKLDAEGNTNQPIGLFWAWMSLQTGVPLNTPAKDTEYK-------YTD 384 Query: 332 FIIFLTDGENNNFKS-------NVNTIKICDKAK---ENFIKIVTISINAS-PNGQRLLK 380 II L+DG+N + + K+CD K I TI +N + +LK Sbjct: 385 AIILLSDGDNTQSGNSNSVSAIDARQKKLCDNIKDPLNGTTTIFTIQVNTDGDDESAVLK 444 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 C S + AD + FQ+I + + + Sbjct: 445 YCASD-GQFFQSTTADQIEIAFQSIGSSLTKLRLA 478 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 29/249 (11%), Positives = 74/249 (29%), Gaps = 59/249 (23%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K A++ N +++F ++V+ + +G + +++M+ A ++A L + + Sbjct: 15 KAFHAADDGNIAVLFGIAVIPLISFVGVAVDYSRATAARSAMQGAADSATLMVSKDYAAG 74 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + R D + + K + + + Sbjct: 75 VIRASDIQATAEKYFKALYTSPGINNVTVTATYTARSANGS------------------- 115 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 + + TS F+ + S Sbjct: 116 --------------STVVMNTSGSMPTSFL-KVAGFTALPFTASSTSTWGAT------RL 154 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + +D++GSM K+ A+K A + ++++ + D Sbjct: 155 RVAMALDVTGSMDWD-------------------DKLTAMKTAAIKLVNTLKATASTDAD 195 Query: 252 VYMGLIGYT 260 VY+ +I + Sbjct: 196 VYISIIPFN 204 >gi|21228580|ref|NP_634502.1| putative chloride channel [Methanosarcina mazei Go1] gi|20907073|gb|AAM32174.1| putative chloride channel [Methanosarcina mazei Go1] Length = 1004 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 78/215 (36%), Gaps = 48/215 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D SGSM + +++ KN+ LF+D ++ Sbjct: 315 VMLVIDRSGSMSG--------------------SPISSAKNSANLFIDYMEAEDM----- 349 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKR 311 G++ +++ + + T +V+ + + ++S+ T M+ L + Sbjct: 350 -AGVVSFSSSARYDYHLATLTPEVKNSIKQKINSIYASGVTAIGSGMRYGLNDLLNYGDP 408 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + I+ L+DG N+ ++ N I K + I++ T+ + Sbjct: 409 NNPWA----------------IVLLSDGYQNSGENPNNVIP---SIKASNIQVYTVGL-G 448 Query: 372 SPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 Q+LL + +Y L ++ +I Sbjct: 449 PAVDQKLLGNIADQTGGKYYYSPTDSQLQEIYNDI 483 >gi|49087064|gb|AAT51411.1| PA3073 [synthetic construct] Length = 341 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 54/224 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM QD + +++ +K F++ + Sbjct: 91 DLLLAVDVSGSMD----------YRDMRWQDYEISRLELIKKLFGDFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRLRQRPAE 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG N + A E +KI TI I A Sbjct: 194 S------------------RVLVLITDGANTGGQIAPQIAAQ--LAAEQQVKIYTIGIGA 233 Query: 372 SPN------------GQRL----LKTCV-SSPEYHYNVVNADSL 398 P G L L+ S+ ++ ++ L Sbjct: 234 DPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAEL 277 >gi|212635209|ref|YP_002311734.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556693|gb|ACJ29147.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 360 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 51/272 (18%), Positives = 103/272 (37%), Gaps = 42/272 (15%) Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA----- 206 + + FI LL + V R+++G + +VVDLSGSM Sbjct: 59 SRWQQCALFIVWLLLVTAMTQPVILGEPQTRLQIGRD----LMVVVDLSGSMDTKDFTLH 114 Query: 207 ---MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 +D +S D+ +++ A+K L F + ++ +GLI + Sbjct: 115 VKQQTADGIANSSGTEISDEYISRLDAVKRVLHEFAEQ-------RQGDRLGLILFGDAA 167 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + + + ++ + T A+ A ++++SD+ +S N Sbjct: 168 YLQAPFTADLASWLRLLDESRVAMAGQSTHVGDALGLAIKVMSSDEIKSSQKN------- 220 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKT- 381 K ++ LTDG N+ S+V ++ A + I++ I+I G++ + Sbjct: 221 -------KVVLLLTDG--NDTDSSVPPLEAAKIAAKKGIRVHVIAIGDPQTVGEQAMDME 271 Query: 382 -----CVSSPEYHYNVVNADSLIHVFQNISQL 408 + + ++ L V+Q IS+L Sbjct: 272 VIEGVAALTGGKAFKAISTQELNKVYQTISKL 303 >gi|149376601|ref|ZP_01894361.1| hypothetical protein MDG893_00577 [Marinobacter algicola DG893] gi|149359119|gb|EDM47583.1| hypothetical protein MDG893_00577 [Marinobacter algicola DG893] Length = 340 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 77/235 (32%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+S SM + Q + ++ A+K L F+ +E Sbjct: 90 DLLLAVDISPSMD----------EQDMVIQGRSINRLQAVKRVLDDFIQR-------REG 132 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + T ++ VR + + + T A+ A + L + Sbjct: 133 DRLGLLLFGTEPYIQAPLTFDLATVRTLLHEAGIGMAGRATAIGDALGLAVKRLRDRPQE 192 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ +I LTDG N + + + AK ++I TI I A Sbjct: 193 ------------------QRVVILLTDGANTAGEIAPDKAA--EIAKAAGVRIYTIGIGA 232 Query: 372 ---------------SPNGQR---LLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 L + + ++ + L ++++I+QL Sbjct: 233 ETMVQRGLLGSRRVNPSRDLDEGLLTRIAQQTGGEYFRARSLPELELIYESINQL 287 >gi|297560911|ref|YP_003679885.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296845359|gb|ADH67379.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 315 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 40/251 (15%), Positives = 87/251 (34%), Gaps = 57/251 (22%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 +PA+ + ER I + VD+S SM D ++ A K Sbjct: 71 SMALPAMPVQQPRERA--TIMVAVDVSLSMAA---------------NDIDPNRLEAAKK 113 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PT 292 + F++++ + +GL+ +++ P+ Q V +++L L T Sbjct: 114 SAQGFVETL------PDRFNVGLVAFSSTATVVSSPT----HDHQAVIGSIENLQLGPGT 163 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + ++S + + P I+ L+DGEN + + + Sbjct: 164 AIGEGVFASLESISSFDEDADV------------DPPPSAIVLLSDGENTSGRDISQAVA 211 Query: 353 ICDKAKENFIKIVTISI------------NASPN-GQRLLKTCVSS-PEYHYNVVNADSL 398 + A E + + TI+ + + L+ S + Y + L Sbjct: 212 M---AAEQEVPVSTIAFGTGAAMIEIDGYQVPADIDKEALRGLASDTGGHFYEAESETEL 268 Query: 399 IHVFQNISQLM 409 V+++I + Sbjct: 269 DEVYEDIGSSL 279 >gi|289607418|emb|CBI60804.1| unnamed protein product [Sordaria macrospora] Length = 814 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 57/184 (30%), Gaps = 41/184 (22%) Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT------SDKKRSFFTNFFRQGVKI 324 Y+ + T M + L+ S + N KI Sbjct: 466 NNRSGFVSYLNGLVA---RGGTYHDIGMIWGARFLSTTGLFKSATPETNDVNDPDNPAKI 522 Query: 325 PSLPFQKFIIFLTDGENNNFKSN-----------------------------VNTIKICD 355 +K++IF+TDG+ + S+ C+ Sbjct: 523 RGFSVKKYMIFMTDGDMSPTWSDYSAYGIEYLDGRVMGSPTTDNTALLARHLQRFRMACN 582 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH-RKY 414 AK I I I+ + + + C S PE + + +LI F+ I + ++ Sbjct: 583 AAKAKGIDIWVIAFSTTLTAD--MTNCASKPEQAAGLSSNAALIAKFKEIGSKIATVLEF 640 Query: 415 SVIL 418 ++IL Sbjct: 641 ALIL 644 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 47/317 (14%), Positives = 101/317 (31%), Gaps = 60/317 (18%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + ++ N + L+++ + L+G + + + +N A +A LAG + Sbjct: 9 RLWSDQRGNTLALMTLALIPLVALMGSGLDMTRAYVAQNRFRQACDAGSLAG-----RRM 63 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 ++ + A + + D F + +++ + + I + + + T Sbjct: 64 LAGLTLPQAARDEATKYFMFD---FPQGYLQSAPYTLTMSVPTAGTLQISSQTTVPTTL- 119 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 L + L F+ Sbjct: 120 -----------------------MGLFGFDTLPISTTCSATQDFVNT------------D 144 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL----SHV 248 I V DLSGSM+CA V ++ ++M AL++A F D+++ + Sbjct: 145 IMFVFDLSGSMNCA-----PGVTGYCGDVEQSGSRMGALRSAATSFYDTLETAQSQLAAN 199 Query: 249 KEDVYMGLIGYTT-----RVEKNIEPSWGTEKVRQYVTRDMDSLILKPT-DSTPAMKQAY 302 + G + Y + R+ P W + Y +R D + ++ A AY Sbjct: 200 NLRLRYGFVNYNSTVNVGRILYEKNPDWMVQSW-SYQSRTPDWIDATAYFNNKSACNSAY 258 Query: 303 QILTSDKKRSFFTNFFR 319 + + R Sbjct: 259 TYDSQASQADANYTGSR 275 >gi|320333536|ref|YP_004170247.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] gi|319754825|gb|ADV66582.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] Length = 509 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 42/269 (15%), Positives = 87/269 (32%), Gaps = 53/269 (19%) Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 N V + D+ + L R + + + L+ +P I V+D+SGSM Sbjct: 280 NTTVPLTRDFPDALLLELPFPRSASTLDAVVSTYLQDT--RQPANTI-FVLDVSGSM--- 333 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV----KEDVYMGLIGYTTR 262 + ++ ALK AL + L+ + LI +++ Sbjct: 334 -----------------RGARIDALKTALRGLSGADTTLTGRYATFANRERVTLIPFSSA 376 Query: 263 --VEKNIEPSWGTEKVR-QYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + E + T + + +D+L T+ A++ AY+ + + + Sbjct: 377 PGAPRTTELTPATRGAALKQLRAQVDALTPDGGTNIYGALQAAYEQARAAPAGRYTS--- 433 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA--KENFIKIVTISINASPNGQ 376 I+ +TDGE S + +K T+ + Sbjct: 434 --------------IVLMTDGERTEGPSADQFRATYAALPERARQVKTFTVLF-GDSDAT 478 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + ++ N L F++I Sbjct: 479 EMNRIATLTGGRTFDGQN--DLRAAFKDI 505 >gi|16126967|ref|NP_421531.1| hypothetical protein CC_2734 [Caulobacter crescentus CB15] gi|221235756|ref|YP_002518193.1| hypothetical protein CCNA_02820 [Caulobacter crescentus NA1000] gi|13424325|gb|AAK24699.1| hypothetical protein CC_2734 [Caulobacter crescentus CB15] gi|220964929|gb|ACL96285.1| conserved hypothetical protein [Caulobacter crescentus NA1000] Length = 629 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 50/379 (13%), Positives = 103/379 (27%), Gaps = 48/379 (12%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNH------AKRALIDDAKRFIKNHIKESL-SGYSA 111 + +++ SN+ L + ++ ++ L D + +G Sbjct: 276 NGAIYTVTRLDSNVVYLNGIDTRLKSNSPSGGTIRKCLTSDCNLVVTTSAAHGFDTGDQI 335 Query: 112 VFYNTEIQNIVNSSRISMTHMANNRLDSSN---NTIFYNMDVMTSYDYRLQFIEHLLNQR 168 F N +N S+T + +N D+ T + + + L Sbjct: 336 SFANLGGLTTLNGQTYSITDLTSNTFDTGVPGLGTAAFTSGGTATCEQSTTPGCRRLAYV 395 Query: 169 YN------QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 N + + + + + V S D S C Sbjct: 396 SNWGTNEVRALSTCVSERTGADAYTDAAPSTAFV-------GTNYPSTSADSYSPNPCPS 448 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR--QYV 280 K T +++ K AL ++ + +GL V N W + R Y Sbjct: 449 AKITPLSSDKTALKAQIN--NYSVGGSTAGQIGLAWGWYMVAPNFGYIWPSASQRPAAYK 506 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 ++D+ +++ TD A Y + + + F Sbjct: 507 SKDLMKVVIMMTD--GAFNTPYCNGVIAANAGIGSGSDEDHINCNATNGDPF-------- 556 Query: 341 NNNFKSNVNTIKICDKAKE--NFIKIVTISINASPN--GQRLLKTCVSSPEYHYNVVNAD 396 +C K N I + T+ + + L C S + Sbjct: 557 -------AQARALCTVIKNSANDITLYTVGFAVGSDYTAKTFLTDCASDSSKAFFPATGS 609 Query: 397 SLIHVFQNISQLMVHRKYS 415 L F I++ + + + Sbjct: 610 ELKASFTAIAREISSLRIA 628 >gi|88798929|ref|ZP_01114511.1| hypothetical protein MED297_12762 [Reinekea sp. MED297] gi|88778409|gb|EAR09602.1| hypothetical protein MED297_12762 [Reinekea sp. MED297] Length = 322 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 82/237 (34%), Gaps = 52/237 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ++ + L VDLS SM + I + ++ A++ + F++ Sbjct: 80 DQRGRSLYLAVDLSESM----------LEQDMIWNQRPVSRYEAMQAVISEFVE------ 123 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + ++GL+ + + + + ++ + + T + A + L Sbjct: 124 -DRRGDFIGLVVFGSFADVQAPLTPDLNAIQSLLADLRPGMADSRTAIGDGLALAVRQL- 181 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + ++ L+DGENN+ + + A+ I++ T Sbjct: 182 -----------------RESTTEDRVVVLLSDGENNSGEIRPDEATAVAAAEN--IRVYT 222 Query: 367 ISI---------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I ++S + Q L + + +Y ++ L VF++I +L Sbjct: 223 IGFGSAGRDSLLQSFGLRSSSLDEQTLREIAEQTQGRYYRATSSAELAEVFRDIERL 279 >gi|296228122|ref|XP_002759734.1| PREDICTED: collagen alpha-4(VI) chain-like [Callithrix jacchus] Length = 2348 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 65/178 (36%), Gaps = 24/178 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTD 293 + L + V +GL Y + + ++ V + +L T+ Sbjct: 256 NFLYSVVSGLDVSSDRVRVGLAQYNDDIYPAFQL--NQHPLKSMVLEQIQNLPYRTGGTN 313 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ + S+ T K + +I +TDGE+N+ Sbjct: 314 TGNALEF--------IRTSYLTEGSGSRAKD---RVPQIVILVTDGESNDEVQEAA---- 358 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 D+ KE+ I + + IN + L+ S P ++ +N+ N + L +I Q + Sbjct: 359 -DQLKEDGIVVYVVGINVQDVQE--LQKIASEPFEKFLFNIENFNILQDFSGSILQTL 413 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 72/200 (36%), Gaps = 24/200 (12%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 E+V D+ S C NS + V + +K + ++ ++ + + Sbjct: 1007 EVVRDICSSEACK-NSKADIVFLIDGSESISPEDFERMKRFVESMVNQSNI---GTDSIQ 1062 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +GL+ +++ ++ + + KV Y + T + A+ Sbjct: 1063 IGLLQFSSIPKEEFRLNRYSSKVDIYNAIFAVQQMRDGTRTGKALN-------------- 1108 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 FT F + K Q+++I +TDG + NV ++ I I I + + Sbjct: 1109 FTLPFFESSKGGRPSVQQYLIVITDG---VAQDNVILPA--KALRDKNIIIFAIGVGEAK 1163 Query: 374 NGQRLLKTCVSSPEYHYNVV 393 Q LL+ +Y+V Sbjct: 1164 KSQ-LLEITNDEDRVYYDVD 1182 >gi|32394600|gb|AAM93998.1| proximal thread matrix protein 1 [Griffithsia japonica] Length = 218 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 32/157 (20%), Positives = 54/157 (34%), Gaps = 27/157 (17%) Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 +D+ +D Y +G+ + V+ P+ V T+ + Sbjct: 56 VDAAKEFDDRTKDSYFSAVGFASGVKLIQAPTQSLSTFNTAVNTVSPLN--GGTNIFRGL 113 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + YQ L + + +I +TDG I C+ K Sbjct: 114 RGCYQQLKTKPMT------------------DRVLILVTDG------FGGQPINYCNFIK 149 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 I +VT+ I S N Q LK C +S E++ NV + Sbjct: 150 SKGILLVTVGIGTSIN-QNFLKNCATSEEFYINVKDT 185 >gi|167624593|ref|YP_001674887.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] gi|167354615|gb|ABZ77228.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] Length = 345 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 41/232 (17%), Positives = 76/232 (32%), Gaps = 55/232 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + K + + +++ + F++ ++ Sbjct: 85 DLMLSVDLSGSM----------QIEDMVLDGKVVDRFSLIQHVISDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V QY+ L+ K T A+ A + ++ Sbjct: 128 DRIGLILFADHAYLQSPLTQDRRTVAQYLKEAQIGLVGKQTAIGEAIALAVKR-FDKVEQ 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + +I LTDG NN + A + I I TI + A Sbjct: 187 SN-----------------RVLILLTDGSNNAGAISPEQATQ--IAAKRGITIYTIGVGA 227 Query: 372 SPNGQRLL------------------KTCVSSPEYHYNVVNADSLIHVFQNI 405 +R L + ++ ++ N + L ++Q I Sbjct: 228 DVMERRTLFGKERVNPSMDLDESQLQEIAKTTGGQYFRARNTEELEQIYQVI 279 >gi|172087820|ref|YP_001816750.1| hypothetical protein VF_A1192 [Vibrio fischeri ES114] gi|171902402|gb|ACB55718.1| conserved hypothetical protein containing von Willebrand factor type A domain [Vibrio fischeri ES114] Length = 350 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 44/229 (19%), Positives = 80/229 (34%), Gaps = 35/229 (15%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ + +VVDLSGSM K +++ A+K L F+ + Sbjct: 97 EQFGRDVMVVVDLSGSMA---------EKDFTSTNGLKISRLDAVKKVLNDFVKT----- 142 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + + + + + + K T A+ + Sbjct: 143 --RKGDRLGLILFGDAAFVQTPFTADHKVWLDLLNQTHVEMAGKSTHLGDAIGLTIKRFE 200 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + +K I LTDG N+ S V + AK ++I Sbjct: 201 DSNNSQ----------PLSTTSREKVAIILTDG--NDTDSYVPPMDAAKVAKVKGVRIHM 248 Query: 367 ISINASPN-GQRLLKT------CVSSPEYHYNVVNADSLIHVFQNISQL 408 I+I G++ L +S + +N D LI+ + IS+L Sbjct: 249 IAIGDPQTVGEQALDMDTINTIADASGGQAFQALNQDELINAYAEISKL 297 >gi|269961128|ref|ZP_06175496.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269834079|gb|EEZ88170.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 362 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 78/224 (34%), Gaps = 35/224 (15%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K +++ A K L F + ++ Sbjct: 107 DVMVVVDLSGSMA---------EQDFTSKTGEKISRLDAAKEVLSDFAKT-------RKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + + ++ + T A+ A ++ +K Sbjct: 151 DRLGLILFGDAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSEKS 210 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ K I LTDG N+ S V I AK ++I I++ Sbjct: 211 RTDVEESKE----------KVAIVLTDG--NDTGSFVEPIDAAKVAKAKDVRIHVIAMGD 258 Query: 372 SPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G+ L + S + +N D L + I +L Sbjct: 259 PQTVGETALDMNTIKRIAKESGGEAFEALNRDELAKAYDEIGKL 302 >gi|262193845|ref|YP_003265054.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262077192|gb|ACY13161.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 346 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 58/190 (30%), Gaps = 33/190 (17%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R I +VVD SGSM +D +T++ +K+ F+ Sbjct: 81 VGENTIRREGIAIMMVVDTSGSMRALDLADGGL----------DQTRLEVVKDVFRAFVA 130 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT--RDMDSLILKPTDSTPAM 298 D L D +GL+ + + + + + + T + Sbjct: 131 GEDGLDGRSNDT-IGLVSFAGFADTRCPLTLNHGSLLTILDDLEIVRERAEDGTAIGDGL 189 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A + L + S + II LTDG NN + A Sbjct: 190 GLAVERLRESEASS------------------RVIILLTDGVNNAGIETPLEAA--ELAS 229 Query: 359 ENFIKIVTIS 368 IK+ TI Sbjct: 230 RLGIKVYTIG 239 >gi|218890727|ref|YP_002439591.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa LESB58] gi|254236045|ref|ZP_04929368.1| hypothetical protein PACG_02002 [Pseudomonas aeruginosa C3719] gi|126167976|gb|EAZ53487.1| hypothetical protein PACG_02002 [Pseudomonas aeruginosa C3719] gi|218770950|emb|CAW26715.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa LESB58] Length = 340 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 73/224 (32%), Gaps = 54/224 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM Q+ + +++ +K F++ + Sbjct: 91 DLLLAVDVSGSMD----------YRDMRWQEDEISRLELIKKLFGDFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRLRQRPAE 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG N + A E +KI TI I A Sbjct: 194 S------------------RVLVLITDGANTGGQIAPQIAAQ--LAAEQQVKIYTIGIGA 233 Query: 372 SPN------------GQRL----LKTCV-SSPEYHYNVVNADSL 398 P G L L+ S+ ++ ++ L Sbjct: 234 DPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAEL 277 >gi|317154611|ref|YP_004122659.1| von Willebrand factor type A [Desulfovibrio aespoeensis Aspo-2] gi|316944862|gb|ADU63913.1| von Willebrand factor type A [Desulfovibrio aespoeensis Aspo-2] Length = 395 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 66/423 (15%), Positives = 133/423 (31%), Gaps = 79/423 (18%) Query: 32 SFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALI 91 L + G + + + + +++A +A LAG+ L D + I ++A+ Sbjct: 1 MLLAVAGLAVDMGNMYVTHTRLQAAVDAGALAGSL----ELPYDPDLSKGI---VQQAVS 53 Query: 92 DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 D + + + ES+S + V + + ++ + +N + Sbjct: 54 DMIHTNMPDAVVESVSPGTEVRSVVVTAKAKVNLLVMGFLNLADQWVEAGAAAGFNKLEI 113 Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV-----VDLSGSMHCA 206 ++ K S L I G++P + LV V L G + Sbjct: 114 VFVIDNSGSMKG--TPINLVKEASIGLTDLLIPDGQQPDTKVGLVAFRGKVRLGGDVDGL 171 Query: 207 MNSDPEDVNSAPICQDKKRTKMA-ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 S + M AL + +D +D S + E Sbjct: 172 EAGCRNADGSVNTGIHEDFMSMYWALSSYYRNQID-LDTCSSIPE--------------- 215 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + S + + + T + +K A +LT + + + Sbjct: 216 SRPLSQDKGDIVEGINSQTALGSASGTVISEGIKWARHMLTPEAPYTQAGDKK------- 268 Query: 326 SLPFQKFIIFLTDGE----------------NNNFKS------------------NVNTI 351 F+K +I LTDG+ NN + + N + + Sbjct: 269 --DFRKIMIVLTDGDTEDGECGGSYRASFRPNNYWTNAYYGMGVDTAHCQDGGVLNQDML 326 Query: 352 KICDKAKENFIKIVTISINASPN-GQRLLKTCVSSP----EYHYNVVNADSLIHVFQNIS 406 AK+ I+I I S N L+K SS +++++ + + VF+ I Sbjct: 327 AEAQLAKDEGIEIFAIRFGVSDNTDISLMKQIASSKAGTNDHYFDAPSVYDIPDVFKKIG 386 Query: 407 QLM 409 + + Sbjct: 387 KQL 389 >gi|170720775|ref|YP_001748463.1| von Willebrand factor type A [Pseudomonas putida W619] gi|169758778|gb|ACA72094.1| von Willebrand factor type A [Pseudomonas putida W619] Length = 358 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 36/183 (19%), Positives = 69/183 (37%), Gaps = 37/183 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM +D++ +++ +K L FL +E Sbjct: 91 DLLVAVDVSGSMD----------FPDMQWKDEEVSRLDLVKALLGDFLQ-------DREG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAQIGIAGKNTAIGDAIGLAVKRLRQRPAQ 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG NN + + T A + ++I TI I A Sbjct: 194 S------------------RVLVLITDGANNGGRIHPLTAAR--LAAQEDVRIYTIGIGA 233 Query: 372 SPN 374 +P Sbjct: 234 NPE 236 >gi|114563846|ref|YP_751360.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] gi|114335139|gb|ABI72521.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] Length = 334 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 66/235 (28%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + L +L Sbjct: 84 DLMIAVDLSGSMQIEDMVINGQTVNRFT-----------LIQHVLSDFIERRKGD----- 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V ++ L+ K T A+ A + + Sbjct: 128 -RLGLILFADHAYLQAPLTLDRRSVATFLDDAQIGLVGKQTAIGEAIALAVKRFDKVDES 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN A + I I TI + A Sbjct: 187 N------------------RVLILLTDGSNNAGNIEPEVAAQ--IAAKRNITIYTIGVGA 226 Query: 372 SPNGQ------------------RLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + +L K + ++ N++ L ++Q I +L Sbjct: 227 EILERRTIFGKERINPSMDLDEDQLKKLAAMTKGRYFRARNSEELASIYQEIDKL 281 >gi|260774144|ref|ZP_05883059.1| protein BatA [Vibrio metschnikovii CIP 69.14] gi|260611105|gb|EEX36309.1| protein BatA [Vibrio metschnikovii CIP 69.14] Length = 322 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 42/242 (17%), Positives = 76/242 (31%), Gaps = 55/242 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 +P + LVVDLS SM+ +D ++ A+K L F+ Sbjct: 79 THSQPHRDMMLVVDLSYSMNQKDMQIGDDY----------IDRLTAVKQVLSDFIAQ--- 125 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ +GLI + + V Q + + + LI T + A + Sbjct: 126 ----RQGDRLGLIFFADHAYLQTPLTLDRTTVAQQLNQAVLRLIGTQTAIGDGIGLATKT 181 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 I S Q+ +I L+DG NN + AK+ I Sbjct: 182 F------------------IESDAPQRVMILLSDGSNNAGVLDPIEAA--HIAKQYHTTI 221 Query: 365 VTISI-----------------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNIS 406 T+ + A + L+ + ++ + L ++Q I Sbjct: 222 YTVGVGAGEMMVRDFFMTRRINTAEDLDEETLQKIADLTGGQYFRARDKHDLQTIYQTID 281 Query: 407 QL 408 +L Sbjct: 282 KL 283 >gi|84498071|ref|ZP_00996868.1| hypothetical protein JNB_18328 [Janibacter sp. HTCC2649] gi|84381571|gb|EAP97454.1| hypothetical protein JNB_18328 [Janibacter sp. HTCC2649] Length = 651 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 32/215 (14%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 L++D SGSM TK+ A K AL + ++ + + Sbjct: 43 LMLDASGSMKAK--------------DPSGLTKIEAAKKALTGVVGAL------PDTAQV 82 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 GL Y V+ +P+ Q + K T + A+K + Sbjct: 83 GLRVYGATVDGKGKPTPAACADTQLIHPIAALDKTKLTTTIAAIKA--------LGETPI 134 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 + + +K ++ I+ ++DGE + +K A ++I T+ + Sbjct: 135 AHSLTEALKDLGTSGKRNIVLVSDGEESCVPDPCPIVKKLTAA-GVDLQIDTVGFGVNAK 193 Query: 375 GQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQ 407 + L+ C++ +Y+ +A +L +SQ Sbjct: 194 ARTQLQ-CIADAGKGTYYDAKDAGALATSLNKLSQ 227 >gi|330829762|ref|YP_004392714.1| von Willebrand factor, type A [Aeromonas veronii B565] gi|328804898|gb|AEB50097.1| von Willebrand factor, type A [Aeromonas veronii B565] Length = 347 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 80/245 (32%), Gaps = 35/245 (14%) Query: 173 IVSFIPALLRIEMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 +V+ + R F + +V+DLSGSM K +++ A Sbjct: 78 VVALAKPTIYGPPQVRERFGRDVMIVLDLSGSMA---------ETDFSPDPGKSLSRLDA 128 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K + + + +E +GLI + + E + + ++ + Sbjct: 129 AK-------EVLKQFAATREGDRLGLILFGDAAFLQAPFTADLETWQTLLQETDVAMAGQ 181 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T A+ A + F + S +K I LTDG + + Sbjct: 182 STHLGDAIGLAI--------KVFNNSDRHGQQDQNSAKREKVAIILTDGNDTGSFVSPRD 233 Query: 351 IKICDKAKENFIKIVTISINASP--NGQRL----LKTCVS-SPEYHYNVVNADSLIHVFQ 403 A N +++ TI++ Q L L+ + + + ++ L +Q Sbjct: 234 AAR--VAAVNGVRLHTIAMGDPATVGEQALDLDTLQQLATLTGGQLFQALDEAQLTRAYQ 291 Query: 404 NISQL 408 I +L Sbjct: 292 VIGEL 296 >gi|153833319|ref|ZP_01985986.1| von Willebrand factor, type A [Vibrio harveyi HY01] gi|148870455|gb|EDL69376.1| von Willebrand factor, type A [Vibrio harveyi HY01] Length = 363 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 78/224 (34%), Gaps = 35/224 (15%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K +++ A K L F + ++ Sbjct: 107 DVMVVVDLSGSMA---------EQDFTSKNGEKISRLDAAKEVLSDFAKT-------RKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + + ++ + T A+ A ++ +K Sbjct: 151 DRLGLILFGDAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSEKS 210 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ K I LTDG N+ S V I AK ++I I++ Sbjct: 211 RTDVEESKE----------KVAIVLTDG--NDTGSFVEPIDAAKVAKAKDVRIHVIAMGD 258 Query: 372 SPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G+ L + S + +N D L + I +L Sbjct: 259 PQTVGETALDMNTIKRIAKESGGEAFEALNRDELAKAYDEIGKL 302 >gi|310814568|ref|YP_003962532.1| von Willebrand factor, type A [Ketogulonicigenium vulgare Y25] gi|308753303|gb|ADO41232.1| von Willebrand factor, type A [Ketogulonicigenium vulgare Y25] Length = 1160 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 58/185 (31%), Gaps = 43/185 (23%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 E P + V+D SGSM + T++ K A+ + +L Sbjct: 743 PHEGPGIAMVFVLDRSGSMSQTVGDV---------------TRLDVAKQAVSA---AANL 784 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 L + +G++ + + E + T+ P ++ A+Q Sbjct: 785 LDP--QTGSLGVVMFGSEAEVALPLG-PLPDAAGIAAALGHLQPGGGTNIYPGLQLAFQA 841 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L + + + I+ +TDG ++ I + I + Sbjct: 842 LRASDADA------------------RHIVVMTDGMSDEADFPGLLAAI----RAEGITV 879 Query: 365 VTISI 369 +++I Sbjct: 880 SSVAI 884 >gi|328541712|ref|YP_004301821.1| hypothetical protein SL003B_0088 [polymorphum gilvum SL003B-26A1] gi|326411464|gb|ADZ68527.1| hypothetical protein SL003B_0088 [Polymorphum gilvum SL003B-26A1] Length = 454 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 51/390 (13%), Positives = 121/390 (31%), Gaps = 15/390 (3%) Query: 35 LLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDA 94 L + + K++ + +A L+GA+ + +L+ + D + AL+ Sbjct: 70 LSVSIMTDSELDKAIKDAFAANMASAGLSGAT--LGDLTYVLDPDAGTISATATALVPTY 127 Query: 95 KRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT-- 152 + E+++ ++ ++ + + +T N + S +D++ Sbjct: 128 FIHVGGLGPENVAIAASADATYSRFDVELAMVVDVTGSMRNSMASLRTAAQSVVDILIPD 187 Query: 153 -SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 + + L+ + + P + + G + + + N Sbjct: 188 GTKKSASKVRIALVPYSQGVNLGEYAPKVSNGDAGTQNCVTERMGNEKYTDATYNYNGTS 247 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN--IEP 269 + +M L + +I L I + Sbjct: 248 SEFFGGGSNSCASTPQMEPLTSKRNTLTSAISKLKDNGRTAGQTGIAWGWYALSPKWSNL 307 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 Y D+ L TD + Y T+ + N+ K + Sbjct: 308 WPNDSVPGSYTDSDILKFALIMTD--GDFNEYYDKATAQSNCKWQFNWST--FKWEQVCD 363 Query: 330 QKFI-IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKTCVSSP 386 ++ ++ + S+ +C K+ I++ +I NA+ G +++K C SS Sbjct: 364 SSYVWTAYSEAAGYSNVSSTRAKTLCAAIKQTGIQVYSIYFGSNANSAGAKVMKDCASST 423 Query: 387 -EYHYNVVNADSLIHVFQNISQLMVHRKYS 415 E + + LI F I+ + + S Sbjct: 424 KETFFMATSDSELIAAFAKIANKIQNIYLS 453 >gi|291514852|emb|CBK64062.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 341 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 68/183 (37%), Gaps = 43/183 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L VD+S SM D + ++ K A+ + + + Sbjct: 92 MMLTVDVSNSMLAE---------------DFEPNRLERTKYAIGKLFEGL-------QQD 129 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + R + R SL+ + T A++QA + D ++ Sbjct: 130 RVGLVVFAGEPKVQLPITSDYRMARAFARRIDPSLVSVQGTAIGKALEQALLAFSGDTEQ 189 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + II +TDGEN++ + I + ++A + +KI TI I Sbjct: 190 SHG----------------RVIILITDGENHDDDA----IAVAERAAQMGVKIFTIGIGT 229 Query: 372 SPN 374 Sbjct: 230 PEG 232 >gi|329928982|ref|ZP_08282792.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328937234|gb|EGG33661.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 899 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 75/223 (33%), Gaps = 58/223 (26%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E P + LV+D SGSM K+ K + + ++ + Sbjct: 403 EIPSLGLILVIDRSGSMDGN--------------------KIELAKESAMRTVELMRAKD 442 Query: 247 HVKEDVYMGLIGYTTR---VEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAY 302 V G++ + + V + ++ V + S+ T+ PA+ A Sbjct: 443 TV------GVVAFDDQPWWVVPPQKL-----GDKEEVLSSIQSIPSAGGTNIYPAVSSAL 491 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + ++ II +TDG + S + D EN I Sbjct: 492 EEMLKIDAQ------------------RRHIILMTDG-QSAMNSGYQDLT--DTMVENKI 530 Query: 363 KIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQN 404 + ++++ LL++ + +Y V + +L VF Sbjct: 531 TMSSVAVGM-DADTNLLQSLADAAKGRYYFVEDETTLPAVFSR 572 >gi|261408991|ref|YP_003245232.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285454|gb|ACX67425.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 1007 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 75/223 (33%), Gaps = 58/223 (26%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E P + LV+D SGSM K+ K + + ++ + Sbjct: 403 EIPSLGLILVIDRSGSMDGN--------------------KIELAKESAMRTVELMRAKD 442 Query: 247 HVKEDVYMGLIGYTTR---VEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAY 302 V G++ + + V + ++ V + S+ T+ PA+ A Sbjct: 443 TV------GVVAFDDQPWWVVPPQKL-----GDKEEVLSSIQSIPSAGGTNIYPAVSSAL 491 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + ++ II +TDG + S + D EN I Sbjct: 492 EEMLKIDAQ------------------RRHIILMTDG-QSAMNSGYQDLT--DTMVENKI 530 Query: 363 KIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQN 404 + ++++ LL++ + +Y V + +L VF Sbjct: 531 TMSSVAVGM-DADTNLLQSLADAAKGRYYFVEDETTLPAVFSR 572 >gi|300853773|ref|YP_003778757.1| hypothetical protein CLJU_c05730 [Clostridium ljungdahlii DSM 13528] gi|300433888|gb|ADK13655.1| hypothetical protein CLJU_c05730 [Clostridium ljungdahlii DSM 13528] Length = 419 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 43/220 (19%), Positives = 83/220 (37%), Gaps = 44/220 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + I V+D SGSM N+DP++ + + ++L+ + Sbjct: 111 QKVNDIVFVIDTSGSMA---NTDPQNERFSSV----------------------LNLMDN 145 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQIL 305 + + + + ++ I + +E +++ ++ + T+ A+ AY + Sbjct: 146 MNTQNRVSIYKFDDTSKRIIPMTEVSESLKKNAEEELKQYEIPAGNTNMGEAIDSAYNEI 205 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S K+ +I L+DGE+N F N + K++ I I Sbjct: 206 NSTKRPGRKAA----------------VILLSDGEDN-FGLNKKFDETLKPFKDSNISIY 248 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 TI ++ N L K + +YNV NA L F I Sbjct: 249 TIGMSNENNFTTLKKIAKDTHGEYYNVKNASDLKGTFSKI 288 >gi|254448210|ref|ZP_05061672.1| von Willebrand factor, type A [gamma proteobacterium HTCC5015] gi|198262077|gb|EDY86360.1| von Willebrand factor, type A [gamma proteobacterium HTCC5015] Length = 336 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 79/238 (33%), Gaps = 61/238 (25%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + +D+SGSM D ++R+++A K+ + F+ +E + Sbjct: 90 VALDISGSMEEQDMDD----------NGQRRSRIAVTKDVAMDFVKQ-------REGDRI 132 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDM----DSLILK--PTDSTPAMKQAYQILTSD 308 L+ + T ++ V Q++ D L T A+ A + L Sbjct: 133 ALVLFGTHPYLQTPLTFDHPTVMQHIYEAQLTMADDLQRGIHATAIGDAIGLAVKRLRDI 192 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K +I LTDG +N + V +K A +KI TI Sbjct: 193 DA------------------PDKTLILLTDGSDNASQ--VAPLKAAQIAAREGLKIYTIG 232 Query: 369 INASPNGQRLLKT------------------CVSSPEYHYNVVNADSLIHVFQNISQL 408 + A LL ++ ++ N + L ++Q+I +L Sbjct: 233 LGAEQRQASLLGFDFGFGKNREIDEKTLKDIAKATDGRYFRARNPEELREIYQHIDRL 290 >gi|226314609|ref|YP_002774505.1| hypothetical protein BBR47_50240 [Brevibacillus brevis NBRC 100599] gi|226097559|dbj|BAH46001.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 947 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 38/220 (17%), Positives = 79/220 (35%), Gaps = 51/220 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P ++LV+D SGSM KMA + A + ++ Sbjct: 404 PSLGLQLVIDKSGSMSSDARG---------------ADKMALAREAAIRATTMMNAQD-- 446 Query: 249 KEDVYMGLIGYT---TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 Y+G+I + V + ++++Q ++R TD PA++ Y+ + Sbjct: 447 ----YIGVIAFDDTPWDVVAPQSVT-KLDEIQQQISRIQAD---GGTDIFPALQLGYERV 498 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + +K +I LTDG + + + A+ I + Sbjct: 499 KAMNTQ------------------RKHVILLTDG-QSALDDDYEGLLQQMTAEN--ITVS 537 Query: 366 TISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQN 404 T+++ + + LL+ +Y +A+S+ +F Sbjct: 538 TVAL-GDDSDRGLLEMIAELGKGRYYFANDAESIPKIFSK 576 >gi|90420284|ref|ZP_01228192.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90335618|gb|EAS49368.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 593 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 65/205 (31%), Gaps = 73/205 (35%) Query: 277 RQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + + ++++ T+ + +++L++ F QG + K ++ Sbjct: 395 QATINAAINAMDADGETNIPEGIAWGWRLLSA-------REPFTQGRANDAKDNLKVLVL 447 Query: 336 LTDGENNNFKS------------------------------------------------- 346 +TDG+NN Sbjct: 448 MTDGDNNYGSDENDYNESGYGTFGYASTYDAYGNHSWGRIFDDTSTTSKRANRSSFVSAM 507 Query: 347 NVNTIKICDKAK--------ENFIKIVTISINASPNG--QRLLKTCVS----SPEY--HY 390 N IC K E+ I I TI+ + + ++L++ C S P +Y Sbjct: 508 NEKVAAICQNIKDDGRKATGEDGIVIFTIAFDLNDGSSVKKLMEQCASYGITDPTKKLYY 567 Query: 391 NVVNADSLIHVFQNISQLMVHRKYS 415 + ++ L+ F +I++ + + + Sbjct: 568 DAKSSSDLMAAFDSITEQVSSLRIA 592 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 45/363 (12%), Positives = 108/363 (29%), Gaps = 72/363 (19%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + K+ ++ N +++F L++ +G + + + K +++ + + A LA Sbjct: 11 FRKRFWTAKSGNVAVVFGLTLPILACCMGAAVDISGIYASKRNLQHSVDIAALAA----- 65 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 G + + + + + + F + + F I N S+ + + Sbjct: 66 ------GREYSNNQQDSHLSKVAEGYFFENAGAD---ARANTDFSYDGIFNEDGSTVLQV 116 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + + F + + L ++ Q + ++V Sbjct: 117 SAARRHPTIFGDLLSFVTAGELDWRAFPLAARSQIVVQNQSIELV--------------- 161 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 +V+D SGSM S K+ +K A + Sbjct: 162 -----MVLDNSGSMTGRPKSGGGKR------------KIDTIKEAAIGLTGQFLK----- 199 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G T ++ +G V D+ K D+ + L Sbjct: 200 -----GAASSTLKL----PVQFGVVPFAAAVNIGPDNHDAKWMDTKGRSSIHNEYLDWAN 250 Query: 310 KRSFFTNFFRQGVKIPS--------LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 ++ + P +F ++ EN + K+ + C +++ N Sbjct: 251 WKTSRGVKLAERSPTGDYWQEISSKTPLTRFFVY----ENAHHKNELGPWLGCVESRPNG 306 Query: 362 IKI 364 + I Sbjct: 307 LAI 309 >gi|325922265|ref|ZP_08184046.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] gi|325547218|gb|EGD18291.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] Length = 335 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 46/238 (19%), Positives = 81/238 (34%), Gaps = 54/238 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM S+P+ V + ++ A K L FLD Sbjct: 96 REARQMMLAVDLSGSM-----SEPDMVLGGSVVD-----RLTAAKAVLSDFLDR------ 139 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +E +GL+ + R + VR + + L + T A+ + + L Sbjct: 140 -REGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVVGLAGRETAIGDAIALSVKRLRE 198 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ Q+ ++ LTDG N N +K + AK +++ TI Sbjct: 199 QKQG------------------QRVVVLLTDGVNTAGVLNP--LKAAELAKAEGVRVHTI 238 Query: 368 SINA-----------SPNGQR------LLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + G L K + + + + L ++ + +L Sbjct: 239 AFGGSGGYSLFGVPIPAGGNDDIDEAGLRKIAEQTGGRFFRARDTEELAGIYAELDRL 296 >gi|312883763|ref|ZP_07743482.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] gi|309368512|gb|EFP96045.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] Length = 396 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 53/414 (12%), Positives = 123/414 (29%), Gaps = 51/414 (12%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA------ 63 +F + I + +I + ++ ++ I + N A +AA +A Sbjct: 2 HFNQSIVKHHGSVAISYLAMLIPMVIAAASTIVIGYQVQLSNRAMQAADAASIACEFKGE 61 Query: 64 -GASKMVSNLSRLGDRFESISNHAKRALI-DDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 + S L + + + + + + + + I SL+ F + N Sbjct: 62 YDQALTQSYLDYYQPKIDKVRGQIRTNSGCNMSLGYSLSTIFTSLTLSDTSFVVSSTANE 121 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN-----QKIVSF 176 + + + + + + I L Q+ N VS Sbjct: 122 KAYVTEDVVSDPLELVIVLDISTSMYGAINDLKAILKRGIVSLKEQQNNAQSEDHIKVSI 181 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 IP + + P + + K A + Sbjct: 182 IPFSTGVSVNNAPWL-------------------NDARTFCVDGTTESEDKFYAARTVAN 222 Query: 237 LFLD----SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 L + S+ L K +T + ++V V T Sbjct: 223 LDITHDQISVKLSQPNKWRESCSAASFTL------PLTADLDQVTNTVDSLRTE---GGT 273 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTI 351 S + + LT + ++++ R K+ ++ ++ +TDG + + ++ Sbjct: 274 ASYQGLIWGLRQLTPNWQKAWEVGPNRNVDKV-----ERKLVLMTDGNDYGRYFDDLINA 328 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 +CD+AK+ I + + + + C P+ ++ + L H F + Sbjct: 329 GLCDRAKDYGIALNFVGFGVNGSRLEQFTRCAVDPKGVFSASDTQDLDHYFSQL 382 >gi|317133828|ref|YP_004089739.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315450290|gb|ADU23853.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 1061 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 41/259 (15%), Positives = 90/259 (34%), Gaps = 50/259 (19%) Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM 207 + + T + + ++ +KI + A+ + VD SGSM Sbjct: 527 VSIETEHFSKYMVVDKTEWFNNWRKIYNSYAAIFSAIPS-----YTAICVDCSGSMSTND 581 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 S +D ++ A+ +++S+ + + +I + + + Sbjct: 582 KSFKDDNGVLTCYRN----------IAVQNYVESMFVFDNAS------IITFESSASEEC 625 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 E + ++ ++ T++ A+ A L + Sbjct: 626 EMTNN----KRTLSGKASFYNRGGTNANSAIDIAIDEL-------------------NHV 662 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 +K II L+DG+ N N IK C K N I+I T+++ + N Q L + + Sbjct: 663 YGKKNIILLSDGDVNVSDDN---IKYC---KNNGIRIHTVALGSGANSQLLKQYANDTGG 716 Query: 388 YHYNVVNADSLIHVFQNIS 406 A+ L ++++ + Sbjct: 717 TPLTATTAEGLTKIYESYA 735 >gi|160882770|ref|ZP_02063773.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|237720676|ref|ZP_04551157.1| BatA [Bacteroides sp. 2_2_4] gi|260170239|ref|ZP_05756651.1| aerotolerance protein BatA [Bacteroides sp. D2] gi|293373990|ref|ZP_06620331.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|299145608|ref|ZP_07038676.1| BatA protein [Bacteroides sp. 3_1_23] gi|315918602|ref|ZP_07914842.1| conserved hypothetical protein [Bacteroides sp. D2] gi|156111794|gb|EDO13539.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|229449511|gb|EEO55302.1| BatA [Bacteroides sp. 2_2_4] gi|292631066|gb|EFF49703.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|298516099|gb|EFI39980.1| BatA protein [Bacteroides sp. 3_1_23] gi|313692477|gb|EFS29312.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 327 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 44/276 (15%), Positives = 82/276 (29%), Gaps = 63/276 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y + F+ L +++ + + E I L +D+S SM Sbjct: 51 YLLHVPFLLRCLALVLVILVLARPQTTNKWQNSEIEGIDIMLAIDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K+ F++ +G+ + + Sbjct: 104 --------DLKPNRLEAAKDVAAEFIN----GRPNDN---IGITLFAGETFTQCPLTVDH 148 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + LI T + A L K +S K I Sbjct: 149 AVLLDMIHNIKCGLITDGTAVGMGIANAVTRLKDSKAKS------------------KVI 190 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------- 372 I LTDG NN + T + AK I++ TI + A Sbjct: 191 ILLTDGTNNKGDISPMTAA--EIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMPVE 248 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ ++ + L V++ I +L Sbjct: 249 IDEKTLTEIAGTTDGNYFRATSNSKLKEVYEEIDKL 284 >gi|219850594|ref|YP_002465027.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219544853|gb|ACL26591.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 958 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 39/233 (16%), Positives = 75/233 (32%), Gaps = 40/233 (17%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP I V+D SGSM ++P+ K+ K+A+ + Sbjct: 407 RPDLAIVFVIDKSGSMDACHCANPD---RGGPITSSSERKIDIAKDAVAQATALLSPQDT 463 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V G++ + + G V Q + PT+ + +A ++L Sbjct: 464 V------GVVTFDGAAFPTFVATRG-ATVEQVMDAVSGVEPRGPTNIRAGLLRAEEMLQQ 516 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 R K +I LTDG + S + + I + +E I + + Sbjct: 517 VDARI------------------KHMILLTDG----WGSGGDQLDIAARLREQGITLTVV 554 Query: 368 SINASPNGQRLLKTCVSS-PEYHYNVVNADSLIHVF-----QNISQLMVHRKY 414 + A L+ + +Y + + +F I +V + + Sbjct: 555 A--AGSGSATYLQQLAAEGGGRYYPAADMADVPQIFVQETITAIGNYIVEQPF 605 >gi|73990553|ref|XP_853265.1| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Canis familiaris] Length = 1798 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 50/349 (14%), Positives = 118/349 (33%), Gaps = 56/349 (16%) Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNH----------IKESLSGYSAVFYNTEIQNIV 122 + D + +S + L A F+ + +K+ L + +++ Sbjct: 678 NETSDAIDRMSLINRGTLTGSALTFVGQYFTPTKGARTKVKKFLILITDGEAQDPVRDPA 737 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + R + + + +N T + +S ++++ + L + K+V + AL Sbjct: 738 KALRDKGVVIFSVGVYGANRTQLEEISGDSSLVFQVENFDDL--KTVESKLVFRVCAL-- 793 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + I V+D SGS+ ++ + M L + + Sbjct: 794 HDCKNIKVLDIVFVLDHSGSIG----------------TQEQESMM-------NLTIHLV 830 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + V +G + Y+ E S V +++ R T + A++ A Sbjct: 831 KKADVDSDRVRVGALKYSDYPEVLFYLSGNKSAVIEHLRRRR--YTSGHTYTARALEHA- 887 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 F + ++ +I +TDG +++ + +T K + I Sbjct: 888 -----------NIMFTEEYGSRIQQNVKQMLIIITDGVSHDRDNLSDTAS---KLRNKGI 933 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I + + Q L+T + ++V N +L ++ + + M Sbjct: 934 NIYAVGVG--QANQLELETMAGNKSNTFHVDNFSNLKDIYLPLQEKMCT 980 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 40/246 (16%), Positives = 82/246 (33%), Gaps = 50/246 (20%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + R+ F ++ + R + V + + + E I +VD SGS+ Sbjct: 582 EERVNFGQNFDSLRNIKNEV--VHRICTEKGCEDMKADIMFLVDSSGSIGHD-------- 631 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 KM K + L I + + +G++ ++ ++ + Sbjct: 632 ---------NFGKM---KTFMKNLLAKIQIGP---DSTQIGVVQFSDINQEEFQL--NKY 674 Query: 275 KVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + +D SLI + T + A+ Q T K +KF Sbjct: 675 FTQNETSDAIDRMSLINRGTLTGSALTFVGQYFTP--------------TKGARTKVKKF 720 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 +I +TDGE + + ++ + I ++ + + Q L+ + V Sbjct: 721 LILITDGEAQDPVRDPA-----KALRDKGVVIFSVGVYGANRTQ--LEEISGDSSLVFQV 773 Query: 393 VNADSL 398 N D L Sbjct: 774 ENFDDL 779 >gi|109897980|ref|YP_661235.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] gi|109700261|gb|ABG40181.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] Length = 343 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 79/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM ++ ++ +K+ L F+ + Sbjct: 88 DLMIAVDLSGSM----------KIDDMQVNGRQVDRLQMIKSVLHDFIQR-RIGD----- 131 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ E V Q + + L+ + T A+ A + S ++ Sbjct: 132 -RLGLIFFADTAYLQAPLTYDRETVSQLLNESLIGLVGEQTAIGDAIGLAIKRFKSKEES 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + K +I LTDG+N + A N + + TI + A Sbjct: 191 N------------------KVLILLTDGQNTAGNITPEQAN--ELAINNGVTLYTIGVGA 230 Query: 372 SP-------------NGQRL----LKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 Q L L T S+ ++ +A SL ++ + +L Sbjct: 231 DQMLVQSIFGSRQVNPSQELDEGMLTTLAESTGGRYFRARDAQSLTEIYSKLDEL 285 >gi|239995770|ref|ZP_04716294.1| von Willebrand factor, type A [Alteromonas macleodii ATCC 27126] Length = 358 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 74/231 (32%), Gaps = 55/231 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L VDLSGSM ++ ++ K+ + F+ + Sbjct: 90 MMLAVDLSGSM----------KIDDMQLNGRQVNRLTMTKSVVYDFIQR-------RVGD 132 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + ++ + V ++ + L+ + T A+ A + ++ + Sbjct: 133 RIGLILFADTAYVQAPLTYDRDTVSTLLSEAVIGLVGEQTAIGDAIGLAVKRFDEREESN 192 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 +I LTDG+N K + A +K+ TI + A Sbjct: 193 N------------------VLILLTDGQNTAGNITPEQAK--ELAISKGVKVYTIGVGAD 232 Query: 373 P-------------NGQRL----LKTCVSS-PEYHYNVVNADSLIHVFQNI 405 Q L L +S ++ NA L ++Q + Sbjct: 233 KMLIQSFFGSRQINPSQELDEGMLTNIATSTGGQYFRARNAQELQAIYQQL 283 >gi|84385370|ref|ZP_00988402.1| hypothetical protein V12B01_16906 [Vibrio splendidus 12B01] gi|84379967|gb|EAP96818.1| hypothetical protein V12B01_16906 [Vibrio splendidus 12B01] Length = 319 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 82/235 (34%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM ED+ D +++A+K+ L F++ ++ Sbjct: 83 DLMLVVDLSGSMS------QEDMQFNGEYID----RLSAVKHVLSDFIER-------RKG 125 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + Q + + + LI T + A + Sbjct: 126 DRVGLVLFADHAYLQTPLTLDRDTLSQQLNQAVLKLIGTQTAIGDGIGLATKTFVD---- 181 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S Q+ +I L+DG N + D AK+ I T+ + A Sbjct: 182 --------------SDAPQRVMILLSDGSNTAGVLDPLEAA--DIAKKYNATIYTVGVGA 225 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L++ + ++ ++ L ++ I+QL Sbjct: 226 GEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRARDSKELATIYDTINQL 280 >gi|157374763|ref|YP_001473363.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] gi|157317137|gb|ABV36235.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] Length = 330 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 76/235 (32%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + K + ++ + F++ ++ Sbjct: 84 DLMMAVDLSGSM----------QIEDMVLDGKTVDRFTMIQAVVSDFIER-------RKG 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 127 DKLGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDRVDES 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + ++ LTDG NN+ + A + +KI +I + A Sbjct: 187 N------------------RILVLLTDGSNNSGSISPEQAA--AIAAKRGVKIYSIGVGA 226 Query: 372 SPNGQRLL------------------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +R L ++ ++ NA L ++Q I +L Sbjct: 227 EVMERRTLFGKERVNPSMDLDETQLTALAQTTGGLYFRARNAQELESIYQEIDKL 281 >gi|323493494|ref|ZP_08098616.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] gi|323312317|gb|EGA65459.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] Length = 393 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 47/134 (35%), Gaps = 6/134 (4%) Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 T + V + T S + + LT + + ++ R G Q+ Sbjct: 251 TSNLNDVVDAVNRLQTIGSTASYQGLLWGLRQLTPNWQSAW-----RVGPNRNQDNVQRK 305 Query: 333 IIFLTDG-ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 ++ +TDG ++N+ + +C +AK+ I++ I C S ++ Sbjct: 306 LVLMTDGMDDNSHLDELINAGLCTRAKDLGIELNFIGFGVQSWRLEQFTRCAGSAGAVFS 365 Query: 392 VVNADSLIHVFQNI 405 N L F + Sbjct: 366 ANNTQDLDDYFSQL 379 >gi|52549995|gb|AAU83844.1| cell surface protein [uncultured archaeon GZfos34G5] Length = 1357 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 45/249 (18%), Positives = 93/249 (37%), Gaps = 25/249 (10%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 SY + +L V ++ + + + +V+D+SGSM + D Sbjct: 915 SYALSKDPVSEVLGY---YLSVGLNSGIIESDFQRKK-LNLVIVLDISGSMGSSF--DEY 968 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW- 271 + D T+ A + + LL H+++D +GL+ + T E S Sbjct: 969 YYDRFGNRVDVNETEDAEKSKIEIAAAAIVALLDHLEDDDRLGLVLFNTGAELAEPVSLI 1028 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 G + +++ ++ T + M+ A T + + +++ ++ Sbjct: 1029 GAKNMQKLKGDVLEISATDGTRLSAGMQMA-------------TELYDEFLEVNQSEYEN 1075 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT--ISINASPNGQRLLKTCVS-SPEY 388 IIFLTD N +++ ++ +A N + T I I N L++ Sbjct: 1076 RIIFLTDAMPNLGQTSEESLLGMTEANANK-NVYTTFIGIGVDFN-TELVEYITKIRGAN 1133 Query: 389 HYNVVNADS 397 +Y+V +A Sbjct: 1134 YYSVHSAKQ 1142 >gi|156976371|ref|YP_001447277.1| hypothetical protein VIBHAR_05144 [Vibrio harveyi ATCC BAA-1116] gi|156527965|gb|ABU73050.1| hypothetical protein VIBHAR_05144 [Vibrio harveyi ATCC BAA-1116] Length = 334 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 78/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM +D + ++ A+K L F++ ++ Sbjct: 98 DMMLVVDLSGSMQKEDMNDNGEY----------IDRLTAVKRVLSDFVEK-------RQG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + + V Q + + + L+ + T + Sbjct: 141 DRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGI------------- 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + S Q+ +I L+DG N + + AK+ I T+ + A Sbjct: 188 -----GLGTKTFVDSDAPQRVMILLSDGSNTAGVLDPLEAA--EIAKKYNATIYTVGVGA 240 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + Q L K + ++ +A L ++ I+QL Sbjct: 241 GEMMVKDFFMTRKVNTASELDEQTLTKIAEMTGGKYFRARDAKELETIYDTINQL 295 >gi|114778216|ref|ZP_01453088.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] gi|114551463|gb|EAU54018.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] Length = 355 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 38/246 (15%), Positives = 78/246 (31%), Gaps = 49/246 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM ++ +++ A K F+ + Sbjct: 108 DLLLAVDISGSM----------QIKDFEMNGQQVSRLTATKAVARQFISR-------RVG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + ++ + V + L K T A+ A + L + Sbjct: 151 DRVGLILFGSNAYVQTPLTFDRKTVITLLDEAAVGLAGKATAIGDAIGLAVKRLEQSNRD 210 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +I LTDG N + + + A E+ + I TI I A Sbjct: 211 KRIASK------------EQVLILLTDGVNTAGQLSAPQAA--ELAAEHGLTIYTIGIGA 256 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + L + ++ + L ++ I +L + Sbjct: 257 DAMTVQSFFGTQRVNPSADLDEKMLTDIATKTGGRYFRAHDTQELQKIYAMIDKLEPVAR 316 Query: 414 YSVILK 419 V+ + Sbjct: 317 DRVVFR 322 >gi|289667993|ref|ZP_06489068.1| hypothetical protein XcampmN_05693 [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 310 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/238 (17%), Positives = 78/238 (32%), Gaps = 54/238 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM + K ++ A K L FLD Sbjct: 71 REARQMMLAVDLSGSMS----------EPDMVLGGKVVDRLTAAKAVLSDFLDR------ 114 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GL+ + R + VR + + L + T A+ + + L Sbjct: 115 -RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRLRE 173 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ Q+ ++ LTDG N N +K + AK +++ TI Sbjct: 174 QKQG------------------QRVVVLLTDGVNTAGVLNP--LKAAELAKAEGVRVHTI 213 Query: 368 SINA-----------SPNG-----QRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 + G + L+ + + + + L ++ + +L Sbjct: 214 AFGGSGGYSLFGVPIPAGGNDDIDEEGLRKIAQQTGGRFFRARDTEELAGIYAELDRL 271 >gi|289662175|ref|ZP_06483756.1| hypothetical protein XcampvN_03493 [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 335 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/238 (17%), Positives = 78/238 (32%), Gaps = 54/238 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM + K ++ A K L FLD Sbjct: 96 REARQMMLAVDLSGSMS----------EPDMVLGGKVVDRLTAAKAVLSDFLDR------ 139 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GL+ + R + VR + + L + T A+ + + L Sbjct: 140 -RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRLRE 198 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ Q+ ++ LTDG N N +K + AK +++ TI Sbjct: 199 QKQG------------------QRVVVLLTDGVNTAGVLNP--LKAAELAKAEGVRVHTI 238 Query: 368 SINA-----------SPNG-----QRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 + G + L+ + + + + L ++ + +L Sbjct: 239 AFGGSGGYSLFGVPIPAGGNDDIDEEGLRKIAQQTGGRFFRARDTEELAGIYAELDRL 296 >gi|86747937|ref|YP_484433.1| hypothetical protein RPB_0811 [Rhodopseudomonas palustris HaA2] gi|86570965|gb|ABD05522.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 435 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 69/448 (15%), Positives = 139/448 (31%), Gaps = 79/448 (17%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA--SKMV 69 ++ N ++IFA++++ L IG I + + ++SA +AA+L S++ Sbjct: 15 RRFGRDRSGNIAVIFAIALLPILGFIGAAIDYATANRIRTKLQSAQDAAVLLAVSNSEIN 74 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 ++ E N A A I+ + +A F +T N +N Sbjct: 75 RTTAQAKADAEQFFNATIGAYGLTATIKIEVTENDGKRSATADFTSTVTTNFLNLIGYPT 134 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + N + + I Y +++ + + ++ Sbjct: 135 LAIGNRSTSTVSRPI---------YQDFYLLLDNSPSMGVAATTADIATMVG--NTSDKC 183 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 F C SD + + K ++ ++ A+ + L++ V Sbjct: 184 AF------------ACHDLSDSNNYYNLAKKLGVKM-RIDVVRQAVQQLTSTATLMTAVN 230 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 M + S G + + + M Q +D+ Sbjct: 231 NQFRMAVYT-----LGGSCASLGLTTIASLSSAMSSVQTAAG--AIDLMSIPKQNYNNDQ 283 Query: 310 KRSFFTNFFRQGVKIPS------LPFQKFIIFLTDG--ENNNFKSNVN------------ 349 F + IPS QK++ F++DG + NN Sbjct: 284 CTDFNSALAAMNTTIPSSGTGTAAQPQKWLFFVSDGVADFNNPSGCTQPTVSGGRCQEPL 343 Query: 350 TIKICDKAKENFIKI---VT--------------IS-INASPNGQRL-------LKTCVS 384 T+ C K+ I+I T I+ NA P G + +K+C Sbjct: 344 TVTQCKAMKDRGIQIAVLYTTYLALPTNQWYNDHIAPFNAGPYGPSVNSQIAAKMKSCA- 402 Query: 385 SPEYHYNVVNADSLIHVFQNISQLMVHR 412 SP++++ V + + + V + Sbjct: 403 SPDFYFEVSPTQGISEAMDALFKKAVAK 430 >gi|116051069|ref|YP_790101.1| von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa UCBPP-PA14] gi|296388430|ref|ZP_06877905.1| von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa PAb1] gi|313108364|ref|ZP_07794396.1| putative von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa 39016] gi|115586290|gb|ABJ12305.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa UCBPP-PA14] gi|310880898|gb|EFQ39492.1| putative von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa 39016] Length = 340 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 62/182 (34%), Gaps = 37/182 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM QD + +++ +K F++ + Sbjct: 91 DLLLAVDVSGSMD----------YRDMRWQDDEISRLELIKKLFGDFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRLRQRPAE 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG N + A E +KI TI + A Sbjct: 194 S------------------RVLVLITDGANTGGQIAPQIAAQ--LAAEQQVKIYTIGVGA 233 Query: 372 SP 373 P Sbjct: 234 DP 235 >gi|291523143|emb|CBK81436.1| fibro-slime domain [Coprococcus catus GD/7] Length = 1745 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 61/207 (29%), Gaps = 55/207 (26%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 R E + LV+D SGSM M AL NA Sbjct: 873 STSGREAGTEAKAASVVLVLDRSGSMGADG--------------------MTALVNAADT 912 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM---------DSLI 288 F+D++ S + ++ + +++ + + + + Sbjct: 913 FIDTLKTASPDSQV---AVVYFNGTQDEDDNTTTSKNFTKLNTDEKVKSIKDFLSNNGYS 969 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN- 347 T A+++A +L +D QK+++F TDG + + Sbjct: 970 YGGTPMGDALEKAKGLLDAD-----------------QTGNQKYVLFFTDGLPGHSSDDA 1012 Query: 348 -----VNTIKICDKAKENFIKIVTISI 369 N+ C + I T+ Sbjct: 1013 FNCMVANSAVNCATDIKANATIYTVGY 1039 >gi|326424188|ref|NP_762140.2| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|319999572|gb|AAO07130.2| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 362 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 45/224 (20%), Positives = 77/224 (34%), Gaps = 35/224 (15%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM K +++ A K L F + ++ Sbjct: 107 DVMVVVDLSGSMA---------EQDFTSASGAKISRLDATKEVLADFAKT-------RQG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + ++ + T A+ A + + Sbjct: 151 DRLGLILFGDAAFVQTPFTADQKVWLALLNQTDVAMAGQSTHLGDAIGLAI-KVFEQSEP 209 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S S P QK I LTDG N+ S V I AK ++I I++ Sbjct: 210 SQAAF---------SKPRQKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRIHVIAMGD 258 Query: 372 SPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G+ L + S + +N D L + +I +L Sbjct: 259 PSTVGESALDLQTIERIASESGGKAFQALNRDELASAYDDIGKL 302 >gi|307721534|ref|YP_003892674.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306979627|gb|ADN09662.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 303 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 46/229 (20%), Positives = 79/229 (34%), Gaps = 39/229 (17%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + +R + +D SGSM + P + K ALK L F+ Sbjct: 72 QKTSSKRKGRDLVFALDTSGSMA--------ESGFNPENVQNR--KFDALKELLRSFITK 121 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ++V G+ + T I S+ V + + T + A Sbjct: 122 -----RYNDNV--GVSIFGTYAYPAIPLSYDMGSVAFLLDFFDVGIAGDSTAIGEGLAMA 174 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 +IL + + +K II +TDG N+ +V KAK+ Sbjct: 175 LKILKKGEAK------------------EKVIILITDGYQNSGAVSVKEAVQ--KAKKQH 214 Query: 362 IKIVTISI-NASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 +KI TI I + S LL+ ++ + N L +++ I +L Sbjct: 215 VKIYTIGIGDRSAFDANLLQLIAKNTDAKMFEAKNVKMLQDIYKEIDKL 263 >gi|260914303|ref|ZP_05920772.1| Flp pilus assembly protein TadG [Pasteurella dagmatis ATCC 43325] gi|260631404|gb|EEX49586.1| Flp pilus assembly protein TadG [Pasteurella dagmatis ATCC 43325] Length = 584 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 44/279 (15%), Positives = 82/279 (29%), Gaps = 35/279 (12%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 +R K EK ++++ AL L+LI F + K + A + A L Sbjct: 18 TRCFQKLKDFYQEEKGVYAVMTALLSFPLLVLIAFTVDGTGIILDKVRLAQATDQAALLL 77 Query: 65 ASK--------MVSNLSRLGDRFESIS----NHAKRALIDDAKRFIKNHIKESLSGYS-A 111 ++ M ++++ E +S + + I+ K L + A Sbjct: 78 VAENNAYRKNPMHDDVTKQSVSKEELSKFSGDKLSAQKDKRNQELIQGLAKMYLRSENKA 137 Query: 112 VFYNTEIQNIVNSSRISM----THMANNRLDSSNNTIFY-NMDVMTSYDYRLQFIEHLLN 166 N I N+ Y V + L L Sbjct: 138 QKDNHLPVTIDQPFDYKCEELDLINPKNQYSRRKPVTCYVQGSVNREFWIPL---SADLV 194 Query: 167 QRYNQKIVSFIPALLRIEMGERPIF---LIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + + + I + + + E+ I + LV D SGSM + ++ Sbjct: 195 KTHTKNGRLPINSGISYAVKEKAIVIPVDLMLVSDFSGSMLWDLKNNENAQY-------- 246 Query: 224 KRTKMAALKNALL---LFLDSIDLLSHVKEDVYMGLIGY 259 K+ L++ + L L MG + Sbjct: 247 PNRKIDILRSVVSDIQNILFPTKLSEDASPYNRMGFAAF 285 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 48/154 (31%), Gaps = 25/154 (16%) Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 +W T+ + T ++ +L T Q K+ Sbjct: 421 AWFTKSKPKVAEALSKIKPTGSTAASSGFIIGANLLMDKN-----TVPEAQPAKLG-TNT 474 Query: 330 QKFIIFLTDGENNNFKSNVNT----IKICDKAKEN--------------FIKIVTISINA 371 Q+ ++ L+DGE+N + T +CD ++ I Sbjct: 475 QRILMVLSDGEDNRPTFDTLTTLLNAGLCDNIRKKADSLQDPKFNTLPTKIAFAAFGFQP 534 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 P + + CV +Y + ++L+ F+ I Sbjct: 535 PPEQKAAWQKCVG-ENNYYEPSSKEALLDAFKQI 567 >gi|189912860|ref|YP_001964749.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|189913185|ref|YP_001964414.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167777536|gb|ABZ95836.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781253|gb|ABZ99550.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 317 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/260 (16%), Positives = 77/260 (29%), Gaps = 44/260 (16%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 Y L+ + + + ++ I + +D+SGSM + + P Sbjct: 56 YSFTEFLVYLSMVFLVVAAAGPGSKYKLSPDSTKGVDIMIALDISGSMVNSYDFLP---- 111 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 R +++ K+ L F+ + +G++ + S Sbjct: 112 ---------RNRLSVSKDLLREFVKK-------RLYDRIGIVVFAGAAYLQSPLS----- 150 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 R ++ T S + + S K II Sbjct: 151 ----SDRFALDELIAGTSS--------EDIEEQGTAVGDALVLSSYRLKNSEAKSKVIIL 198 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL----LKTCVS-SPEYHY 390 LTDG +N K + +T K IK+ I I + L+ S + + Sbjct: 199 LTDGVSNTGKLDPDTAAY--TTKTMGIKVYCIGIGKEEGQYEINYESLQKISSNTNGKFF 256 Query: 391 NVVNADSLIHVFQNISQLMV 410 + + L V I QL V Sbjct: 257 RAESPEVLESVLNEIDQLEV 276 >gi|163748339|ref|ZP_02155613.1| hypothetical protein OIHEL45_20491 [Oceanibulbus indolifex HEL-45] gi|161378385|gb|EDQ02880.1| hypothetical protein OIHEL45_20491 [Oceanibulbus indolifex HEL-45] Length = 405 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 16/73 (21%), Positives = 32/73 (43%), Gaps = 2/73 (2%) Query: 344 FKSNVNTIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + IC A+ I I I ++ S N LLK C S+ ++++V + + F Sbjct: 333 DEKDRRLRNICAAAQRAGIVIYAIGMDVDSQNSLDLLKECASTEAHYFDV-DGLEIQTAF 391 Query: 403 QNISQLMVHRKYS 415 I+ + + + Sbjct: 392 DMIAASISMLRLT 404 >gi|312881786|ref|ZP_07741560.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] gi|309370537|gb|EFP98015.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] Length = 323 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 40/269 (14%), Positives = 81/269 (30%), Gaps = 55/269 (20%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 +Y + + LL + + + + + LV+DLS SM D Sbjct: 50 NYFTKTLSILLWCALITALARPVWYGEPVTTQPKHR-DLMLVLDLSYSMSQEDMQDSSGN 108 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 ++ A+KN + F ++ +GL+ + + Sbjct: 109 Y---------IDRLTAVKNVVSQFAQQ-------RKGDRLGLVLFADHAYLQTPLTLDRN 152 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + + V + LI + T + A + S Q+ +I Sbjct: 153 TISEQVNSLVLQLIGQKTAIGEGIGLATKTFID------------------SDAPQRVMI 194 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------NGQ 376 L+DG N + + + AK+ I TI + A + + Sbjct: 195 LLSDGSNTSGVLDPIEAA--NIAKKYNATIYTIGVGAGEMMVKDFFMTRKVNTAQDLDEK 252 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 L+ + ++ NA L ++ I Sbjct: 253 TLMSIAKITGGQYFRARNAQELATIYDTI 281 >gi|187251529|ref|YP_001876011.1| von Willebrand factor type A [Elusimicrobium minutum Pei191] gi|186971689|gb|ACC98674.1| Von Willebrand factor type [Elusimicrobium minutum Pei191] Length = 335 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 74/192 (38%), Gaps = 46/192 (23%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + +I + + + + VD+S SM +D K T++ K L + Sbjct: 80 GVEKINVTAQSSHSV-IAVDVSDSM---------------KARDLKPTRLENAKTMLKML 123 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPA 297 + + K + G++ +T++ + E ++ +V + L K T PA Sbjct: 124 ISA-------KGEQRTGIVAFTSKAYTQCPITNDVEALKYFVNQLRPEMLNAKGTALAPA 176 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 +++A + P +K +I LTDGE++ + IK A Sbjct: 177 VQRA-------------------AEMLSKYPGKKALILLTDGEDHEPEQIEEAIK---TA 214 Query: 358 KENFIKIVTISI 369 ++ IKI+ + I Sbjct: 215 QKEGIKIIAVGI 226 >gi|13476808|ref|NP_108377.1| hypothetical protein mll8241 [Mesorhizobium loti MAFF303099] gi|14027569|dbj|BAB53838.1| mll8241 [Mesorhizobium loti MAFF303099] Length = 678 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 38/259 (14%), Positives = 90/259 (34%), Gaps = 56/259 (21%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ + N++++ ++++ + + + + + +K + +A +AA A A ++ Sbjct: 23 RQFRRDRRGNYALMTVVAMVPLMGGLAIAVDFTEMNREKQMVTNALDAANFATARRLT-- 80 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 E ++ +A D F+N + +I +S T Sbjct: 81 --------EGATDDQLKAYALD-------------------FFNANLNDIDPASA---TL 110 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY---NQKIVSFIPALLRIEMGER 188 ++S + + Y L+ + NQKI + + +R+ + Sbjct: 111 NVTLPSNTSGGGLLTMTAQLAYKPYFYPAFAQLVGKSATDANQKINFSVTSQVRL----K 166 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA----LLLFLDSIDL 244 + LV+D SGSM + + ++ LK A + + Sbjct: 167 NTLEVALVLDNSGSMT-------------TLGTGSGQKRIDLLKTASKQLVDTLAQQAVM 213 Query: 245 LSHVKEDVYMGLIGYTTRV 263 + V + V GL+ + V Sbjct: 214 IKQVDKPVQFGLVPFAASV 232 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 35/232 (15%), Positives = 72/232 (31%), Gaps = 29/232 (12%) Query: 211 PEDVNSAPICQDKKRTKMAALKNA--LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI- 267 P C T + + A +IDL+ + + RV + Sbjct: 446 PAGNGPNYSCTTNPITPLTDVSVADGATSIKAAIDLMQPNGGTNVPEGMAWGWRVVSSGE 505 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + G + + + + L Y ++ K ++ + + + Sbjct: 506 PFTQGRRETEKGNDKVVIVLTDGANTYYTPSSLGYSD-PANSKSTYASYGYLNPGYNGTS 564 Query: 328 PFQKFI-----IFLTDGENNNFKS--NVNTIKICDKAKENFIKIVTISI-----NASPN- 374 + F+ I D N N+ + N +C+ AK I ++T+++ AS Sbjct: 565 VGRMFMGTSSAIGQLDYSNGNYTNALNEQMATLCNNAKAANIMVMTVALDLSTTKASDKL 624 Query: 375 GQRLLKTCVSS------------PEYHYNVVNADSLIHVFQNISQLMVHRKY 414 LK+C S P + SL + F+ I + + + Sbjct: 625 AIDALKSCSSDSRFRKDPTDPSKPAKLFWNATGASLSNDFKEIGNELSNLRV 676 >gi|269926132|ref|YP_003322755.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] gi|269789792|gb|ACZ41933.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] Length = 643 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 40/226 (17%), Positives = 72/226 (31%), Gaps = 51/226 (22%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + L +D S SM+ A A I+ LS + Sbjct: 97 IDVVLALDTSASMND-----------------------DAFTAAQDAAYGLINGLSPEDK 133 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 GLI + + +V++ + + S+ T + A Q + + Sbjct: 134 V---GLITFDKTARVIEPLAQDHARVQESIQKLSRSV---GTALYQGLSLAAQEVAKGQN 187 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 K I+ +TDG N N + KA+E + T+ Sbjct: 188 T-------------------KAIVLMTDGFN--TSRNTTLEEAVAKAQEVGASVFTVGFG 226 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + Q L K + +++ L VF +ISQ + H++Y + Sbjct: 227 KKVDTQGLQKIANETGGEYFSAPTNAQLRRVFADISQKL-HQEYRL 271 >gi|153831781|ref|ZP_01984448.1| von Willebrand factor, type A [Vibrio harveyi HY01] gi|148872291|gb|EDL71108.1| von Willebrand factor, type A [Vibrio harveyi HY01] Length = 334 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 78/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM +D + ++ A+K L F++ ++ Sbjct: 98 DMMLVVDLSGSMQKEDMNDNGEY----------IDRLTAVKRVLSDFVEK-------RQG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + + V Q + + + L+ + T + Sbjct: 141 DRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGI------------- 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + S Q+ +I L+DG N + + AK+ I T+ + A Sbjct: 188 -----GLGTKTFVDSDAPQRVMILLSDGSNTAGVLDPLEAA--EIAKKYNATIYTVGVGA 240 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + Q L K + ++ +A L ++ I+QL Sbjct: 241 GEMMVKDFFMTRKVNTASDLDEQTLTKIAEMTGGKYFRARDAKELETIYDTINQL 295 >gi|323499301|ref|ZP_08104278.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] gi|323315689|gb|EGA68723.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] Length = 322 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 71/232 (30%), Gaps = 55/232 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+DLS SM SD D ++ A+K + F +E Sbjct: 86 DLMLVLDLSYSMSQEDMSDGSDYVD----------RLTAVKKVVSDFAIK-------REG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + V V + + LI T + A + Sbjct: 129 DRLGVVLFADHAYLQTPLTLDRTTVADQVNQLVLRLIGDKTAIGEGIGLATKTFID---- 184 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S Q+ +I L+DG N + + AK+ I TI + A Sbjct: 185 --------------SDAPQRVMILLSDGSNTSGVIDPIEAA--KIAKKYDATIYTIGVGA 228 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + L++ + ++ +A L ++ I Sbjct: 229 GEMMVKEFFMTRKVNTAQDLDEKALMQIAQITGGQYFRARDAKELATIYDTI 280 >gi|91975399|ref|YP_568058.1| hypothetical protein RPD_0919 [Rhodopseudomonas palustris BisB5] gi|91681855|gb|ABE38157.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 435 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 62/445 (13%), Positives = 135/445 (30%), Gaps = 75/445 (16%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + N ++IFA++++ L IG + + + +ESA +AA+L S Sbjct: 16 RFASDRSGNIAVIFAIALLPILGFIGAAVDYTNASRVRAKLESAQDAAVLLAVS------ 69 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + + +A DA +F + + +E ++ + + Sbjct: 70 ----NSAINKTVADAQA---DAVQFFNATLDGYGLSATIDLSVSENDGKRSAVSSFSSSV 122 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + LD + TS +++ L + P++ + Sbjct: 123 KTHFLDMIGYPTLAIGNRSTSTVSLPVYVDFYLLLDNS-------PSMGVAATTSDIATM 175 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + D +++ N A ++ ++ A+ + +S V Sbjct: 176 VANTSDQCAFACHDLSTSNNYYNLAKKLGVT--MRIDVVRQAVQRLTTTATAMSAVTNQF 233 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 MG+ + + + G V + T M YQ +D+ Sbjct: 234 RMGVYTF-----GSSCTAIGLTTVANLSSSMSSVQTSVGT--IDLMTIPYQGYNNDQCTD 286 Query: 313 FFTNFFRQGVKIP------SLPFQKFIIFLTDG--ENNNFKSNVN------------TIK 352 F + IP S QK++ F++DG + N + T+ Sbjct: 287 FDGSLTAINSAIPSPGSGISTQPQKWLFFVSDGVADANYPSTCTKPTVSGGRCQEPLTVA 346 Query: 353 ICDKAKENFIKI---VTISINASPNG----------------------QRLLKTCVSSPE 387 C K I+I T + N +++C SP Sbjct: 347 QCTAIKSRGIQIAVLYTTYLALPTNSWYNTYIAPFNPGPYGPSTNSQIAANMQSCA-SPG 405 Query: 388 YHYNVVNADSLIHVFQNISQLMVHR 412 +++ V + + + V + Sbjct: 406 FYFEVSPTQGIAEAMDALFKKAVAK 430 >gi|255526268|ref|ZP_05393185.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296186262|ref|ZP_06854666.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] gi|255510048|gb|EET86371.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296049063|gb|EFG88493.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] Length = 422 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 46/237 (19%), Positives = 90/237 (37%), Gaps = 51/237 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + I +D SGSM N+DP + + +++L+ + Sbjct: 111 KKADDIVFAIDTSGSMK---NTDPNNERFSA----------------------ALNLIDN 145 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQIL 305 + ++ + + EK I S T++ R+ V+ + T+ A+++AY+ + Sbjct: 146 MDKNNRFSMYKFDDTAEKIIPMSQVTKQSREEVSGKLKDMQNPKGNTNMRDALEKAYEEI 205 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S + + +I L+DG + + + + KE I I Sbjct: 206 KSSETKDKNA----------------MVIMLSDG-GDTYDLSKKFDETLKPFKEKNISIY 248 Query: 366 TISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI-----SQLMVHRKYSV 416 TI ++ N +LK S +YNV L +VF I +L+V ++Y + Sbjct: 249 TIG-MSNGNNFSMLKEIAKESGGNYYNVKEIKDLKNVFNKIYRDRQQRLLVDKRYGI 304 >gi|90414549|ref|ZP_01222523.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] gi|90324356|gb|EAS40922.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] Length = 321 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 48/268 (17%), Positives = 92/268 (34%), Gaps = 55/268 (20%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + + LL + + IE+ + L VDLSGSM + Sbjct: 52 KVMMSLLWVCLVGALARPVWYGDPIEVQPDHR-DMLLAVDLSGSMS---------IEDMI 101 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + ++AA+K+ L F++ ++ +GL+ + ++ V+Q Sbjct: 102 TQSGESIDRLAAVKDVLAEFIEQ-------RKGDRLGLVLFAQHAYLQTPLTFDRNTVKQ 154 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + R + LI + T + A + + S Q+ II L+D Sbjct: 155 QLERTVLGLIGQSTAIGEGLGIATKTFIN------------------SEAPQRVIILLSD 196 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------NGQRLLK 380 G N + ++ A E+ + I T+ + A + + L K Sbjct: 197 GANTAGV--IEPLEAAKLAAESNVTIYTVGVGAEEMIQKSFFGNRKVNPSQDLDERMLTK 254 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ N L H++Q I QL Sbjct: 255 IADMTGGQYFRARNPQELEHIYQLIDQL 282 >gi|86143679|ref|ZP_01062055.1| batA protein [Leeuwenhoekiella blandensis MED217] gi|85829722|gb|EAQ48184.1| batA protein [Leeuwenhoekiella blandensis MED217] Length = 334 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 44/277 (15%), Positives = 82/277 (29%), Gaps = 67/277 (24%) Query: 157 RLQFIEHLLNQR-YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + F+ LL + + I + +D+S SM Sbjct: 57 PVLFVLRLLALACLITALARPRNVDVSTRTNTTRGIDIVIAIDVSASMLAR--------- 107 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 D K ++ ALK F+ + +GL+ Y + Sbjct: 108 ------DLKPNRLEALKEVASQFI-----ADRPSD--RIGLVEYAGESYTRTPITSDKSI 154 Query: 276 VRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 V + +S+I T + + L + +S K II Sbjct: 155 VLSSLNDIQYNSIIEGGTAIGMGLATSVNRLKDSRAKS------------------KVII 196 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-----------NASPNG-------- 375 +TDG NN +T + A+E IK+ TI + Sbjct: 197 LMTDGVNNAGFIEPSTAS--ELAQEFGIKVYTIGLGTNGTALSPVALRPDGSFQYGSIPV 254 Query: 376 ---QRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + LL+ + ++ + +SL ++ I++L Sbjct: 255 EIDEALLQEIADKTGGLYFRATDNESLEEIYAEINKL 291 >gi|296208411|ref|XP_002751094.1| PREDICTED: epithelial chloride channel protein-like [Callithrix jacchus] Length = 904 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 44/277 (15%), Positives = 92/277 (33%), Gaps = 64/277 (23%) Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 T N + N+ +N+ + + L + + + P ++ +R Sbjct: 257 TEAPNLQNKMCNHKSTWNIIMHSEDFQHLSPMTEINSP--------PHPTFSLLKSKQR- 307 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 ++ LV+D SGSM + A L + I+ S V Sbjct: 308 --VVCLVLDKSGSMAAEDRLFRMNQA------------------AELYLIQIIEKGSLV- 346 Query: 250 EDVYMGLIGYTTR-VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 G++ + + +N + Q +T ++ T +K +Q ++ Sbjct: 347 -----GMVTFDSYAQIQNNLIKITEDNTYQKITANLPQEASGGTSICNGLKAGFQAISQS 401 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTI 367 + + + II LTDGE+N +C ++ +++ I TI Sbjct: 402 NQSTLGSE----------------IILLTDGEDNQIS-------LCFEEVRQSGAIIHTI 438 Query: 368 SINASPNGQRL--LKTCVSSPEYHYNVVNADSLIHVF 402 ++ + L L ++ Y + + LI F Sbjct: 439 AL-GPSAEKELETLSN-MTRGHRFYAHNDINGLIDAF 473 >gi|222616426|gb|EEE52558.1| hypothetical protein OsJ_34813 [Oryza sativa Japonica Group] Length = 517 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 41/236 (17%), Positives = 85/236 (36%), Gaps = 46/236 (19%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + +R + V+D+SGSM+ + ++ +++ LK A+ + + Sbjct: 56 VAPEKRAPIDLVAVLDVSGSMNKEEFVRGKHMS----------SRLDLLKIAMKYIIKLV 105 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQA 301 + ++ + V + + R+ + +D L TD PA+K+A Sbjct: 106 RDAD------RLAIVSFNHAVVSEYGLTRNSADSRKKLENLVDKLKASGNTDFRPALKKA 159 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 +IL K + ++ FI+ L+DG + S +N K+ AK Sbjct: 160 VEILDGRGKEE-------KKKRVG------FILLLSDGVDQFQYSRINWEKV---AKSTD 203 Query: 362 I------------KIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQN 404 + + T +AS + L + S + + N D++ F Sbjct: 204 VDHSEVGAMLRKYAVHTFGFSASHDPVPLRQISALSYGLYSFVCKNLDNITEAFAR 259 >gi|78776855|ref|YP_393170.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497395|gb|ABB43935.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 309 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 44/254 (17%), Positives = 86/254 (33%), Gaps = 41/254 (16%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI--FLIELVVDLSGSMHCAMNSDPEDV 214 + + L IV+ + + P + I L++D S SM Sbjct: 48 SSKRLLFLKWLGIFMMIVALMSPIKDEPYELEPKDGYEIALILDASESM----------K 97 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 Q++ ++ +K + F+ + MGL+ + ++ Sbjct: 98 AQGFDVQNQHLSRFDVVKEIVSDFISQRKNDN-------MGLVVFGAYSFIASPLTYDVN 150 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + + +++ + K T ++ Q +L K ++ K I Sbjct: 151 ILNKILSQLQIGMAGKYTALNTSLAQGANLLKQSKSKT------------------KIAI 192 Query: 335 FLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYN 391 LTDG + + + T+ I D K+ IK+ I I N + LLK S + Sbjct: 193 LLTDG-YSTPQVDTITLDIALDMIKKEGIKVYPIGIGMPHEYNTEALLKIANESGGVAFG 251 Query: 392 VVNADSLIHVFQNI 405 +A L V++ I Sbjct: 252 ASSAAELQEVYKKI 265 >gi|87200512|ref|YP_497769.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] gi|87136193|gb|ABD26935.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] Length = 631 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 38/313 (12%), Positives = 81/313 (25%), Gaps = 36/313 (11%) Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 + F +I V+S + T N + +N + + + + Sbjct: 344 SNFAYKQINVDVSSYKKFQTVTVQNGTNGANVSYTWKGCIEERDTEAASSFSY------- 396 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR----- 225 + + I+ V D + A + Sbjct: 397 -------STVDGMSPSTALDLDIDRVPDSDPATKWAPMWPELGYYRTASSRSSTPVSTLE 449 Query: 226 -TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ---YVT 281 T + L A + +S Y + ++ WG + Sbjct: 450 TTSGSQLSAACPYKAQLLQTMSQSAFYAYADALSANGSTYHDLGMLWGLRLSSPDGPWQA 509 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 ++ S + + ++ K +I TD + Sbjct: 510 MVNETPENGGEVSRHIIFMTDGQMDTNYKVMSTYGIEWHDRRITDDGV-------TDQD- 561 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + +CD AK ++ I+ + N L C S+ + NA L Sbjct: 562 --ARHTLRFRALCDAAKAKGFRVWVIAFASDLNDD--LSYCASASST-FPATNATELNTA 616 Query: 402 FQNISQLMVHRKY 414 FQ I++ + + Sbjct: 617 FQEIAKNVAELRV 629 Score = 37.2 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 46/339 (13%), Positives = 93/339 (27%), Gaps = 77/339 (22%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISN 84 + A V +LLIG + + + +N ++SA +A LAG + S + ++ + Sbjct: 4 LAATCVPVLILLIGSGLDMGRLYKARNRLQSACDAGALAGRRSVSSAGYDDAAKAQAAAF 63 Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 DD N S G S V Sbjct: 64 FNANFNEDDLGATETNFATSSADGGSLVEGIATTD------------------------- 98 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 ++ I I M + + +V+D +GSM Sbjct: 99 -----------------VEMVLMNLFGVISVPINVECSATM-DIGNTDVTMVLDTTGSMS 140 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED-VYMGLIGYTTRV 263 ++ ++ AL+ A+ F D++ + V + Y++ V Sbjct: 141 QTLSGTTTK-------------RIDALRTAMKNFYDTVSAATTGSNARVRYSFVPYSSSV 187 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 ++ + D TP + + + T + Sbjct: 188 --------NVGQLIYDLDPDYLVDTWAIQSRTPVFNTVTEQILTGYDTPVTTTASSYSNE 239 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 T + + + N++ C+ AK + Sbjct: 240 T------------TGNDQSYNSTRYNSLSACNTAKPADV 266 >gi|327400025|ref|YP_004340864.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327315533|gb|AEA46149.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 790 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 57/160 (35%), Gaps = 17/160 (10%) Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 GYT R + + + T D T +K A Q L ++ Sbjct: 534 GYTPRYDVSQTLTNDTLSAN---NSIDDLWAYGGTPMGGGIKVARQELVANTAPGNIPVM 590 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE-----NFIKIVTISINAS 372 P+L +DG + + I+ + K+ I I TI + Sbjct: 591 IVLSDGNPTLT--------SDGTASETLAIQEAIEEAETTKQTTIGGEQILIYTIGF-GN 641 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + LLK +SP+Y+Y ++ L +++ I++ + + Sbjct: 642 DANETLLKQIATSPDYYYFAATSEELSSIYRQIAKELKEK 681 >gi|297527229|ref|YP_003669253.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297256145|gb|ADI32354.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 333 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 52/280 (18%), Positives = 97/280 (34%), Gaps = 56/280 (20%) Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 N+ + + + F L + + + +I + +P + +V+D SGSM Sbjct: 61 NIALAIASIVLIMFSLALPYTIIPRYVKTTQTLEAKISLQRKP--PVVIVLDTSGSMK-- 116 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 DK T + A+K + + V +GLI + V Sbjct: 117 --------------GDKIITAINAVK----------KFIDQTIDYVLIGLITFNDHVRIA 152 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 I P T + + T + ++ AY L + + Sbjct: 153 IPP---TSDQELLYKKLGEIKAFGGTIYSKPLEIAYDWLVPFAEFNLSPT---------- 199 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG-------QRLL 379 IIF+TDG + + + + A+ N I I I I Q+ L Sbjct: 200 ------IIFVTDGLPYSQDAPLYREVVYKCARYN-ITIYPIFIETPGMSIYETMMAQQRL 252 Query: 380 KTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 + + + YNV +SLI++F+ +++ V + + IL Sbjct: 253 REIANITKGQFYNVKQTNSLINLFEKLAEKTVSKAGNYIL 292 >gi|284036687|ref|YP_003386617.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283815980|gb|ADB37818.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 316 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 79/224 (35%), Gaps = 41/224 (18%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + L++D +GS+ + P + +K+ L Sbjct: 85 SKPGGYSAMLLLDQTGSIS----------TTDPYNLRIEASKI---------------FL 119 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +++ D Y GL +T+ ++ G + + + +D+L L + TP Q + Sbjct: 120 NNLGTDDYTGLTSFTSSYTSVVKLHSGFTNKTEQMKKSLDTLALNVSGGTPLYTSTIQSV 179 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 T + + I+F TDGENN + + KA + I + Sbjct: 180 T-------------YTAQKGPTANKAVIVF-TDGENNVTTNTLEDAT--AKAIQQKIPLF 223 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 T+ ++ N L + + + +A LI F + L+ Sbjct: 224 TVGLSTDVNVNVLAQMANETGGAFFYAKDAGQLISTFGTLGNLL 267 >gi|218676637|ref|YP_002395456.1| hypothetical protein VS_II0874 [Vibrio splendidus LGP32] gi|218324905|emb|CAV26683.1| Conserved hypothetical protein [Vibrio splendidus LGP32] Length = 355 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 46/238 (19%), Positives = 85/238 (35%), Gaps = 33/238 (13%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 L ++ E+ + +VVDLSGSM Q K +++ A K L Sbjct: 88 TILGEPQVREQLGRDVMVVVDLSGSMA---------EQDFTSKQGDKISRLDATKEVLAD 138 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F + ++ +GLI + + + + + + ++ + T A Sbjct: 139 FAKT-------RKGDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDA 191 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A + F + Q + + +K +I LTDG N+ S V I A Sbjct: 192 IGLAI--------KVFEQSAAVQDSSVDANVKEKVVIVLTDG--NDTGSFVEPIDAAKVA 241 Query: 358 KENFIKIVTISINASPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 K ++I I++ G+ L + S + +N D L + I +L Sbjct: 242 KAKGVRIHVIAMGDPQTVGEVALDMETIKRVAQESGGEAFEALNRDELTKAYAQIGEL 299 >gi|10334988|gb|AAD46685.2| TadG [Aggregatibacter actinomycetemcomitans] gi|26000721|gb|AAN75217.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 39/286 (13%), Positives = 90/286 (31%), Gaps = 37/286 (12%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 +L ++ K+ + +E ++II AL LL + F + K + A + A Sbjct: 6 NLTAKLFSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAA 65 Query: 62 L---------------AGASKMVSNLSRLGDRFESISNHAKRA-LIDDAKRFIKNHIKES 105 L + ++ + + + SN +A + ++ +K Sbjct: 66 LLLIAEDNQYRKNKDHSDVTRQRVSQQDIDRESKDFSNAKVQAQWKKRNQELVQGLVKLY 125 Query: 106 LSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 L + + ++ + + N T ++ + + +F L Sbjct: 126 LRSDDSNGQKNSSPVTIKEPFLAECLEEKTQPRNKNGTAK-SIACVVQGSVQRKFW--LP 182 Query: 166 NQRYNQKIVSFIPALLRIEMGERPI---------FLIELVVDLSGSMHCAMNSDPEDVNS 216 + + I GE + +V DLS SM+ A+ S + Sbjct: 183 WGQTLVSSSQLHDGRVGINSGETYAVKEKQITIPIDLMMVTDLSRSMNWAIVSHRDVEVP 242 Query: 217 APICQDKKRTKMAALKNALLLFLDSI---DLLSHVKEDVYMGLIGY 259 P ++ AL+ + D + + + +G + + Sbjct: 243 PPN------RRIDALREVVSNIQDILLPKAIRDDISPYNRIGFVSF 282 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 51/159 (32%), Gaps = 26/159 (16%) Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 K +W ++ + L T T + ++T K Sbjct: 369 KATTQAWFSKSKPGVADALKEIEPLGGTAVTSGIFIGTNLMTDTNK------DPEAAPNK 422 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKI----CDKAKE--------------NFIKIVT 366 + ++ ++ L+DGE+N N + C+K KE I V Sbjct: 423 LNTNTRRVLLILSDGEDNRPSKNTLVTFMNSGMCEKIKEKINSLQDSNYPQVEARIAFVA 482 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + N + K CV + +Y V + L+ F+ I Sbjct: 483 LGFNPPQDQLIAWKKCV--GKQYYPVNSKQGLLDAFKQI 519 >gi|85712923|ref|ZP_01043963.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] gi|85693229|gb|EAQ31187.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] Length = 328 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 35/229 (15%), Positives = 78/229 (34%), Gaps = 55/229 (24%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L VDLSGSM A + + ++ +K L F++ ++ Sbjct: 88 IMLAVDLSGSMEIADMTLDG----------RNVDRLEMVKAVLGDFIER-------RKGD 130 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + ++ V+Q + + L+ + T A+ A + ++ + Sbjct: 131 RLGLILFADTAFLQTPITYDRNTVQQMLDESVLGLVGERTAIGDAIALAVKRFKGKQQTN 190 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA- 371 + ++ LTDG+N + + + AK ++I I++ A Sbjct: 191 ------------------RVLVLLTDGQNTAGNLSPE--QALELAKAYDVRIYPIAVGAE 230 Query: 372 --------------SPNGQR--LLKTCVS-SPEYHYNVVNADSLIHVFQ 403 L++ + ++ + + L ++Q Sbjct: 231 EVVVDSVFGRRKVNPSRDLDVPLMQNLADETGGEYFRARSTEELERIYQ 279 >gi|261409634|ref|YP_003245875.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261286097|gb|ACX68068.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 968 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 73/228 (32%), Gaps = 48/228 (21%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L++D SGSM + + KM K A F +DL+ K Sbjct: 72 DVVLIIDKSGSMAPTYGPNNGED------------KMTNAKEAAKGF---VDLMDMTKHR 116 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V + + + + + + Y+ T + A+ A +L Sbjct: 117 V---AVVDFSSSASSFPFTVDKDAAKSYINTI---NSGGGTATGNAIDAAVALLADH--- 167 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN----TIKICDKAKENFIKIVTI 367 Q I+ +TDG N + ++ AK+ + TI Sbjct: 168 --------------RTEAQPVIVLMTDGAATESPKNTDPFDYALQRAQAAKDAGVIFYTI 213 Query: 368 SINASPNG------QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 ++ L+K ++ +H+ V+ + L ++ I + + Sbjct: 214 ALLNPNEDPITSAPNVLMKNMATTATHHHFVLGSKGLNQIYAAIVKEI 261 >gi|117618125|ref|YP_856000.1| hypothetical protein AHA_1462 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559532|gb|ABK36480.1| conserved hypothetical protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 460 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 13/108 (12%), Positives = 40/108 (37%), Gaps = 6/108 (5%) Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS---NV 348 T++ + +++L+ + + +K ++ +DGE+ ++ + Sbjct: 337 TNTAEGVMWGWRLLSPQWQGRWQQGAAELPRPYGQADNRKILVLFSDGEHMGPEAALRDR 396 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 + +C + K I++ T++ + C S Y + + Sbjct: 397 KQLLLCREMKRKGIQVYTVAFEGDAR---FVAQCASERSLAYKATSGN 441 >gi|320007113|gb|ADW01963.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 661 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 62/187 (33%), Gaps = 28/187 (14%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCA---MNSDPEDVNSAPICQDKKRTKMAA 230 V AL GER I ++ D SGSM ++M A Sbjct: 165 VGPETALTLFAPGERSGVDIVVLADCSGSMSIEDIPAAPSDGGRWFNRGRTSAGISRMNA 224 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ--YVTRDMDSLI 288 LK+AL +D+ V L+ + E WG E+V V R +++ Sbjct: 225 LKDALGAMIDARMRYDGV--GTRFALVRFDHDHEPMFPSRWGMEEVSDAHSVQRLREAVS 282 Query: 289 L-----KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 L TD A+ +A ++L ++ ++ ++DG + Sbjct: 283 LLNYRQSGTDIGKALHKAGELLHRYGVPGN----------------ERLVVLVSDGAHYA 326 Query: 344 FKSNVNT 350 + + Sbjct: 327 PIPDERS 333 >gi|164688691|ref|ZP_02212719.1| hypothetical protein CLOBAR_02337 [Clostridium bartlettii DSM 16795] gi|164602167|gb|EDQ95632.1| hypothetical protein CLOBAR_02337 [Clostridium bartlettii DSM 16795] Length = 1508 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 37/245 (15%), Positives = 84/245 (34%), Gaps = 51/245 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI--DLLSHVK 249 + LV+D+SGSM ++ D + ++ K++ F++ + + K Sbjct: 402 DVVLVIDVSGSMDWDVDG--------KQTTDNTKKRITIAKDSAKQFVNQLFANNEDGSK 453 Query: 250 EDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR--DMDSLILKPTDSTPAMKQAYQIL 305 + + ++ +++ N + + + + + TD AM A Q+L Sbjct: 454 SNNRVSVVIFSSSGYTNGILCSLKNVDNKQTVIDAIDGISNNPTGGTDYDNAMTMAEQVL 513 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN----- 360 + K + K ++F++DG N + I A + Sbjct: 514 ETVKDTTRN----------------KAVLFMSDGAPENGYNGKTGYDIYPDAFKAHEKSS 557 Query: 361 ------FIKIVTISI---------NASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQN 404 I T+S +++L+ S+ + N + + L + F N Sbjct: 558 EIKNNYGATIYTVSFGLKGSQYKELTEDRCRQILRDYMASNENCYKNANSKEDLENAFTN 617 Query: 405 ISQLM 409 I+ + Sbjct: 618 IATAI 622 >gi|119889916|ref|XP_001252289.1| PREDICTED: Epithelial chloride channel protein-like [Bos taurus] gi|297473018|ref|XP_002686328.1| PREDICTED: Epithelial chloride channel protein-like [Bos taurus] gi|296489229|gb|DAA31342.1| Epithelial chloride channel protein-like [Bos taurus] Length = 903 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/241 (17%), Positives = 81/241 (33%), Gaps = 59/241 (24%) Query: 177 IPALLRIEMGERPIFL--------IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + +P F + LV+D SGSM + Sbjct: 285 TSPMTEMNPPTQPTFSLLKSKQRVVCLVLDKSGSMSSEDRLFRMNQA------------- 331 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSL 287 A L + I+ S V G++ + + E + T + V + +T ++ Sbjct: 332 -----AELFLIQIIEKGSLV------GMVTFDSVAEIRNNLTKITDDNVYENITANLPQE 380 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +K +Q + ++ + + II LTDGE+N Sbjct: 381 ANGGTSICRGLKAGFQAIIQSQQSTSGSE----------------IILLTDGEDNE---- 420 Query: 348 VNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEY-HYNVVNADSLIHVFQNI 405 I C ++ K++ + I TI++ + L + + Y + + L + F I Sbjct: 421 ---IHSCIEEVKQSGVIIHTIAL-GPSAAKELETLSDMTGGHRFYANKDINGLTNAFSRI 476 Query: 406 S 406 S Sbjct: 477 S 477 >gi|327481077|gb|AEA84387.1| von Willebrand factor type A domain-containing protein [Pseudomonas stutzeri DSM 4166] Length = 339 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 41/242 (16%), Positives = 80/242 (33%), Gaps = 58/242 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM Q ++ T++ +K L F++ + Sbjct: 91 DLLLAVDVSGSMD----------YPDMQWQGEELTRLELVKVLLGDFIEQ-------RHG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + T A+ A + L Sbjct: 134 DRVGLILFGSKAYLQSPLTFDRRTVRVWLDEASVGIAGSNTAIGDAIGLALKRLRERPAN 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG NN + A E ++I TI I A Sbjct: 194 S------------------RVLVLVTDGANNGGEIEPLLAAT--LAAEENVRIHTIGIGA 233 Query: 372 -SPNGQRL---------------LKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 G L L+ + ++ +++ L Q I + + + Sbjct: 234 VPEEGGVLSRFGFNPGLDLDEPTLRAIAEQTGGEYFRAASSEQL----QAIGEALDRLEP 289 Query: 415 SV 416 +V Sbjct: 290 AV 291 >gi|119476361|ref|ZP_01616712.1| batB protein, putative [marine gamma proteobacterium HTCC2143] gi|119450225|gb|EAW31460.1| batB protein, putative [marine gamma proteobacterium HTCC2143] Length = 354 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 74/235 (31%), Gaps = 60/235 (25%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + ++ A+K L F+ ++ Sbjct: 91 DLLLAVDLSGSM----------KIEDMEVNGDRVPRIVAVKTVLNEFIQR-------RKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKK 310 +GLI + ++ ++ V++++ + T A+ + + L Sbjct: 134 DRLGLILFGSQAYVQAPLTFDQTTVQRFMREAQIGFAGEENTAIGDAIGLSVKRLRDRPG 193 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 +I LTDG+NN K N A N I I TI + Sbjct: 194 DRH------------------VMILLTDGQNNGGKINPIPAS--KIAANNGIIIYTIGVG 233 Query: 370 ---------------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +A + + L + ++ ++ N L +++ Sbjct: 234 ADEMVMPGVLGSSFGSRRVNPSADLDEKTLQQVATATGGQYFRARNPQELEKIYR 288 >gi|87306401|ref|ZP_01088548.1| hypothetical protein DSM3645_08717 [Blastopirellula marina DSM 3645] gi|87290580|gb|EAQ82467.1| hypothetical protein DSM3645_08717 [Blastopirellula marina DSM 3645] Length = 578 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 69/218 (31%), Gaps = 25/218 (11%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS-NLSRLG 76 + ++ A+ ++ + + + V ++ ++ + ++A LAGA ++ G Sbjct: 19 RRGVIVVLAAVLMIVMMGFMALSVDVGYMFTMQSQLQRSVDSAALAGAGTLIEGEDVATG 78 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 E ++++ + KE G +A N + + +T N Sbjct: 79 TVHEYLTHNP-----------VGLQWKEFTEGNTA--DNVDKFLTKYGDGLQLTIGEWND 125 Query: 137 LDSS---NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + V +Y+ F HLL + I I Sbjct: 126 TSGQVVAAEKNPTTVSVRMTYENMPFFFGHLLGRDSFDITAESIATYQSR--------DI 177 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 LV+DLSGSM+ + + A L Sbjct: 178 MLVLDLSGSMNDDSEFNSIGKLGFDHIYSNSQQMYADL 215 >gi|325917650|ref|ZP_08179844.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] gi|325536114|gb|EGD07916.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] Length = 335 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/238 (17%), Positives = 76/238 (31%), Gaps = 54/238 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM + ++ A K L FLD Sbjct: 96 REARQMMLAVDLSGSMS----------EPDMVLGGNVVDRLTAAKAVLSDFLDR------ 139 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GL+ + R + VR + + L + T A+ + + L Sbjct: 140 -RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVVGLAGRETAIGDAIALSVKRLRE 198 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ Q+ ++ LTDG N N +K + AK +++ TI Sbjct: 199 QKQG------------------QRVVVLLTDGVNTAGVLNP--LKAAELAKAEGVRVHTI 238 Query: 368 SINA-----------SPNGQR------LLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + G L K + + + + L ++ + +L Sbjct: 239 AFGGSGGYSLFGVPIPAGGNDDIDEAGLRKIAEQTGGRFFRARDTEELAGIYAELDRL 296 >gi|269962784|ref|ZP_06177125.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269832474|gb|EEZ86592.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 353 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 77/235 (32%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM +D + ++ +K L F++ ++ Sbjct: 117 DMMLVVDLSGSMQKEDMNDNGEY----------IDRLTTVKRVLSDFVEK-------RQG 159 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + + V Q + + + L+ + T + Sbjct: 160 DRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGI------------- 206 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + S Q+ +I L+DG N + + AK+ I T+ + A Sbjct: 207 -----GLGTKTFVDSDAPQRVMILLSDGSNTAGVLDPLEAA--EIAKKYNATIYTVGVGA 259 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + Q L K + ++ +A L ++ I+QL Sbjct: 260 GEMMVKDFFMTRKVNTASDLDEQTLTKIAEMTGGKYFRARDAKELETIYDTINQL 314 >gi|304407684|ref|ZP_07389335.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304343167|gb|EFM09010.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 966 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 84/227 (37%), Gaps = 48/227 (21%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L++D SGSM + N+ + KM K A F+D +DL H Sbjct: 71 DVVLIIDKSGSMAPSYNNGED--------------KMLNAKEAAKGFVDLMDLTKH---- 112 Query: 252 VYMGLIGYTT-RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + ++ +++ + N+ + + + Y+ T + A+ A +L + Sbjct: 113 -RVAIVDFSSSNMIGNLPFTTNPTEAKNYIDTI---NANGSTATGDAIDSAIALLANH-- 166 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK--AKENFIKIVTIS 368 Q I+ +TDG+ ++ AK+N I TI+ Sbjct: 167 ---------------RPEAQPVIVIMTDGDATQPSTDPYGYAKQKALLAKDNGIIFYTIA 211 Query: 369 IN------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + LLK ++ ++H+ V+ + L ++ I + + Sbjct: 212 LLKSTDDPVTSGPNILLKEMATTSDHHHFVLGSTGLSQIYAAIVKEI 258 >gi|294664114|ref|ZP_06729507.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292606114|gb|EFF49372.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 451 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 43/239 (17%), Positives = 78/239 (32%), Gaps = 56/239 (23%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM + K ++ A K L FLD Sbjct: 209 REARQMMLAVDLSGSMS----------EPDMVLGGKVVDRLTAAKAVLSDFLDR------ 252 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GL+ + R + VR + + L + T A+ + + L Sbjct: 253 -RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRLRE 311 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ Q+ ++ LTDG N N +K + AK +++ TI Sbjct: 312 QKQG------------------QRVVVLLTDGVNTAGVLNP--LKAAELAKAEGVRVHTI 351 Query: 368 SINASPNGQRL-----------------LKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 + G L L+ + + + + L ++ + +L Sbjct: 352 AFGG-SGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYAELDRL 409 >gi|294627092|ref|ZP_06705680.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|292598525|gb|EFF42674.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] Length = 451 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 43/239 (17%), Positives = 78/239 (32%), Gaps = 56/239 (23%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM + K ++ A K L FLD Sbjct: 209 REARQMMLAVDLSGSMS----------EPDMVLGGKVVDRLTAAKAVLSDFLDR------ 252 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GL+ + R + VR + + L + T A+ + + L Sbjct: 253 -RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRLRE 311 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ Q+ ++ LTDG N N +K + AK +++ TI Sbjct: 312 QKQG------------------QRVVVLLTDGVNTAGVLNP--LKAAELAKAEGVRVHTI 351 Query: 368 SINASPNGQRL-----------------LKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 + G L L+ + + + + L ++ + +L Sbjct: 352 AFGG-SGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYAELDRL 409 >gi|20089145|ref|NP_615220.1| hypothetical protein MA0247 [Methanosarcina acetivorans C2A] gi|19914014|gb|AAM03700.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length = 589 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 37/248 (14%), Positives = 74/248 (29%), Gaps = 54/248 (21%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + + P + +D SGSM S Sbjct: 57 GVNEETTVTIEVTGAGSTSTSAVP-MDVVFAIDSSGSMQSNDPSGLR------------- 102 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 K A F+D +D G++ + ++ ++ + V+ + Sbjct: 103 ------KTAAKSFVDKMDSSRDTA-----GVVSWDDSIDFSLPLTNDFPLVKTNIDSVDS 151 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 S T+ +++A IL ++ + IIFLTDG+ Sbjct: 152 S---GSTNLNVGLEEAIDILDANPRTENSVE---------------VIIFLTDGQGTYLH 193 Query: 346 SNVNTIKICDKAKENFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 S +A + I +I + Q + ++ +Y+ +A SL +F Sbjct: 194 STAQ------EAADKGYVIYSIGLGGVNPTPLQDMA---TTTGGAYYSSPDATSLQAIFD 244 Query: 404 NISQLMVH 411 +I + Sbjct: 245 DIFSEVTT 252 >gi|167034052|ref|YP_001669283.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166860540|gb|ABY98947.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 324 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 80/233 (34%), Gaps = 43/233 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+PI + L +D+S SM +D S +++A+K+ + F+ Sbjct: 86 EQPIRDLMLAIDISQSMEATDYTDANGAKSD---------RLSAVKSVVRDFIAR----- 131 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++D +GLI + T + + + + T A+ + L Sbjct: 132 --RKDDRIGLIVFGTGAYPQAPLTLDHASLLLLLDEVGIGMAGPNTALGDAIGLTIKALE 189 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 ++ +K +I LTDG N+ S + A+ N I + T Sbjct: 190 KTPEQ------------------EKVLILLTDG--NDTSSAITPDHAAHLAQANGIVVHT 229 Query: 367 ISINASPN------GQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I I L+ ++ + + +L V+ + +L H+ Sbjct: 230 IGIGDPQATGDAKVDLTTLQAIARTTGGQFFRADDRQALQQVYATLDRLTPHK 282 >gi|194367004|ref|YP_002029614.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] gi|194349808|gb|ACF52931.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] Length = 334 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 75/232 (32%), Gaps = 53/232 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L +D+SGSM + + ++ A K L FLD + Sbjct: 101 MMLAMDVSGSMG----------EGDMVLGGQAVDRLTAAKAVLADFLDR-------RAGD 143 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + R + VR + + L + T A+ A + L S + Sbjct: 144 RIGLLVFGDRAYTLTPLTADLASVRDQLRDSVVGLAGRETAIGDAIGLAVKRLRSQPEG- 202 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--- 369 Q+ +I LTDG +N + ++ + A+ ++I T++ Sbjct: 203 -----------------QRVLILLTDGVSNAGV--LEPLRAAEVARAEGVRIHTVAFGGD 243 Query: 370 -------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + L K + + + L ++ + +L Sbjct: 244 GSMRLFGIPISADQDPVDEATLKKIATMTGGQFFRARDTAQLAGIYAELDRL 295 >gi|197334600|ref|YP_002156233.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197316090|gb|ACH65537.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 356 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 47/246 (19%), Positives = 86/246 (34%), Gaps = 28/246 (11%) Query: 172 KIVSFIPALLRIEMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 I + + E R + +VVDLSGSM D S+ KK +++ Sbjct: 77 LIFALTKPTILGEPQTRESLGRDVMVVVDLSGSMA---EQDFVSKQSSDSGTVKKISRLE 133 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 A K L F+ + ++ +GLI + + + + + ++ Sbjct: 134 ATKEVLADFVKT-------RKGDRLGLILFGDAAFVQTPFTADQSVWLELLNQTDVAMAG 186 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 + T A+ A ++ + + P +K I LTDG N+ S V Sbjct: 187 QSTHLGDAIGLAIKVFEQSSEDKASAEENAK-------PREKVAIVLTDG--NDTGSYVE 237 Query: 350 TIKICDKAKENFIKIVTISINASPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVF 402 I A ++I I++ G++ L + S + +N D L + Sbjct: 238 PIDAAKVAAAKGVRIHMIAMGDPRTVGEQALDMNIINRVAKESGGKAFQAINRDELEQAY 297 Query: 403 QNISQL 408 I +L Sbjct: 298 DEIGEL 303 >gi|75675889|ref|YP_318310.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] gi|74420759|gb|ABA04958.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] Length = 605 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 59/157 (37%), Gaps = 21/157 (13%) Query: 273 TEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + +DS+ T+ + +Q L++ F K + +Q Sbjct: 455 VSSQSSTLKNQIDSMSPSGSTNQAIGLAWGWQTLSTTN------GPFPAPAKDKAYVYQD 508 Query: 332 FIIFLTDGENN-----------NFKSNVNTIKICDKAKENFIKIVTISINASPNG--QRL 378 +++ L+DG N + + +V +C K K++ I T+ +N ++ Sbjct: 509 YLVLLSDGLNTRNRWSGNGSDHSPEVDVRQALLCQKVKDSGTVIFTVQVNVGNRDPLSQV 568 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 L+ C S+ + +A+ FQNI + + + Sbjct: 569 LQDCASN-GNFQMITSANQTADAFQNILTQISQLRIA 604 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 43/255 (16%), Positives = 85/255 (33%), Gaps = 58/255 (22%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 R R + +AN + IFA++++ L +G + + ++SM++A ++A+L Sbjct: 8 KRIRNSISRFDRDIRANIAPIFAIALLPVLGFVGAAVDYTRANAARSSMQAAMDSAVL-- 65 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 MVS + + + D +R+ SL + F + Sbjct: 66 ---MVSRDAAANP------AMTSQQITDAVQRYFN-----SLYNDKSAFNVS-------- 103 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + S ++ + F+ + S Sbjct: 104 --------VSAAYTPSTSSAAAKILASGQGAIETDFM-KIAGFPQLSFGTSSTSTWG--- 151 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LV+D +GSM + KMAAL+ A +DS+ Sbjct: 152 ---NSRMRVALVLDNTGSM-------------------RDNGKMAALQRAAKDMIDSLSA 189 Query: 245 LSHVKEDVYMGLIGY 259 + +DVY+ +I + Sbjct: 190 FAKTADDVYISIIPF 204 >gi|313885991|ref|ZP_07819729.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] gi|312924521|gb|EFR35292.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] Length = 326 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/236 (14%), Positives = 66/236 (27%), Gaps = 63/236 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + L +DLSGSM D K + A ++ + Sbjct: 87 IDLVLAMDLSGSMQAL---------------DLKPNRFEAARDVASEMI----AARPNDN 127 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + ++Q + + T + A L Sbjct: 128 ---IGLVVFAGESFTLCPLTVDHNVIQQMLETTEIGQLEDGTAIGLGLATAINTL----- 179 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS-- 368 S K II LTDG NN + + A++ I+I T++ Sbjct: 180 -------------RGSDNKSKVIILLTDGSNNAGDITPSMAA--ELAQQYGIRIYTVAAG 224 Query: 369 -----------------INAS-PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 + A + L+ + +Y + L +++ I Sbjct: 225 TNGVAKFPVQTAFGTEYVEADVQIDEGTLRHIAEQTGGKYYRATDETKLHEIYKEI 280 >gi|256822867|ref|YP_003146830.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256796406|gb|ACV27062.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 986 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 46/273 (16%), Positives = 88/273 (32%), Gaps = 55/273 (20%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDV--NSAPICQDKK--RTKMAALKNALLLFLDS 241 G+ + LV+D+SGSM ++ + P C ++++ +K AL LFL Sbjct: 32 GQSQPVNLLLVLDVSGSMAWTTDACRLNRWGQPYPSCYPGNGEKSRLDIMKEALELFL-- 89 Query: 242 IDLLSHVKEDVYMGLIGY----TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + ++V +G++ Y + ++ +T T + A Sbjct: 90 ----DDLPDNVKVGILTYSAGNNIDLLHEVKQLSDNNHKATLLTTIDGLEANGGTLTAGA 145 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV--------- 348 + +A P I+FLTDG+ N+ N Sbjct: 146 LYEAGSYFRGQ-------YDNLPSPITPGCSNASNIVFLTDGQPNSMSYNGYSYRNSIIN 198 Query: 349 ---NTIKICDKAKE--------------------NFIKIVTISI-NASPNGQRLLKTCVS 384 ++ D KE + +K TI+ N + L+ Sbjct: 199 MTGSSCARSDDGKECSEKLAGFLSTVDQIEDLTPSKVKTHTIAFALEDNNARTFLENVAD 258 Query: 385 SP-EYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + Y + D L+ F++ Q + + V Sbjct: 259 AGNGQSYTADSTDGLVDAFKSSIQTDIEQSMMV 291 >gi|297581617|ref|ZP_06943539.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] gi|297534024|gb|EFH72863.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] Length = 467 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 60/470 (12%), Positives = 155/470 (32%), Gaps = 74/470 (15%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K + S+ AL ++ L F++ V+ + + S A+L + + Sbjct: 6 IKVTARVQHGVISVTAALMLLGMLTFFSFVLLVIVLSTTDSRL-SMLADAVLYSTTNSYN 64 Query: 71 NLSRLGDRFESISNH----AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + E+ + +D ++ S T+ + Sbjct: 65 AKADAQQMSEANTPQPNLGLSSLQVDTGNNENAAQVQVSGRVDRGSLALTDTLGTSDVL- 123 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYD-----YRLQFIEHLLNQ------RYNQKIVS 175 +TH A +++ + I +DV S ++ + + R K+VS Sbjct: 124 --VTHQAQSKIHQTTLEIVVMLDVSNSMKGEPMTQSIKGLRDFADILYAEERRDFSKVVS 181 Query: 176 FIPALLRIEMGERPIFLIELV---------VDLSGSMHCAMNSDPEDVNSAPICQ--DKK 224 +PA + +G RP F + ++ + +C ++ Sbjct: 182 IVPATGLVNIGHRPEFFSASAFAIPRDWRSLAKERGWKDLLHPEVPGRWRKAMCTALPEE 241 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKED-VYMGLIG-------------YTTRVEKNIE-- 268 + ++ ++ ++ ++L + ++M + +T N + Sbjct: 242 QDELTSVSALTPNWIRRLELSPPDQNLRLHMEWMSKPAIEHYENDMPLFTYYYSGNPKEK 301 Query: 269 --------------PSWGTEKV-------RQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 P G ++ R ++ T++ + A+++L+ Sbjct: 302 YSPNKHEQRGLFDSPDCGVSQIQPLLSTRRAFIKALDTLYPEFNTNNAEGVMWAWRLLSP 361 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN--NNFKSNVNTIKICDKAKENFIKIV 365 + + +K ++ TDG + + K + + +C + K+ I+I+ Sbjct: 362 HWRGYWDKGKSELPRDYQHPNNRKVMLLFTDGNHLVDVAKRDRKQVALCREMKKQGIEII 421 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +I N N +++K+C S +Y N ++ V + ++ + + + Sbjct: 422 SIDFN---NRSQVMKSCA-SAGQYYIADN-RTIRSVLKQVATTLSKIELT 466 >gi|70730104|ref|YP_259843.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344403|gb|AAY92009.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 358 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 40/188 (21%), Positives = 68/188 (36%), Gaps = 37/188 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM QD+ +++ +K+ L FL+ +E Sbjct: 91 DLLVAVDVSGSMD----------FPDMQWQDEDVSRLNLVKHLLGDFLEH-------REG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L + Sbjct: 134 DRVGLILFGSKAYLQAPLTFDRHTVRVWLDEAKIGIAGKNTAIGDAIGLALKRLRQRPAQ 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + +I +TDG NN + T A E +KI I I A Sbjct: 194 S------------------RVLILVTDGANNGGEIAPITAAR--LAAEEGVKIYPIGIGA 233 Query: 372 SPNGQRLL 379 P L Sbjct: 234 DPEQSATL 241 >gi|319902109|ref|YP_004161837.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417140|gb|ADV44251.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 327 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 43/276 (15%), Positives = 83/276 (30%), Gaps = 63/276 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y FI ++ +++ + E I L +D+S SM Sbjct: 51 YLLHAPFILRIVALILMIIVLARPQTTNSWQNSEIEGIDIMLAMDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K+ F++ +G+ + + Sbjct: 104 --------DLKPNRLEAAKDVATEFIN----GRPNDN---IGITLFAGESFTQCPLTVDH 148 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + I T + A L K +S K I Sbjct: 149 AVLLNLLKDMKCGFIEDGTAIGMGLANAVTRLKDSKAKS------------------KVI 190 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------- 372 I LTDG NN K +++ + + AK I++ TI + A Sbjct: 191 ILLTDGVNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVE 248 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ ++ + L V++ I +L Sbjct: 249 IDEKTLTQIAGTTEGNYFRATSNSKLKEVYEEIDKL 284 >gi|166033217|ref|ZP_02236046.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] gi|166027574|gb|EDR46331.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] Length = 1465 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 39/216 (18%), Positives = 68/216 (31%), Gaps = 32/216 (14%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 +S + + +I I LV+D+SGSM M S ++ ALK Sbjct: 105 LSALSSTAKITGQTTVPLDIVLVLDVSGSMDDPMGSGD------------NTKRIDALKA 152 Query: 234 ALLLFLDSIDLLSHVKEDV----YMGLIGY---------TTRVEKN---IEPSWGTEKVR 277 A+ F+D ++ + DV + ++ + + +N + + Sbjct: 153 AVNSFIDGSAKVNDQRADVNKQNRIAVVKFAGNKTDKIGNDQYSQNRYWYNYTQVVSGYK 212 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 Y + + PA A K + ++ +IF T Sbjct: 213 AYTSGNKSEWETTVNALKPAGCTAADYAMDLTKTLVDQSKTDANNNADRKNVKRVVIFFT 272 Query: 338 DGENNN----FKSNVNTIKICDKAKENFIKIVTISI 369 DGE N+ N K + I TI I Sbjct: 273 DGEPNHQSGFDDDVANDAITSAKTIKTDADIYTIGI 308 >gi|42524204|ref|NP_969584.1| hypothetical protein Bd2794 [Bdellovibrio bacteriovorus HD100] gi|39576412|emb|CAE80577.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100] Length = 336 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 49/244 (20%), Positives = 77/244 (31%), Gaps = 70/244 (28%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I + +D+S SM I K ++ A K + F+ S Sbjct: 88 IDIVICLDVSDSML--------------IEDMKPLNRLEAAKETIAKFI-SARTSD---- 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK---PTDSTPAMKQAYQILTS 307 +GL+ + + P+ + + Q V + K T AM A L Sbjct: 129 --RIGLVVFAGESFTMVPPTLDYQMILQRVNEISSASSAKIKDGTALGVAMANAAGRLKD 186 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + RS + +IF+TDGENN+ + T + AK IK+ +I Sbjct: 187 SQARS------------------RVMIFMTDGENNSGTIDPETGL--EIAKGYGIKVYSI 226 Query: 368 SI--NASPN---------GQ--------------RLLKTCVSS-PEYHYNVVNADSLIHV 401 I + GQ LL S +Y +L V Sbjct: 227 GIGKDGPTRIPVYSRDIFGQKVKTYQPFESTVNEDLLGRMASDTGGKYYRATTEGALQKV 286 Query: 402 FQNI 405 F +I Sbjct: 287 FSDI 290 >gi|197336748|ref|YP_002158568.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197314000|gb|ACH63449.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 350 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 54/254 (21%), Positives = 93/254 (36%), Gaps = 37/254 (14%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 LL+ + +V+ L R F + +VVDLSGSM + + + Sbjct: 72 LLSLSWIMILVALTKPTLLGPPQTREQFGRDVMVVVDLSGSMAEKDFTSIDGI------- 124 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 K +++ A+K L F + ++ +GLI + + E + Sbjct: 125 --KISRLDAVKKVLNDFAKT-------RKGDRLGLILFGDAAFVQTPFTADHEVWLDLLN 175 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + K T A+ + F N Q + + +K I LTDG Sbjct: 176 QTRVEMAGKSTHLGDAIGLTI--------KRFEENDNSQ--PLSTTSRKKVAIILTDG-- 223 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKT------CVSSPEYHYNVVN 394 N+ S V + AK I+I I+I G++ L +S + +N Sbjct: 224 NDTDSYVPPMDAAKVAKVKGIRIHMIAIGDPQTVGEQALDMDTINTIADASGGQAFQALN 283 Query: 395 ADSLIHVFQNISQL 408 D LI+ + IS+L Sbjct: 284 QDELINAYAEISKL 297 >gi|47230696|emb|CAF99889.1| unnamed protein product [Tetraodon nigroviridis] Length = 1031 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 62/169 (36%), Gaps = 31/169 (18%) Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKK 310 +G++ Y +V + S + V + V R T++ + A Sbjct: 154 QVGVLQYGEKVVHEFKLS-DYKSVEEVVKRARSINQRGGEETNTALGINVA--------- 203 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + F+ G + +K +I +TDGE+++ IK C+K + I I++ Sbjct: 204 ---CSQAFKHGGRRG---AKKVMIVITDGESHDSADLQQVIKDCEK---DGITRYAIAVL 254 Query: 371 A--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 +K S P+ + +NV + +L + + + + Sbjct: 255 GYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGERI 303 >gi|53802771|ref|YP_115472.1| batB protein [Methylococcus capsulatus str. Bath] gi|53756532|gb|AAU90823.1| putative batB protein [Methylococcus capsulatus str. Bath] Length = 328 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 64/182 (35%), Gaps = 37/182 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM +V++ ++ A+K F++ Sbjct: 91 DLMLAVDLSGSMDIEDFVVDGEVSN----------RLEAVKRVASAFIER-RSGD----- 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + ++ + V + + L T A+ A + L + Sbjct: 135 -RIGLILFGEQAYLQVPLTFDRKTVEKLLDEAAIGLAGDKTAIGDAIGLAIKRLRDNPAD 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ +I L+DG N + V ++ + A +KI TI + A Sbjct: 194 ------------------QRVLILLSDGANTAGQ--VQPLQAAELAAREGLKIYTIGVGA 233 Query: 372 SP 373 Sbjct: 234 DE 235 >gi|21244101|ref|NP_643683.1| hypothetical protein XAC3376 [Xanthomonas axonopodis pv. citri str. 306] gi|21109728|gb|AAM38219.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306] Length = 323 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 44/239 (18%), Positives = 78/239 (32%), Gaps = 56/239 (23%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM + K ++ A K L FLD Sbjct: 84 REARQMMLAVDLSGSMS----------EPDMVLGGKVVDRLTAAKAVLSDFLDR------ 127 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GL+ + R + VR + + L + T A+ + + L Sbjct: 128 -RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRLRE 186 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ Q+ ++ LTDG N N +K + AK +++ TI Sbjct: 187 QKQG------------------QRVVVLLTDGVNTAGVLNP--LKAAELAKAEGVRVHTI 226 Query: 368 SINASPNGQRL-----------------LKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 + G L L+ + + + D L ++ + +L Sbjct: 227 AFGG-SGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGGRFFRARDTDELAGIYAELDRL 284 >gi|319426861|gb|ADV54935.1| von Willebrand factor type A [Shewanella putrefaciens 200] Length = 339 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 72/235 (30%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + K + +++ + F++ ++ Sbjct: 85 DLMMAVDLSGSM----------QIEDMVVNGKTVDRFTLIQHVVSDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDKIDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAGNIEPEQAAQ--IAANRKVTIYTVGVGA 227 Query: 372 S-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + LK + ++ N+ L ++Q I +L Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQIYQEIDKL 282 >gi|120598362|ref|YP_962936.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|146293560|ref|YP_001183984.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|120558455|gb|ABM24382.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|145565250|gb|ABP76185.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 339 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 72/235 (30%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + K + +++ + F++ ++ Sbjct: 85 DLMMAVDLSGSM----------QIEDMVVNGKTVDRFTLIQHVVSDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDKIDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAGNIEPEQAAQ--IAANRKVTIYTVGVGA 227 Query: 372 S-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + LK + ++ N+ L ++Q I +L Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQIYQEIDKL 282 >gi|90418447|ref|ZP_01226359.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90338119|gb|EAS51770.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 636 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 21/106 (19%), Positives = 43/106 (40%), Gaps = 26/106 (24%) Query: 336 LTDGENNNFKS----------NVNTIKICDKAKENF--------IKIVTISINASPNG-- 375 + DG N K + N K+C+ K + I I TI+ + Sbjct: 530 IYDGTTGNAKDYSVDSYVAAMDQNVAKVCENVKADGRKPGGTDGILIFTIAFDLRDGEPV 589 Query: 376 QRLLKTCVS------SPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++L++ C S S + +Y+ + + L FQ+I++ + + + Sbjct: 590 KKLMEDCASNGLIDASEKLYYDAQSQEELAAAFQSITEQISSLRIA 635 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 42/371 (11%), Positives = 108/371 (29%), Gaps = 66/371 (17%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 + + + N +++F L++ L + + + GA Sbjct: 18 QIFTRIMRFRREKAGNVAVVFGLTLPVLALCFATAVDLSGIY----------------GA 61 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 ++ + + + + + + + LS S F+ + Sbjct: 62 NRSLQQAADVAALAAG-------------REYGRTQDADYLSSVSEAFFFHNAGDETR-- 106 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 T + + + + + V F + L+ + P + E+ Sbjct: 107 --GTTQFSYDGVFREDGLTI--LKVTARRQLPTFFGDALMWVTGGKLDWRQFPLYAKSEI 162 Query: 186 GERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + LV+D SGSM S ++K+ +K+A Sbjct: 163 VVQNRSIELALVLDNSGSMQDRPRSGG------------SKSKIDIIKDAAEDLAKQFLS 210 Query: 245 LSHVKED---VYMGLIGYTTRVE---KNIEPSWGTEKVRQYVTRD-------MDSLILKP 291 + V ++ +++ V + W + R + + + Sbjct: 211 SDKGSTEEFPVQFAVVPFSSSVNVGPQYKNADWMDTQGRSPIHHENLDWGGWLSGATSGG 270 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + Y +S + + + + P +F ++ +N +KS T Sbjct: 271 WEWIRDRGWVYTAPSSGAPMARYNGSYWTRITTGE-PLTRFYVY----DNARYKSQFGTW 325 Query: 352 KICDKAKENFI 362 + C +A+ N + Sbjct: 326 RGCVEARPNGL 336 >gi|313235286|emb|CBY10850.1| unnamed protein product [Oikopleura dioica] Length = 977 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 57/361 (15%), Positives = 117/361 (32%), Gaps = 67/361 (18%) Query: 49 KKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 K+ +++ + A A VS++ + LI D + + E+ Sbjct: 452 KRMRLQNGSTRTDKARALDFVSDVIFTKNFGARSQAPKVLVLISDGQDNHPERVLEAAKK 511 Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 + + I N S++++ M + I Y V +N+ Sbjct: 512 LHGQDVSVFVIGIGNESQMNIKMMNQIATEPITKHIKYANTVEG------------INKF 559 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 N + + + + +V D S S+ K Sbjct: 560 KNALTGQICEDVRSKQSCDSASMDMAIVFDGSDSV--------------------KADNF 599 Query: 229 AALKNALLLFLDSIDLLSHVKED-VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 LK F+D + V+E + L+ Y T + K E S +++++ + + Sbjct: 600 KKLKTWTGEFIDKL----GVQEYGAQVALVKYATSIIKVSELSSDVDELKEKLMKVP--F 653 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 I T++ A+++A Q+L K I+ +TDG+ Sbjct: 654 IQGKTNTGGALERAQQML-----------------AEGRPSVPKIILLITDGD---ATDK 693 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 +K K++ I I TI + + L + ++ Y + F +IS+ Sbjct: 694 ERLDAQIEKLKKSNILIYTIGV-GDLIDRNELNRIATDEDFVYETRD-------FDSISK 745 Query: 408 L 408 + Sbjct: 746 I 746 >gi|297620568|ref|YP_003708705.1| hypothetical protein wcw_0325 [Waddlia chondrophila WSU 86-1044] gi|297375869|gb|ADI37699.1| putative membrane protein [Waddlia chondrophila WSU 86-1044] Length = 374 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 74/235 (31%), Gaps = 31/235 (13%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN-ALLLFLDSIDLLSHVK 249 I LV+D SGSM + K TKM LK L + + Sbjct: 99 IAIYLVLDQSGSMSEEVKVFR-----------KTITKMDLLKEVTKGFVLGNKQEGLTGR 147 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTS 307 MGL+ + + + + + +++ L T A+ Y+ Sbjct: 148 PQDMMGLVTFARGAQVLAPLTLDHQAIIDQLSKLQYTTDLEQDGTAIGYAI---YKTANL 204 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-------ENNNFKSNVNTIKICDKAKEN 360 + K +I +TDG + NV + AK+ Sbjct: 205 IAATRHYAEELEGAGKPAYTIKNSIMILVTDGLQAPNPLDQGKEFRNVELLDAAVYAKKL 264 Query: 361 FIKIVTISINASPNGQR------LLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 +K+ I++ + L+K + Y V N+ +L ++ I QL Sbjct: 265 GVKVYIINVEPRIASEEFSAHRLLMKKITELTGGRFYMVDNSLNLSSIYSEIDQL 319 >gi|146282738|ref|YP_001172891.1| von Willebrand factor type A domain-containing protein [Pseudomonas stutzeri A1501] gi|145570943|gb|ABP80049.1| von Willebrand factor type A domain protein [Pseudomonas stutzeri A1501] Length = 339 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 76/235 (32%), Gaps = 58/235 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM Q ++ T++ +K L F++ + Sbjct: 91 DLLLAVDVSGSMD----------YPDMQWQGEELTRLELVKVLLGDFIEQ-------RHG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + T A+ A + L Sbjct: 134 DRVGLILFGSKAYLQAPLTFDRRTVRVWLDEARVGIAGSNTAIGDAIGLAVKRLRERPTN 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG NN + + A E ++I TI I A Sbjct: 194 S------------------RVLVLITDGANNGGELDPLLAAT--LAAEESVRIHTIGIGA 233 Query: 372 -SPNGQRL---------------LKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 G L L+ + ++ ++ L + I + Sbjct: 234 VPEEGGVLSRFGFNPGLDLDEPTLRAIAEQTGGEYFRAASSAEL----KAIGAAL 284 >gi|308050346|ref|YP_003913912.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] gi|307632536|gb|ADN76838.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] Length = 322 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 83/237 (35%), Gaps = 57/237 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM ++ + +++ L F++ ++ Sbjct: 85 DLMLAVDLSGSM----------QIEDMELGNRVVDRFTMVRHVLSDFIER-------RDG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + ++ V +++ + L+ + T A+ + ++ Sbjct: 128 DRLGLILFADQAYLQAPLTFDRFAVARFLDEAVLGLVGQQTAIGDAIALGVKRFNDLEQS 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ LTDGENN + + A+++ +K+ TI I + Sbjct: 188 S------------------RVLVLLTDGENNAGRFTP--AQAVSLARQSGVKLYTIGIGS 227 Query: 372 --------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ S+ ++ + + L ++Q + QL Sbjct: 228 AEIRRRGLLGTRTVNPSSDLDQAEKSFIQLSESTGGRYFRARSTEELESIYQELDQL 284 >gi|84622723|ref|YP_450095.1| hypothetical protein XOO_1066 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|84366663|dbj|BAE67821.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] Length = 335 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 79/238 (33%), Gaps = 54/238 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM+ + K ++ A K L FLD Sbjct: 96 REARQMMLAVDLSGSMN----------EPDMVLGGKVVDRLTAAKAVLSDFLDR------ 139 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GL+ + R + VR + + L + T A+ + + L Sbjct: 140 -RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRLRE 198 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ Q+ ++ LTDG N + +K + AK ++I TI Sbjct: 199 QKQG------------------QRVVVLLTDGVNTAGVLDP--LKAAELAKAEGVRIHTI 238 Query: 368 S-----------INASPNGQR------LLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + G L K + + + + L ++ + +L Sbjct: 239 AFGGGGGSSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYAELDRL 296 >gi|285019106|ref|YP_003376817.1| von willebrand factor, type a protein [Xanthomonas albilineans GPE PC73] gi|283474324|emb|CBA16825.1| putative von willebrand factor, type a protein [Xanthomonas albilineans] Length = 343 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 78/240 (32%), Gaps = 56/240 (23%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + L VDLSGSM SDP+ + ++ A K L FLD Sbjct: 102 QQARQLMLAVDLSGSM-----SDPDM-----RLGGRVVDRLTAAKAVLADFLDR------ 145 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GL+ + + + VR + + L + T A+ A + L Sbjct: 146 -RDGDRIGLLVFGQQAYALTPLTADLATVRDQLRDSVVGLAGRETALGDAIALAVKRL-- 202 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 ++ +I LTDG N + +K + AK +++ TI Sbjct: 203 ----------------REQPQGERVLILLTDGVNTAGVLDP--LKAAELAKAEHVRVYTI 244 Query: 368 SINASPNGQRL------------------LKTCVSS-PEYHYNVVNADSLIHVFQNISQL 408 ++ G L L+ + + L ++ + +L Sbjct: 245 ALGGDGGGMSLFGMPIPGSGGDDEVDEDTLRKIAQDTGGRFFRARDTAQLASIYAELDRL 304 >gi|332299342|ref|YP_004441263.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] gi|332176405|gb|AEE12095.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] Length = 326 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 35/236 (14%), Positives = 66/236 (27%), Gaps = 63/236 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + L +DLSGSM D K + A ++ + Sbjct: 87 IDLVLAMDLSGSMQAL---------------DLKPNRFEAARDVASEMI----AARPNDN 127 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + ++Q + + T + A L Sbjct: 128 ---IGLVVFAGESFTLCPLTVDHNVIQQMLETTEIGQLEDGTAIGLGLATAINTL----- 179 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS-- 368 S K II LTDG NN + + A++ I+I T++ Sbjct: 180 -------------RGSDNKSKVIILLTDGSNNAGDITPSMAA--ELAQQYGIRIYTVAAG 224 Query: 369 -----------------INAS-PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 + A + L+ + +Y + L +++ I Sbjct: 225 TNGVAKFPVQTAFGTEYVEADVQIDEGTLRHIAEQTGGKYYRATDETKLHEIYKEI 280 >gi|313674519|ref|YP_004052515.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312941217|gb|ADR20407.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 345 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 45/237 (18%), Positives = 69/237 (29%), Gaps = 62/237 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I LV+D+S SM QD ++ A K F+D + Sbjct: 105 IDIMLVLDISESM---------------KIQDFTPNRLEAAKQVANDFID-------GRF 142 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR-DMDSLILKPTDSTPAMKQAYQILTSDK 309 +GL ++ + + ++ +T D + T A+ Sbjct: 143 QDRIGLTIFSGEAYSLSPLTTDYKMLKNQITDIDFKMMEASGTAIGSALAVGTNR----- 197 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 S K +I L+DG+NN + T A IKI TI+I Sbjct: 198 -------------MRESDSKSKVLILLSDGDNNAGNIDPETSAKLANA--YGIKIYTIAI 242 Query: 370 NAS---PNGQRL---------------LKTCVS-SPEYHYNVVNADSLIHVFQNISQ 407 P G+ LK Y + +L VF I Q Sbjct: 243 GKEGKVPYGKDFFGRTRYIENSMDVTGLKNIAKIGEGQFYRATDNQALEEVFSIIDQ 299 >gi|139439379|ref|ZP_01772820.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] gi|133775158|gb|EBA38978.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] Length = 2432 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/302 (13%), Positives = 89/302 (29%), Gaps = 77/302 (25%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 I S + + I +V+D SGSM M + + Sbjct: 94 TTLSAISSTSDTTISGKP-----LDIVMVLDASGSMDDPMGTGD------------NTKR 136 Query: 228 MAALKNALLLFLDSI----DLLSHVKEDVYMGLIGYT---------TRVEKNIEPSWGTE 274 + ALK A F+D+I ++ + + ++ + ++ ++ Sbjct: 137 IDALKTAANTFIDAIAAQNQSITDASKQHRVAIVKFAGKKKTDKVGNDTYRDGRYTYNYS 196 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ-GVKIPSLPFQKFI 333 + + +T D+ + A + + + +K + Sbjct: 197 QTMKNLTSCKGKDADSLKDTVGNINPA-----GSTQADYGLELAENITINSGRADAKKIV 251 Query: 334 IFLTDGENNNFKSNVNTIKICDKA-----KENFIKIVTISI------NASPNGQ------ 376 +F TDG + ++ A K N I TI I +A P + Sbjct: 252 VFFTDGSPTSSSGFQASVADSAIASAKSLKANGADIYTIGIFSGANPSADPTAEGTSKVN 311 Query: 377 RLLKTCVS------------------------SPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + S + +Y+ + +A L +F+ IS ++ Sbjct: 312 KFMHAVSSNYPGATSSISFWGEWVIDYGTRAENSDYYKSATSASELEKIFEEISGSIIQT 371 Query: 413 KY 414 Y Sbjct: 372 GY 373 >gi|308270599|emb|CBX27211.1| hypothetical protein N47_A12400 [uncultured Desulfobacterium sp.] Length = 330 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 71/237 (29%), Gaps = 58/237 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D SGSM K T++ +K + F+ +E Sbjct: 87 DIMLCLDTSGSMQAL----------DFELDGKPVTRLTVVKKVVADFIKE-------RET 129 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + V + + T A+ A + L K + Sbjct: 130 DRIGLVVFGQEAFTQSPLTMDKGLLLSLVDKMEIGMAGDSTAIGNAIAVAGKRLKDLKAK 189 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 S K +I LTDG +N A IKI TI + Sbjct: 190 S------------------KIMIILTDGRSNTGDITPEEAAG--AAAALGIKIYTIGVGG 229 Query: 370 -------------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + L + ++ ++ L ++++ I++ Sbjct: 230 TGPAPFKVNTFFGPRIVNQSVDLDEKTLKEIAAIGKGKYFRATDSKELANIYEIINK 286 >gi|152993581|ref|YP_001359302.1| von Willebrand factor A [Sulfurovum sp. NBC37-1] gi|151425442|dbj|BAF72945.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 305 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 49/242 (20%), Positives = 89/242 (36%), Gaps = 40/242 (16%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + S I I + I LV+D S SM M DP+D K +K Sbjct: 66 LASPILTKNYINSKKEGR-DIVLVIDSSDSMR-QMGFDPKDPYK---------NKFDVVK 114 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + F+ +++ +G++ + ++ + + + K T Sbjct: 115 EVVADFIKK-------RKNDRIGMVTFADVAFIASPLTFEKDFLTNITEMQKLGMAGKRT 167 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 A+ QAY +++ K +S K II LTDG +N K ++ +K Sbjct: 168 AINDALVQAYNLMSKSKAKS------------------KIIILLTDGRDNMSKIPLSDVK 209 Query: 353 ICDKAKENFIKIVTISINAS-PNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMV 410 ++ +K+ TI I + LKT + Y +A L ++ I++L V Sbjct: 210 --HMIEKRDVKLYTIGIGGPRDYDAQYLKTLAKAGKGQAYAARSAAMLSKIYDEINKLEV 267 Query: 411 HR 412 + Sbjct: 268 TK 269 >gi|149188658|ref|ZP_01866950.1| hypothetical protein VSAK1_16267 [Vibrio shilonii AK1] gi|148837568|gb|EDL54513.1| hypothetical protein VSAK1_16267 [Vibrio shilonii AK1] Length = 346 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 79/238 (33%), Gaps = 40/238 (16%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 L + E+ + + VDLSGSM + +++ A+K+ L Sbjct: 85 TVLGEPQTREKLGRDVMVAVDLSGSMS---------EMDFSSSDGQAVSRLDAVKSVLHE 135 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F+ + +E +GLI + + + + + ++ + T A Sbjct: 136 FVAT-------REGDRLGLILFGDAAYLQTPFTADHDVWLALLDQTEVAMAGQSTHLGDA 188 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A + + S K ++ LTDG + I A Sbjct: 189 IGLAI-KVFEQSESSKDKE--------------KVVVVLTDGNDTGSFVEPKDAAIVAAA 233 Query: 358 KENFIKIVTISINASP--NGQRL----LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 K ++I I++ Q L + S S + ++ ++L ++ I +L Sbjct: 234 K--GVRIHVIAMGDPATIGEQALDMATIDNIASQSGGQAFQALDQEALQQAYRTIGEL 289 >gi|156742544|ref|YP_001432673.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233872|gb|ABU58655.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 562 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 70/214 (32%), Gaps = 44/214 (20%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + A+ I + + V+D+SGSM + ++ K AL Sbjct: 366 VLSAIRSIWEQNKKRVDVMAVLDVSGSM-------------------EDEGRLEQAKAAL 406 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 +F++ + GL ++ + S + + + R T Sbjct: 407 RIFVEQLQDDDG------FGLTIFSDQATVLTPISPIGSRRTEVLNRIAGLTPRGGTRLL 460 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + +AYQ LT+ P + ++ LTDG +N + + + Sbjct: 461 DTVVEAYQELTA----------------TPPGQRIRAVVVLTDGLDNRSQRSAEDVLDLL 504 Query: 356 KAKENF--IKIVTISINASPNGQRLLKTCVSSPE 387 + IK+ TI+ + LLK S+ Sbjct: 505 RQDREGYSIKVFTIAFGGDADVH-LLKEIASATG 537 >gi|295132198|ref|YP_003582874.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980213|gb|ADF50678.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 334 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 44/257 (17%), Positives = 78/257 (30%), Gaps = 66/257 (25%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + I + +D+S SM D + ++ A K A+ Sbjct: 77 PRTVDVSTRTNSTQGIDIVMAIDVSASMLAR---------------DLQPNRLEATK-AV 120 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDS 294 +GL+ Y+ + V + + + ++++ T Sbjct: 121 GEEFIKGRPSD------RIGLVLYSGESFTKTPITSDKSVVLRALEDVEFNNILESGTAI 174 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + L K S K II LTDG NN+ + Sbjct: 175 GSGLATSVNRLKDSKAES------------------KVIILLTDGVNNSGFIDPKVAS-- 214 Query: 355 DKAKENFIKIVTISI----NA------SPNGQ------------RLLKTCVS-SPEYHYN 391 + AKE IK+ TI + A + NG+ LLK + ++ Sbjct: 215 ELAKEFGIKVYTIGVGTNGMALTPVGIAANGRFQFGNRQVEIDEDLLKQIADETGGKYFR 274 Query: 392 VVNADSLIHVFQNISQL 408 N + L ++ I QL Sbjct: 275 ATNNEKLEDIYDEIDQL 291 >gi|94499146|ref|ZP_01305684.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] gi|94428778|gb|EAT13750.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] Length = 340 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 68/194 (35%), Gaps = 37/194 (19%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + ++ + L VD+S SM + Q + ++ +K + F+ Sbjct: 77 VGEPKALQQTDRNMMLAVDISKSM----------LEEDMQYQGRLVNRLQTVKAVVTDFV 126 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + ++ +GLI + + ++ V++ + + L T A+ Sbjct: 127 EE-------RKGDRLGLILFGEQAYIQTPLTFDLSTVKRLLDEAVVGLAGNKTAIGDAIG 179 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + L + + + +I LTDG+N + +K + A++ Sbjct: 180 LGVKRLQDLPESN------------------RVLILLTDGQNTA--GEIEPLKAAELAEK 219 Query: 360 NFIKIVTISINASP 373 +KI I I A Sbjct: 220 AGVKIYAIGIGADE 233 >gi|307548796|dbj|BAJ19118.1| TadG [Aggregatibacter actinomycetemcomitans] gi|307548811|dbj|BAJ19132.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 38/286 (13%), Positives = 90/286 (31%), Gaps = 37/286 (12%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 +L ++ K+ + +E ++II AL LL + F + K + A + A Sbjct: 6 NLTAKLFSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAA 65 Query: 62 L---------------AGASKMVSNLSRLGDRFESISNHAKRA-LIDDAKRFIKNHIKES 105 L + ++ + + + SN +A + ++ +K Sbjct: 66 LLLIAEDNQYRKNKDHSDVTRQRVSQQDIDRESKDFSNAKVQAQWKKRNQELVQGLVKLY 125 Query: 106 LSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 L + + ++ + + N T ++ + + +F L Sbjct: 126 LRSDDSNGQKNSSPVTIKEPFLAECLEEKTQPRNKNGTAK-SIACVVQGSVQRKFW--LP 182 Query: 166 NQRYNQKIVSFIPALLRIEMGERPI---------FLIELVVDLSGSMHCAMNSDPEDVNS 216 + + I G+ + +V DLS SM+ A+ S + Sbjct: 183 WGQTLVSSSQLHDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSRSMNWAIVSHRDVEVP 242 Query: 217 APICQDKKRTKMAALKNALLLFLDSI---DLLSHVKEDVYMGLIGY 259 P ++ AL+ + D + + + +G + + Sbjct: 243 PPN------RRIDALREVVSNIQDILLPKAIRDDISPYNRIGFVSF 282 Score = 44.9 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 51/159 (32%), Gaps = 26/159 (16%) Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 K +W ++ + L T T + ++T K Sbjct: 369 KATTQAWFSKSKPGVADALKEIEPLGGTAVTSGIFIGTNLMTDTNK------DPEAAPNK 422 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKI----CDKAKE--------------NFIKIVT 366 + ++ ++ L+DGE+N N + C+K KE I V Sbjct: 423 LNTNTRRVLLILSDGEDNRPSKNTLVTFMNSGMCEKIKEKINSLQDSNYPQVEARIAFVA 482 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + N + K CV + +Y V + L+ F+ I Sbjct: 483 LGFNPPQDQLIAWKKCV--GKQYYPVNSKQGLLDAFKQI 519 >gi|324991934|gb|EGC23857.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK405] gi|332363536|gb|EGJ41317.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1059] Length = 462 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 56/163 (34%), Gaps = 27/163 (16%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V D SGSM+ + + ++M L+ ++ + + + +V Sbjct: 198 IAVSFVFDTSGSMNWDLQGRNVEKTG-------NESRMDILRKKSVIMIKDLAEIGNVSV 250 Query: 251 DVYMGLIGYTTRVEKNIE-PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ +G +++N GT + +T+ + T+ ++ L S Sbjct: 251 NL-VGFSTSAKYIQQNFSNLDNGTNTIIATITKRENLNPDGVTNPGDGLRYGMISLQSQP 309 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + K+I+ LTDG N + + + Sbjct: 310 AQL------------------KYIVLLTDGIPNAYLVDSRALY 334 >gi|84386025|ref|ZP_00989055.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] gi|84379341|gb|EAP96194.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] Length = 345 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 41/230 (17%), Positives = 75/230 (32%), Gaps = 43/230 (18%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 ++ I + +DLSGSM + K ++ K L F Sbjct: 89 QKKSAREIMVALDLSGSMS---------EEDFADKKGNKHDRLTIAKQVLREFAAQ---- 135 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +E +GLI + + + ++ + L K A+ Sbjct: 136 ---REHDRLGLILFADSAYVQAPFTEDINVWQS----LLEDVELG----YAGFKTAFGDA 184 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 F RQ + +I LTDG++ + K +K + A + +KI Sbjct: 185 IGLSIAVFEQEQSRQ----------RVMILLTDGDDTSSKMPP--VKAAEIAAKYGVKIY 232 Query: 366 TISINASP-NGQ------RLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 TI+I G+ L K ++ ++ ++ L + I QL Sbjct: 233 TIAIGDPSTKGRYKMDLPTLEKVSAATGGQMFHAMDRKQLDQAYATIDQL 282 >gi|85374478|ref|YP_458540.1| hypothetical protein ELI_08255 [Erythrobacter litoralis HTCC2594] gi|84787561|gb|ABC63743.1| hypothetical protein ELI_08255 [Erythrobacter litoralis HTCC2594] Length = 626 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 51/178 (28%), Gaps = 39/178 (21%) Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 G+ + +V+R T M A ++++ D + G P Sbjct: 454 LLNGSARNADFVSRINALSPKGGTMHDIGMIWAGRLISPDGIFAADNASAPNG-----DP 508 Query: 329 FQKFIIFLTDGENNNFKSNV--------------------------------NTIKICDK 356 + +IF+TDGE SN IC Sbjct: 509 ISRHVIFMTDGEMGASPSNTTAYGNYDMDGRMAGFAASGSWTENQLAAIHNLRLEAICKA 568 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + + I +I+ + + C + N+ L F++I+ + + Sbjct: 569 IRNKNVTIWSIAFGLPHSAYT--QGCATGTSRALTAANSSELDSRFRDIAGSIAELRL 624 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 72/239 (30%), Gaps = 54/239 (22%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 +I A + + ++G + + K+ ++ A +AA LA ++ + G + Sbjct: 2 MAIGAASIIP-LVGVVGGGVDASRMYLAKSRLQQACDAATLAARKELAGSSISNGTIPAN 60 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 I + A F N+ + + +S Sbjct: 61 IQDKADNFFD---TNFPSG--------------MYGTTNVGYTLSAGTATQMDGAATASV 103 Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 T + + D + L + P + LV+D+SG Sbjct: 104 PTTLMKVFNVPQIDIAVNCSAEL----------------------DLPNIDVVLVLDMSG 141 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 SM+ + ++ ALKNA+ F D + V +G++ Y Sbjct: 142 SMNSNGTTGS--------------KRITALKNAVFSFYDVVMAAKPAGTRVRIGIVPYN 186 >gi|327463764|gb|EGF10080.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1057] Length = 462 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 56/163 (34%), Gaps = 27/163 (16%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V D SGSM+ + + ++M L+ ++ + + + +V Sbjct: 198 IAVSFVFDTSGSMNWDLQGRNVEKTG-------NESRMDILRKKSVIMIKDLAEIGNVSV 250 Query: 251 DVYMGLIGYTTRVEKNIE-PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ +G +++N GT + +T+ + T+ ++ L S Sbjct: 251 NL-VGFSTSAKYIQQNFSNLDNGTNTIIATITKPENLNPDGVTNPGDGLRYGMISLQSQP 309 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + K+I+ LTDG N + + + Sbjct: 310 AQL------------------KYIVLLTDGIPNAYLVDSRALY 334 >gi|116625272|ref|YP_827428.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228434|gb|ABJ87143.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 323 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 46/273 (16%), Positives = 79/273 (28%), Gaps = 64/273 (23%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 L L + + P + E + +V D SGSM Sbjct: 62 PLNRFVTGLEKENFKIFEDKSPQEISQFSSEDAPLSVGVVFDCSGSMG------------ 109 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 K+ + A+ F L + E L+ + + E++ Sbjct: 110 ---------QKLDKSRQAVSQFF---KLANPEDEFF---LVQFNDSASLIQPFTRNLEEI 154 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + ++ T + Y L KK +K ++ + Sbjct: 155 QNHLAFTQSKGR------TALLDAVYLALHEMKKAKN---------------PRKALLLI 193 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ----------RLLKTCVS-S 385 +DG +N+ + IK + KE ++I I I S G+ LL + Sbjct: 194 SDGGDNSSRYTEPEIK--NLVKEADVQIYAIGIYESAAGRGRTPEESSGPALLTEIAEQT 251 Query: 386 PEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 Y V N + L V I V + IL Sbjct: 252 GGRQYQVDNLNELPDVAAKIG---VELRNQYIL 281 >gi|109088926|ref|XP_001104056.1| PREDICTED: anthrax toxin receptor-like isoform 1 [Macaca mulatta] Length = 557 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 73/221 (33%), Gaps = 48/221 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + F + ++D SGS++ L + + Sbjct: 70 QGSFDLYFILDKSGSVNNNWI---------------------DLYMWVEETVARFQSSD- 107 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + M I Y+T + + + +++ + + + T ++A Q + Sbjct: 108 ----IRMCFITYSTDGQTVLPLTSDKNRIKNGLDQLRKIVPDGHTFMQAGFRKAIQQI-- 161 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 F G K+PS+ II +TDGE + +T++ KA++ + T+ Sbjct: 162 --------ETFNSGNKVPSM-----IIAMTDGE-LVAHAFQDTLREAQKARKLGANVYTV 207 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVN----ADSLIHVFQN 404 + Q + SPE+ + V N + + Sbjct: 208 GVADYKLDQ--ITAIADSPEHVFAVENGFKAMRDTVDALTS 246 >gi|109088928|ref|XP_001104141.1| PREDICTED: anthrax toxin receptor-like isoform 2 [Macaca mulatta] Length = 564 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 73/221 (33%), Gaps = 48/221 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + F + ++D SGS++ L + + Sbjct: 70 QGSFDLYFILDKSGSVNNNWI---------------------DLYMWVEETVARFQSSD- 107 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + M I Y+T + + + +++ + + + T ++A Q + Sbjct: 108 ----IRMCFITYSTDGQTVLPLTSDKNRIKNGLDQLRKIVPDGHTFMQAGFRKAIQQI-- 161 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 F G K+PS+ II +TDGE + +T++ KA++ + T+ Sbjct: 162 --------ETFNSGNKVPSM-----IIAMTDGE-LVAHAFQDTLREAQKARKLGANVYTV 207 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVN----ADSLIHVFQN 404 + Q + SPE+ + V N + + Sbjct: 208 GVADYKLDQ--ITAIADSPEHVFAVENGFKAMRDTVDALTS 246 >gi|305665951|ref|YP_003862238.1| BatA protein [Maribacter sp. HTCC2170] gi|88710726|gb|EAR02958.1| batA protein [Maribacter sp. HTCC2170] Length = 332 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 48/329 (14%), Positives = 95/329 (28%), Gaps = 66/329 (20%) Query: 104 ESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH 163 E++S + F + + + ++ ++ + L F+ Sbjct: 3 ENISFANPEFLWLLLLLPLAILWYVLKRREQTASLKISSLKGFSKSSILPKIKPLLFVFR 62 Query: 164 LLN-QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 +L + + I + +D+S SM D Sbjct: 63 ILALASIIVAMARPQTEDISTRTKTTKGIDIVMAIDVSSSMLAR---------------D 107 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 K +++ALK F+ + + +GL+ Y + V + Sbjct: 108 LKPNRLSALKEVAADFIRQ-------RPNDRIGLVAYAGEAFTKTPITSDKSIVLNSLRE 160 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T + + L S K II LTDG NN Sbjct: 161 ITYGQLNDGTAIGMGLATSVNRL------------------KESKAISKIIILLTDGVNN 202 Query: 343 NFKSNVNTIKICDKAKENFIKIVT------------ISINASPN----------GQRLLK 380 + T D A E IK T I+ NA + ++LL+ Sbjct: 203 SGFIEPQTAA--DLAVEYGIKSYTIGLGTNGNALSPIAYNADGSYRYGMRQVEIDEKLLE 260 Query: 381 TCV-SSPEYHYNVVNADSLIHVFQNISQL 408 ++ ++ + + L ++ I++L Sbjct: 261 GIAETTGGKYFRATDNEKLEAIYDEINKL 289 >gi|16124454|ref|NP_419018.1| hypothetical protein CC_0199 [Caulobacter crescentus CB15] gi|221233138|ref|YP_002515574.1| hypothetical protein CCNA_00199 [Caulobacter crescentus NA1000] gi|13421322|gb|AAK22186.1| hypothetical protein CC_0199 [Caulobacter crescentus CB15] gi|220962310|gb|ACL93666.1| hypothetical protein CCNA_00199 [Caulobacter crescentus NA1000] Length = 626 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 49/173 (28%), Gaps = 35/173 (20%) Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL-PFQKFIIF 335 + + ++ L + + + A+ + Q K +I Sbjct: 454 KTALKAQINGLTVGG-ATAGQIGLAWGWYMVAPNFGYLWPNASQRPAAYKARDLMKVVIL 512 Query: 336 LTDG-----------------------------ENNNFKSNVNTIKICDKAKE--NFIKI 364 +TDG + N S ++CD K N I + Sbjct: 513 MTDGGFNMTYCNSVVARNIGSGTNIGDDERINCDATNGSSFDQAAELCDSIKASANDITL 572 Query: 365 VTISINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 T+ + L C SS + Y L FQ I+Q + + + + Sbjct: 573 YTVGFTVGNDQTARNFLTNCASSTDKAYFPATGSELKASFQAIAQEISNLRIA 625 >gi|312200955|ref|YP_004021016.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311232291|gb|ADP85146.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 618 Score = 54.1 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 76/216 (35%), Gaps = 37/216 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V D SGSM P+ +T++ A + +D +G Sbjct: 422 VYDTSGSMD------------LPVANSGGKTRLQIAVGAADA------AIPLFAKDSRLG 463 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L ++T ++ GT+ R+ V + + + A+ A L + + Sbjct: 464 LWQFSTNLD-------GTKPYRELVPVGLMNDEVGTGTREEALVAAVNGLKAKGGTGLYA 516 Query: 316 NFFR-----QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-----ENFIKIV 365 P P Q ++ LTDG+N++ S++ ++ K + + I+ Sbjct: 517 TALAAFESLSAQYQPDKPNQ--VVLLTDGQNDDPTSSMTLTQLIATLKAEYNPKAPVHII 574 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 TI A + L + ++ Y + +S+ V Sbjct: 575 TIGYGADADMDALRQISAATGSKTYPAQDPNSIFQV 610 >gi|329922584|ref|ZP_08278159.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328942128|gb|EGG38410.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 421 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 42/234 (17%), Positives = 88/234 (37%), Gaps = 52/234 (22%) Query: 182 RIEMGE-RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 ++ G +P+ I LV+D SGSM D + + A K Sbjct: 103 QLNPGSAKPVKDIVLVIDNSGSM---------------NETDPNQDRYTAAK-------- 139 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLIL--KPTDSTPA 297 +L++ + D + +I + + ++ + + ++D L TD + A Sbjct: 140 --NLINRMDRDNRVSVIMFDHATTLLQPFTRVNNQETKDEIIAEIDGLATTDGGTDISLA 197 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 ++ + + +I L+DG + + + Sbjct: 198 LE----------------DTMSHIQESRDAGRSAMVIMLSDG-FSETDHDRVLAEY---- 236 Query: 358 KENFIKIVTISIN-ASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 K+ I + TI ++ +P+G +LL+T + + +Y+V +A+ L VFQ I + Sbjct: 237 KQQQIAVNTIGLSLVNPDGAQLLQTIAAETGGQYYDVQHAEDLSFVFQKIYDDV 290 >gi|332365023|gb|EGJ42788.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK355] Length = 462 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 56/163 (34%), Gaps = 27/163 (16%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V D SGSM+ + + ++M L+ ++ + + + +V Sbjct: 198 IAVSFVFDTSGSMNWDLQGRNVEKTG-------NESRMDILRKKSVIMIKDLAEIGNVSV 250 Query: 251 DVYMGLIGYTTRVEKNIE-PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ +G +++N GT + +T+ + T+ ++ L S Sbjct: 251 NL-VGFSTSAKYIQQNFSNLDNGTNTIIATITKRENLNPDGVTNPGDGLRYGMISLQSQP 309 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + K+I+ LTDG N + + + Sbjct: 310 AQL------------------KYIVLLTDGIPNAYLVDSRALY 334 >gi|161086980|ref|NP_631887.2| chloride channel calcium activated 4 [Mus musculus] Length = 1044 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 71/212 (33%), Gaps = 50/212 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM T++ + A L+L I ++++ + Sbjct: 311 LVLDKSGSMR----------------LGSPITRLTLMNQAAELYLIQI-----IEKESLV 349 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQY--VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 GL+ + + Y ++ + T +K+ ++ +TS + + Sbjct: 350 GLVTFDSTATIQTNLI-RIINDSSYLAISTKLPQYPNGGTSICNGLKKGFEAITSSDQST 408 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINA 371 + I+ LTDGE+N S C + K + I TI++ Sbjct: 409 SGSE----------------IVLLTDGEDNRISS-------CFQEVKHSGAIIHTIAL-G 444 Query: 372 SPNGQRLLKTCVSSPE-YHYNVVNADSLIHVF 402 + L + Y + + LI F Sbjct: 445 PSAARELETLSDMTGGLRFYAKEDVNGLIDAF 476 >gi|217972770|ref|YP_002357521.1| von Willebrand factor type A [Shewanella baltica OS223] gi|217497905|gb|ACK46098.1| von Willebrand factor type A [Shewanella baltica OS223] Length = 340 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 72/235 (30%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + K + +++ + F++ ++ Sbjct: 85 DLMMAVDLSGSM----------QIEDMVVNGKTVDRFTLIQHVVSDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAGNIEPEQAAQ--IAANRKVTIYTVGVGA 227 Query: 372 S-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + LK + ++ N+ L ++Q I +L Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQIYQEIDKL 282 >gi|172039857|ref|YP_001799571.1| hypothetical protein cur_0177 [Corynebacterium urealyticum DSM 7109] gi|171851161|emb|CAQ04137.1| hypothetical protein cu0177 [Corynebacterium urealyticum DSM 7109] Length = 675 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 44/232 (18%), Positives = 80/232 (34%), Gaps = 65/232 (28%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 L++D SGSM +T++ A K A F S+ E+ Sbjct: 76 MLILDASGSMMARDAGG--------------QTRLDAAKEASKNFSRSV------SEESE 115 Query: 254 MGLIGYTTRV------------EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +G + Y T+V + G + T PA+KQA Sbjct: 116 LGFMVYGTKVGNSPEEREAGCKDVTTLLPVGKGNAGKISGEVDKVNASGHTPMGPALKQA 175 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK--- 358 + L ++ +RS I+ ++DGE+ +CD AK Sbjct: 176 AKELPNEGERS--------------------IVLVSDGEDTCAPP-----PVCDVAKDLH 210 Query: 359 ENFI--KIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + I I T+ P ++ L+ C++ + + +A+SL + ++ Sbjct: 211 KQGIDLTINTVGFLVDPAARKELQ-CIAEAGGGEYLDAQDAESLAESMKVLA 261 >gi|237800421|ref|ZP_04588882.1| von Willebrand factor, type A [Pseudomonas syringae pv. oryzae str. 1_6] gi|331023280|gb|EGI03337.1| von Willebrand factor, type A [Pseudomonas syringae pv. oryzae str. 1_6] Length = 352 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 71/196 (36%), Gaps = 37/196 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM ++ + +++ ++ L FL+S ++ Sbjct: 91 DLLVAVDVSGSMD----------YPDMQWKNDEVSRLVLVQQLLGDFLES-------RKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ + VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGTQAFLQAPLTYDRQTVRVWLDEAKIGIAGKNTAVGDAIGLALKRLRMRPAN 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ +TDG NN + + T A + +KI TI I + Sbjct: 194 S------------------RVLVLVTDGANNAGQIDPLTAAR--LAADEGVKIYTIGIGS 233 Query: 372 SPNGQRLLKTCVSSPE 387 P L S Sbjct: 234 DPEKNALQSALGLSAS 249 >gi|114762302|ref|ZP_01441760.1| von Willebrand factor type A domain protein [Pelagibaca bermudensis HTCC2601] gi|114544920|gb|EAU47924.1| von Willebrand factor type A domain protein [Roseovarius sp. HTCC2601] Length = 335 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/229 (15%), Positives = 79/229 (34%), Gaps = 43/229 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E + + +D+SGSM D + ++ +++ ++ + F+ Sbjct: 90 ENAARDVVMAIDISGSM---------DARDFATPEGERIQRLSGVREVVRAFVS------ 134 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E M LI + T + E + + R + T A+ + + Sbjct: 135 -GREGDRMALIVFGTSAYLQAPLTDDLETIIALLDRTEVGMAGPHTALGDAIGLSIRTFE 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + Q+ +I L+DG ++ S ++ + + A + ++I T Sbjct: 194 TSEID------------------QRLLILLSDG--SDTASRMSPVNAAEIAADRGVEIYT 233 Query: 367 ISINASPN------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 I + LK + ++ +A SL V+ I +L Sbjct: 234 IGVGDPDATGENRVDLTTLKEVAQRTGGQYFFAEDAASLEAVYDRIDEL 282 >gi|254505681|ref|ZP_05117827.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] gi|219551334|gb|EED28313.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] Length = 322 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 72/232 (31%), Gaps = 55/232 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+DLS SM D D ++ A+K + F +E Sbjct: 86 DLMLVLDLSYSMSKEDMLDDGDYVD----------RLTAVKKVVSDFASK-------REG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + + V + + LI + T + A + Sbjct: 129 DRLGLVLFADHAYLQTPLTLDRKTIAEQVNQLVLRLIGEKTAIGEGIGLATKTFVD---- 184 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S Q+ ++ L+DG N + + AK+ I TI + A Sbjct: 185 --------------SDAPQRVMVLLSDGSNTSGVLDPLEAA--KIAKKYNATIYTIGVGA 228 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + L+ + ++ +A L ++ I Sbjct: 229 GEMVVKEFFMTRKVNTAQDLDERTLMDIAQVTGGQYFRARDAKELATIYDTI 280 >gi|329954838|ref|ZP_08295855.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] gi|328526942|gb|EGF53953.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] Length = 327 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 44/276 (15%), Positives = 80/276 (28%), Gaps = 63/276 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y F ++ ++S + E I L +D+S SM Sbjct: 51 YLLHAPFALRIIALALIIIVLSRPQTTDSWQNSEIEGIDIMLAIDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K+ F++ G+ + + Sbjct: 104 --------DLKPNRLEAAKDVAAEFIN----GRPNDNV---GITLFAGESFTQCPLTVDH 148 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + LI T + A L K +S K I Sbjct: 149 AVLLNLIKDVKCGLIEDGTAVGMGIANAVTRLKDSKAKS------------------KVI 190 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------- 372 I LTDG NN + + T + AK I++ TI + A Sbjct: 191 ILLTDGTNNRGEISPLTAA--EIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVE 248 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ ++ + L V++ I +L Sbjct: 249 IDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDKL 284 >gi|327490425|gb|EGF22209.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1058] Length = 462 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 22/163 (13%), Positives = 55/163 (33%), Gaps = 27/163 (16%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V D SGSM+ + + ++M L+ ++ + + + ++ Sbjct: 198 IAVSFVFDTSGSMNWDLQGRETKKSG-------NESRMDILRKKSVIMIKDLAEIGNISV 250 Query: 251 DVYMGLIGYTTRVEKNIE-PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ +G +++N GT + + + + T+ ++ L S Sbjct: 251 NL-VGFSTSAKYIQQNFSNLDNGTNTIIATINKRENLNPDGVTNPGDGLRYGMISLQSQP 309 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + K+I+ LTDG N + + + Sbjct: 310 AQL------------------KYIVLLTDGIPNAYLVDSRALY 334 >gi|311695164|gb|ADP98037.1| von Willebrand factor type A domain protein [marine bacterium HP15] Length = 342 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 80/235 (34%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVD+S SM + Q + ++ A+K L F+ ++ Sbjct: 90 DLMLVVDISPSMD----------EQDMVLQGRSINRLQAVKRVLDDFISR-------RQG 132 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T ++ E VR + + + T A+ A + L + + Sbjct: 133 DRLGLILFGTEPYVQAPLTFDLETVRTLMREAGLGMAGRATAIGDAVGLATKRLRNRPQD 192 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ ++ LTDG N + + + A I++ TI I A Sbjct: 193 ------------------QRVVVLLTDGANTAGEITPDKAT--EIAAAASIRLYTIGIGA 232 Query: 372 SPNGQR------------------LLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 QR L + + ++ + L ++++I +L Sbjct: 233 ESMVQRGLLGSRRVNPSRDLDENLLTRMAQQTGGEYFRARSLPELELIYESIDRL 287 >gi|315649632|ref|ZP_07902717.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275105|gb|EFU38480.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 421 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 83/234 (35%), Gaps = 52/234 (22%) Query: 182 RIEMGE-RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 ++ G +P+ + LV+D SGSM D + + A K Sbjct: 103 QLNPGSAKPVKDVVLVIDNSGSM---------------KDTDPNQDRYTAAK-------- 139 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLIL--KPTDSTPA 297 +L++ + D + +I + + +++ + ++D L TD + A Sbjct: 140 --NLINRMDRDNRVSVIVFDHATTLLQPFTRVKNQEIKDEIMAEIDGLATTDGGTDISLA 197 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 ++ + + +I L+DG + + Sbjct: 198 LE----------------DTMSHIQESQDAGRSAMVIMLSDG-FSETDHDRVLADY---- 236 Query: 358 KENFIKIVTISIN-ASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 K+ I + TI ++ +G LL+T + + +Y+V NA L VFQ I + Sbjct: 237 KQQQIAVNTIGLSLVYKDGANLLQTIAAETGGQYYDVQNAADLSFVFQKIYDDV 290 >gi|190575666|ref|YP_001973511.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] gi|190013588|emb|CAQ47223.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] Length = 334 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 79/232 (34%), Gaps = 53/232 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L +D+SGSM + + ++ A K L FLD + Sbjct: 101 MMLAMDVSGSMG----------EGDMVLGGQAVDRLTAAKAVLADFLDR-------RAGD 143 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + R + VR + + L + T A+ A + L S + Sbjct: 144 RIGLLIFGDRAYTLTPLTADLASVRDQLRDSVVGLAGRETAIGDAIGLAVKRLRSQPEG- 202 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--- 369 Q+ +I LTDG +N + ++ + A+ ++I T++ Sbjct: 203 -----------------QRVLILLTDGVSNAGV--LEPLRAAEVAQAEGVRIHTVAFGGD 243 Query: 370 ----------NASPN--GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 +A + + LK S + + + L ++ + +L Sbjct: 244 GSMRFLGIPISADQDPVDEATLKKIASLTGGQFFRARDTAQLAGIYAELDRL 295 >gi|59712029|ref|YP_204805.1| von Willebrand factor type A domain-containing protein [Vibrio fischeri ES114] gi|59480130|gb|AAW85917.1| von Willebrand factor type A domain protein [Vibrio fischeri ES114] Length = 356 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 47/246 (19%), Positives = 87/246 (35%), Gaps = 28/246 (11%) Query: 172 KIVSFIPALLRIEMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 I + + E R + +VVDLSGSM D S+ + KK +++ Sbjct: 77 LIFALTKPTILGEPQTRESLGRDVMVVVDLSGSMAEQ---DFVSKQSSDLGAVKKISRLE 133 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 A K L F+ + ++ +GLI + + + + + ++ Sbjct: 134 ATKEVLADFVKT-------RKGDRLGLILFGDAAFVQTPFTADQSVWLELLNQTDVAMAG 186 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 + T A+ A ++ + + P +K I LTDG N+ S V Sbjct: 187 QSTHLGDAIGLAIKVFEQSSEDKASAEENAK-------PREKVAIVLTDG--NDTGSYVE 237 Query: 350 TIKICDKAKENFIKIVTISINASPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVF 402 I A ++I I++ G++ L + S + +N D L + Sbjct: 238 PIDAAKVAAAKDVRIHMIAMGDPRTVGEQALDMNIINRVAKESGGKAFQAINRDELEQAY 297 Query: 403 QNISQL 408 I +L Sbjct: 298 DEIGEL 303 >gi|312197712|ref|YP_004017773.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311229048|gb|ADP81903.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 372 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 36/256 (14%), Positives = 69/256 (26%), Gaps = 65/256 (25%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+SGSM D ++ A + A F+ + S Sbjct: 87 TIMLALDVSGSM---------------CSTDVPPNRITAAEKAATAFIKAQPAGS----- 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ ++ + P T ++ + + + T + + + Sbjct: 127 -RIGLVTFSGIAGLLVPP---TTDSQKLLDALQNLTTSRGTAIGQGILTSIDAIADADPS 182 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 T G I+ LTDG N + T A+ +++ TI Sbjct: 183 VAPTGSAVSGNGTGPYAA-DVIVVLTDGANTQGV-DPQTAAKQAAARR--LRVYTIGFGT 238 Query: 371 ---AS---------------------------------PNGQRLLKTCVSSPEYHYNVVN 394 A + Q L ++ +Y N Sbjct: 239 TTPAPMVCGSSQVGGFGGFGGFGGFGGGGRLGDRSPLVIDEQALRDVAATTGGTYYRAQN 298 Query: 395 ADSLIHVFQNISQLMV 410 A L + + + Sbjct: 299 AGQLQDALGTLPRNIT 314 >gi|254525166|ref|ZP_05137221.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] gi|219722757|gb|EED41282.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] Length = 334 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 75/232 (32%), Gaps = 53/232 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L +D+SGSM + + ++ A K L FLD + Sbjct: 101 MMLAMDVSGSMG----------EGDMVLGGQAVDRLTAAKAVLADFLDR-------RAGD 143 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + R + VR + + L + T A+ A + L S + Sbjct: 144 RIGLLIFGDRAYTLTPLTADLASVRDQLRDSVVGLAGRETAIGDAIGLAVKRLRSQPEG- 202 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--- 369 Q+ +I LTDG +N + ++ + A+ ++I T++ Sbjct: 203 -----------------QRVLILLTDGVSNAGV--LEPLRAAEVARAEGVRIHTVAFGGD 243 Query: 370 -------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + L K + + + L ++ + +L Sbjct: 244 GSMRVFGISISADQDPVDEATLKKIAGMTGGQFFRARDTAQLAGIYAELDRL 295 >gi|330973664|gb|EGH73730.1| von Willebrand factor, type A [Pseudomonas syringae pv. aceris str. M302273PT] Length = 352 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 44/248 (17%), Positives = 82/248 (33%), Gaps = 66/248 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 131 RKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRLRMR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 191 PATS------------------RALVLVTDGANNAGQIDPVTAAR--LAAEEGVKIYAIG 230 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL------IHVFQNI 405 I + P+ L LK S S ++ + D L + + + Sbjct: 231 IGSDPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDALEPV 290 Query: 406 SQLMVHRK 413 +Q + Sbjct: 291 AQQPTQAR 298 >gi|66044963|ref|YP_234804.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] gi|63255670|gb|AAY36766.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] Length = 352 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 44/248 (17%), Positives = 82/248 (33%), Gaps = 66/248 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 131 RKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRLRMR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 191 PATS------------------RALVLVTDGANNAGQIDPVTAAR--LAAEEGVKIYAIG 230 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL------IHVFQNI 405 I + P+ L LK S S ++ + D L + + + Sbjct: 231 IGSDPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDALEPV 290 Query: 406 SQLMVHRK 413 +Q + Sbjct: 291 AQQPTQAR 298 >gi|311274909|ref|XP_003134506.1| PREDICTED: matrilin-4-like [Sus scrofa] Length = 721 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 65/170 (38%), Gaps = 24/170 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 + + L +G+I Y+++V+ G R+ + + +L+ + T + Sbjct: 197 LVGLLRSLDVGPNATRVGVIQYSSQVQSVFPL--GAFSRREDMEGAIRALVPLAQGTMTG 254 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + S + G + P + + +TDG + ++ Sbjct: 255 LAIQYAMNVAFSVAE----------GARPPEARVPRVAVIVTDG---RPQD--RVAEVAA 299 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 +A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 300 QARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 346 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 512 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 569 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +V+ +AKE Sbjct: 570 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDVSV--WAARAKE 614 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + L+ S P + + D F ++ L+ + K S+ Sbjct: 615 EGIVMYAVGVG--KAVEEELREIASEPAELHVSYSPD-----FSTMTHLLENLKGSI 664 >gi|330963348|gb|EGH63608.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. actinidiae str. M302091] Length = 352 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 44/248 (17%), Positives = 82/248 (33%), Gaps = 66/248 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 131 RKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRLRLR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 191 PANS------------------RVLVLVTDGANNAGQIDPITAAR--LAAEEGVKIYPIG 230 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL------IHVFQNI 405 I + P+ L LK S S ++ + D L + + + Sbjct: 231 IGSDPDKDALQSVLGLNPSLDLDEPTLKEIASISGGQYFRARDGDQLEKIRATLDALEPV 290 Query: 406 SQLMVHRK 413 +Q + Sbjct: 291 AQQPTQAR 298 >gi|213968792|ref|ZP_03396933.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato T1] gi|213926395|gb|EEB59949.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato T1] Length = 328 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 44/248 (17%), Positives = 82/248 (33%), Gaps = 66/248 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 67 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 106 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 107 RKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRLRLR 166 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 167 PANS------------------RVLVLVTDGANNAGQIDPITAAR--LAAEEGVKIYPIG 206 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL------IHVFQNI 405 I + P+ L LK S S ++ + D L + + + Sbjct: 207 IGSDPDKDALQSALGLSPSLDLDEPTLKEIASISGGQYFRARDGDQLEKIRATLDALEPV 266 Query: 406 SQLMVHRK 413 +Q + Sbjct: 267 AQQPTQAR 274 >gi|28870917|ref|NP_793536.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tomato str. DC3000] gi|301385766|ref|ZP_07234184.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato Max13] gi|302061830|ref|ZP_07253371.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato K40] gi|302134226|ref|ZP_07260216.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato NCPPB 1108] gi|28854166|gb|AAO57231.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato str. DC3000] gi|331018299|gb|EGH98355.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 352 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 44/248 (17%), Positives = 82/248 (33%), Gaps = 66/248 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 131 RKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRLRLR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 191 PANS------------------RVLVLVTDGANNAGQIDPITAAR--LAAEEGVKIYPIG 230 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL------IHVFQNI 405 I + P+ L LK S S ++ + D L + + + Sbjct: 231 IGSDPDKDALQSALGLSPSLDLDEPTLKEIASISGGQYFRARDGDQLEKIRATLDALEPV 290 Query: 406 SQLMVHRK 413 +Q + Sbjct: 291 AQQPTQAR 298 >gi|260777338|ref|ZP_05886232.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] gi|260607004|gb|EEX33278.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] Length = 271 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 36/265 (13%), Positives = 79/265 (29%), Gaps = 56/265 (21%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + + L+ + + + + + LVVDLS SM D Sbjct: 3 KVLAGLVWVCLITAVARPVWYGDPVTTQPKHR-DMMLVVDLSYSMSKEDMQFNGDY---- 57 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 +++A+K L F+ ++ +GL+ + + V + Sbjct: 58 ------IDRLSAVKQVLSDFISK-------RQGDRLGLVLFADHAYLQTPLTLDRHTVAE 104 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + + LI T + A + S Q+ +I L+D Sbjct: 105 QLNQTVLRLIGTKTAIGEGIGLATKTFVD------------------SDAPQRVMILLSD 146 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------NGQRLLK 380 G N + AK+ I T+ + A + + L++ Sbjct: 147 GSNTAGVLDPIEAA--KIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDEKSLME 204 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNI 405 + ++ ++ L ++ I Sbjct: 205 IAKLTGGQYFRARDSKELATIYDTI 229 >gi|163751139|ref|ZP_02158369.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161329095|gb|EDQ00167.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 334 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 35/235 (14%), Positives = 76/235 (32%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + + + +++ + F++ ++ Sbjct: 84 DLMLAVDLSGSM----------QIEDMVLNGQTVDRFTMIQDVVSDFIER-------RKG 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ + K Sbjct: 127 DKLGLILFADHAYLQAPLTQDRRSVAQFLQEAQIGLVGKQTAIGEAIALGVKRFDMVDKS 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + ++ LTDG NN+ + A + +KI I + A Sbjct: 187 N------------------RILVLLTDGSNNSGSISPEQAA--AIAAKRGVKIYAIGVGA 226 Query: 372 SPNGQ------------------RLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + +L+ ++ ++ ++ L ++Q I +L Sbjct: 227 DVMERRSIFGTERVNPSMDLDEAQLISLAKTTGGLYFRARSSQDLQLIYQEIDKL 281 >gi|153001301|ref|YP_001366982.1| von Willebrand factor type A [Shewanella baltica OS185] gi|151365919|gb|ABS08919.1| von Willebrand factor type A [Shewanella baltica OS185] Length = 340 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 72/235 (30%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + K + +++ + F++ ++ Sbjct: 85 DLMMAVDLSGSM----------QIEDMVVNGKTVDRFTLIQHVVSDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIGLAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAGNIEPEQAAQ--IAANRKVTIYTVGVGA 227 Query: 372 S-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + LK + ++ N+ L +++ I +L Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYKEIDKL 282 >gi|27365660|ref|NP_761188.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] gi|27361808|gb|AAO10715.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] Length = 465 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 63/141 (44%), Gaps = 9/141 (6%) Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 V R + + T++ M A ++L+ + + + K++ Sbjct: 324 RHFESTVQRLVPGMN---TNNAEGMVWAMRLLSPYWQGIWDKTRPELPRRYSDETSNKYL 380 Query: 334 IFLTDGENNNFKS--NVNTIKICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPEYH 389 + +DG + + + IC + K+ +K++T++ +RL+++C S PE + Sbjct: 381 VMFSDGNHLIDPAFRDKKMKLICTQLKQPGRGVKVMTVNFGG-AASERLMQSCASGPE-Y 438 Query: 390 YNVVNADSLIHVFQNISQLMV 410 Y+V + S+ VF+ I++ ++ Sbjct: 439 YHVASLFSVEKVFEQIAEQVI 459 >gi|126174972|ref|YP_001051121.1| von Willebrand factor type A [Shewanella baltica OS155] gi|125998177|gb|ABN62252.1| von Willebrand factor, type A [Shewanella baltica OS155] Length = 339 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 75/235 (31%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + K + +++ + F++ ++ Sbjct: 85 DLMMAVDLSGSM----------QIEDMVVNGKTVDRFTLIQHVVSDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIGLAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN N+ + A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNN--SGNIEPEQAAQIAANRKVTIYTVGVGA 227 Query: 372 S-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + LK + ++ N+ L ++Q I +L Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYQEIDKL 282 >gi|167763116|ref|ZP_02435243.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] gi|167699456|gb|EDS16035.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] Length = 327 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 40/239 (16%), Positives = 70/239 (29%), Gaps = 63/239 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D K ++ A K+ F++ Sbjct: 88 IDIMLAIDVSTSMLAE---------------DLKPNRLEAAKDVAAEFIN----GRPNDN 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 G+ + + + + LI T + A L K Sbjct: 129 V---GITLFAGESFTQCPLTVDHAVLLNLIKDVKCGLIEDGTAVGMGIANAVTRLKDSKA 185 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 +S K II LTDG NN + T + AK I++ TI + Sbjct: 186 KS------------------KVIILLTDGTNNRGDISPLTAA--EIAKSFGIRVYTIGVG 225 Query: 370 ---NAS-----------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 A + + L + ++ ++ + L V++ I +L Sbjct: 226 TNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDKL 284 >gi|332140758|ref|YP_004426496.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] gi|327550780|gb|AEA97498.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] Length = 349 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 73/231 (31%), Gaps = 55/231 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L VDLSGSM ++ ++ K+ + F+ + Sbjct: 90 MMLAVDLSGSM----------KIDDMQLNGRQVNRLTMTKSVVYDFIQR-------RVGD 132 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + ++ + V ++ + L+ + T A+ A + + + Sbjct: 133 RLGLILFADTAYVQAPLTYDRDTVSTLLSEAVIGLVGEQTAIGDAIGLAVKRFDERDESN 192 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 +I LTDG+N K + A +K+ TI + A Sbjct: 193 N------------------VLILLTDGQNTAGNITPEQAK--ELAINKGVKVYTIGVGAD 232 Query: 373 P-------------NGQRL----LKTCVSS-PEYHYNVVNADSLIHVFQNI 405 Q L L +S ++ NA L ++Q + Sbjct: 233 KMLIQSFFGSREINPSQELDEGMLTDIATSTGGQYFRARNAQELEAIYQQL 283 >gi|330810109|ref|YP_004354571.1| hypothetical protein PSEBR_a3255 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327378217|gb|AEA69567.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 359 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 40/225 (17%), Positives = 80/225 (35%), Gaps = 54/225 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM +D++ +++A +++ L FL+ +E Sbjct: 91 DLLVAVDVSGSMD----------FPDMQWKDEEVSRLALVQHMLGDFLE-------GREG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRHWLDEARIGIAGKNTAIGDAIGLALKRLRQRPAH 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + +I +TDG NN + + T A + +KI I I A Sbjct: 194 S------------------RVLILVTDGANNGGEIDPLTAAR--LAADEGVKIYPIGIGA 233 Query: 372 SPN----------------GQRLLKTCVS-SPEYHYNVVNADSLI 399 +P + LK + ++ + + L+ Sbjct: 234 APEQSGTTGSLGVNPSLDLDEPTLKEIAEVTGGQYFRAQDGEQLL 278 >gi|320156062|ref|YP_004188441.1| hypothetical protein VVM_02402 [Vibrio vulnificus MO6-24/O] gi|319931374|gb|ADV86238.1| hypothetical protein VVMO6_01216 [Vibrio vulnificus MO6-24/O] Length = 465 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 63/141 (44%), Gaps = 9/141 (6%) Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 V R + + T++ M A ++L+ + + + K++ Sbjct: 324 RHFESTVQRLVPGMN---TNNAEGMVWAMRLLSPYWQGIWDKTRPELPRRYSDETSNKYL 380 Query: 334 IFLTDGENNNFKS--NVNTIKICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPEYH 389 + +DG + + + IC + K+ +K++T++ +RL+++C S PE + Sbjct: 381 VMFSDGNHLIDPAFRDKKMKLICTQLKQPGRGVKVMTVNFGG-AASERLMQSCASGPE-Y 438 Query: 390 YNVVNADSLIHVFQNISQLMV 410 Y+V + S+ VF+ I++ ++ Sbjct: 439 YHVASLFSVEKVFEQIAEQVI 459 >gi|333030669|ref|ZP_08458730.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] gi|332741266|gb|EGJ71748.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] Length = 328 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 49/310 (15%), Positives = 95/310 (30%), Gaps = 69/310 (22%) Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 ++ +I + ++ + + + Y L FI + +++ + Sbjct: 23 VMRHYKIDASLQVSDTRVYEHAPKSFKV-----YLIHLPFILRIAALALLIIVLARPQST 77 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + E I L +D+SGSM D + ++ A K+ F++ Sbjct: 78 DNWKSSEVEGIDIMLAIDVSGSMLAE---------------DLQPNRLEAAKDVAAKFIN 122 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 D + +G+ + + + +I T + Sbjct: 123 --DRPNDN-----IGITLFAGESFTQCPLTIDHTALLNLFGNIQTGVIEDGTAIGMGVSN 175 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A L + +S K II LTDG NN + T + AK Sbjct: 176 AVARLKDSQAKS------------------KVIILLTDGSNNAGDISPLTSA--EIAKTY 215 Query: 361 FIKIVTISI----NASPNGQRL-----------------LKTCV-SSPEYHYNVVNADSL 398 I++ T+ I A Q + LK ++ ++ + SL Sbjct: 216 GIRVYTVGIGTRGTAPYPIQTMTGAIQRIQVEVDIDEPTLKDIARTTGGVYFRATDNTSL 275 Query: 399 IHVFQNISQL 408 ++Q I QL Sbjct: 276 QDIYQEIDQL 285 >gi|330878848|gb|EGH12997.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 352 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 42/248 (16%), Positives = 82/248 (33%), Gaps = 66/248 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + +D+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAIDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 131 RKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRLRLR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A + +KI I Sbjct: 191 PANS------------------RVLVLVTDGANNAGQIDPITAAR--LAADEGVKIYPIG 230 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL------IHVFQNI 405 I + P+ L LK S S ++ + D L + + + Sbjct: 231 IGSDPDKDALQSALGLNPSLDLDEPTLKEIASISGGQYFRARDGDQLEKIRVTLDALEPV 290 Query: 406 SQLMVHRK 413 +Q + Sbjct: 291 AQQPTQAR 298 >gi|194224466|ref|XP_001500626.2| PREDICTED: matrilin 4 [Equus caballus] Length = 542 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 67/170 (39%), Gaps = 24/170 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 + + L +G+I Y+++V+ S R+ + R + +L+ + T + Sbjct: 59 LVGLLRSLDVGPNATRVGVIQYSSQVQSVFPLS--AFSRREDMERAIRALVPLAQGTMTG 116 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + S + G + P + + +TDG + ++ Sbjct: 117 LAIQYAMNVAFSVAE----------GARPPEARVPRVAVIVTDG---RPQD--RVAEVAA 161 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 +A+ I+I + + + G L+ S P E+ + V ++ LI F Sbjct: 162 QARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLV-DSFDLIQEFG 208 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 333 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 390 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 391 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 435 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + L+ S P + + D F ++ L+ + K S+ Sbjct: 436 EGIVMYAVGVG--KAVEEELRKIASEPAELHVSYSPD-----FGTMTHLLENLKGSI 485 >gi|73992503|ref|XP_543015.2| PREDICTED: similar to Matrilin-4 precursor [Canis familiaris] Length = 624 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 67/170 (39%), Gaps = 24/170 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 + + L +G+I Y+++V+ G R+ + R + +L+ + T + Sbjct: 59 LVGLLRGLDVGPNATRVGVIQYSSQVQSVFPL--GAFSRREDMERAIRALVPLAQGTMTG 116 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + S + G + P + + +TDG + ++ Sbjct: 117 LAIQYAMNVAFSVAE----------GARPPEARVPRIAVIVTDG---RPQD--RVAEVAA 161 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 +A+ I+I + + + G L+ S P E+ + V + + LI F Sbjct: 162 QARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFN-LIQEFG 208 >gi|16331837|ref|NP_442565.1| hypothetical protein sll0103 [Synechocystis sp. PCC 6803] gi|2496792|sp|Q55874|Y103_SYNY3 RecName: Full=Uncharacterized protein sll0103 gi|1208467|dbj|BAA10635.1| sll0103 [Synechocystis sp. PCC 6803] Length = 420 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 39/225 (17%), Positives = 74/225 (32%), Gaps = 53/225 (23%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LV+D SGSM + +K+A L +D ++ Sbjct: 42 LNLCLVLDHSGSMDGQ--------------------PLETVKSAALGLIDRLEEDD---- 77 Query: 251 DVYMGLIGYTTR---VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + +I + R V +N + G + + + R T +K Q Sbjct: 78 --RLSVIAFDHRAKIVIENQQVRNG-AAIAKAIERLKAE---GGTAIDEGLKLGIQEAAK 131 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ I LTDGEN + ++ +K+ A + + + T+ Sbjct: 132 GKEDRVSH-----------------IFLLTDGENEHGDND-RCLKLGTVASDYKLTVHTL 173 Query: 368 SINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNISQLMVH 411 Q +L+ +S + N +H F+ + Q M + Sbjct: 174 GF-GDHWNQDVLEAIAASAQGSLSYIENPSEALHTFRQLFQRMSN 217 >gi|88707026|ref|ZP_01104723.1| von Willebrand factor type A domain protein [Congregibacter litoralis KT71] gi|88698754|gb|EAQ95876.1| von Willebrand factor type A domain protein [Congregibacter litoralis KT71] Length = 330 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 71/229 (31%), Gaps = 43/229 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ + L +DLSGSM D E ++ A K+ L F Sbjct: 90 EKSARDLMLALDLSGSMDARDFRDAEGHEQ---------NRLTAAKDVLEGFAAQ----- 135 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E +GLI + + E + + ++ + T A+ A I Sbjct: 136 --REGDRLGLIVFGNAAYLQAPFTDDRETWQTLLEESEVAMAGQSTALGDAIGLAISIFQ 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + S + +I LTDG + + A N + I T Sbjct: 194 A------------------SDTTNRVLIVLTDGNDTGSRVPPRDAATIAAA--NDVTIYT 233 Query: 367 ISINASPN------GQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 +++ L ++ + ++ +L + I++L Sbjct: 234 VAVGDPATIGEEALDLETLNAVAETTGGASFQALDTQALEKAYDEINRL 282 >gi|26352386|dbj|BAC39823.1| unnamed protein product [Mus musculus] Length = 902 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 64/370 (17%), Positives = 130/370 (35%), Gaps = 66/370 (17%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + +KN++E+ + + G + +V N R + +K L + FI + Sbjct: 170 VDQPFYMSRKNTIEATRCSTRITG-TNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPD 228 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQ 159 I+ + G S +F + + + A N + N +++ ++ Sbjct: 229 KIQTA--GASIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDV---------IK 277 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 N + + P + R + + LV+D SGSM Sbjct: 278 TSADFQNAPPMRGTEAPPPPTFSLLKSRRRV--VCLVLDKSGSMD--------------- 320 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQ 278 K ++ + A L+L I V+++ +GL+ + + +N + Q Sbjct: 321 ----KEDRLIRMNQAAELYLTQI-----VEKESMVGLVTFDSAAHIQNYLIKITSSSDYQ 371 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +T ++ T ++ +Q +TS + + + I+ LTD Sbjct: 372 KITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE----------------IVLLTD 415 Query: 339 GENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNAD 396 GE+N +S C +A + I TI++ + L + Y + + Sbjct: 416 GEDNGIRS-------CFEAVSRSGAIIHTIAL-GPSAARELETLSDMTGGLRFYANKDLN 467 Query: 397 SLIHVFQNIS 406 SLI F IS Sbjct: 468 SLIDAFSRIS 477 >gi|328953621|ref|YP_004370955.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] gi|328453945|gb|AEB09774.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] Length = 376 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 48/308 (15%), Positives = 99/308 (32%), Gaps = 25/308 (8%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 E+ ++I AL + + G I + + + K M+SA +AA+ G K+ + + Sbjct: 7 RHEEGAIAVITALLLPVLIGFTGLAIDIGNLYVIKTRMQSAVDAAVCGGGLKLPNQGLAM 66 Query: 76 GDRFESISNH------AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 I+++ A D + + S + V T + IS+ Sbjct: 67 TTANSFITSNGFDPNDATITYTQDTVNNPAGSPEINCSMTNQV--PTFFLGLFGYPNISI 124 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 T A L + +N + + + + + + + ++ I +I Sbjct: 125 TVSAKGILQTGGAGGPFNYAIFSDQNLPISGNQKITGSVHTNHQLT-ISGNTKISGAAEG 183 Query: 190 IFLIELVVDLSGSMH-CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + L SGS ++ +D A + A+ F I+ + Sbjct: 184 ATGVRL----SGSNTIGSVQADTTANIHVSGSNTIGSQSGGATQIAMPDFTSQIEAAAQA 239 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + VY Y+ ++ GT + + D + T S A +T Sbjct: 240 QGTVY-----YSAKILS------GTNDLDGNIWVQGDITMSGMTTSGTGAILANGNITIS 288 Query: 309 KKRSFFTN 316 + N Sbjct: 289 GNTTIGGN 296 >gi|304411849|ref|ZP_07393460.1| von Willebrand factor type A [Shewanella baltica OS183] gi|307303383|ref|ZP_07583138.1| von Willebrand factor type A [Shewanella baltica BA175] gi|304349709|gb|EFM14116.1| von Willebrand factor type A [Shewanella baltica OS183] gi|306913743|gb|EFN44165.1| von Willebrand factor type A [Shewanella baltica BA175] Length = 339 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 72/235 (30%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + K + +++ + F++ ++ Sbjct: 85 DLMMAVDLSGSM----------QIEDMVVNGKTVDRFTLIQHVVSDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAGNIEPEQAAQ--IAANRKVTIYTVGVGA 227 Query: 372 S-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + LK + ++ N+ L ++Q I +L Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYQEIDKL 282 >gi|301165481|emb|CBW25052.1| putative membrane protein (von Willebrand factor type A) [Bacteriovorax marinus SJ] Length = 329 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 38/179 (21%), Positives = 65/179 (36%), Gaps = 43/179 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+S SM D + ++ K+ + F+ ++ Sbjct: 86 DIFFVIDVSRSMLAD---------------DFRPNRLEVAKDKISDFVA-------LRPT 123 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKK 310 +GLI ++ R + S + ++Q V +L T+ A+ A Sbjct: 124 DRIGLIMFSERAFTLLPLSTDLKLIKQMVGEINVGGMLGSGTNIGDALGLAV-------- 175 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 SL K II LTDG +N + I+ ++AK+ IK+ TI I Sbjct: 176 ----------ARGAQSLAKNKVIILLTDGVSN--VGFLTPIQAAEEAKKQGIKVYTIGI 222 >gi|37680183|ref|NP_934792.1| hypothetical protein VV1999 [Vibrio vulnificus YJ016] gi|37198930|dbj|BAC94763.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 481 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 63/141 (44%), Gaps = 9/141 (6%) Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 V R + + T++ M A ++L+ + + + K++ Sbjct: 340 RHFESTVQRLVPGMN---TNNAEGMVWAMRLLSPYWQGIWDKTRPELPRRYSDETSNKYL 396 Query: 334 IFLTDGENNNFKS--NVNTIKICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPEYH 389 + +DG + + + IC + K+ +K++T++ +RL+++C S PE + Sbjct: 397 VMFSDGNHLIDPAFRDKKMKLICTQLKQPGRGVKVMTVNFGG-AASERLMQSCASGPE-Y 454 Query: 390 YNVVNADSLIHVFQNISQLMV 410 Y+V + S+ VF+ I++ ++ Sbjct: 455 YHVASLFSVEKVFEQIAEQVI 475 >gi|299143633|ref|ZP_07036713.1| von Willebrand factor type A domain protein [Peptoniphilus sp. oral taxon 386 str. F0131] gi|298518118|gb|EFI41857.1| von Willebrand factor type A domain protein [Peptoniphilus sp. oral taxon 386 str. F0131] Length = 1217 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 53/364 (14%), Positives = 102/364 (28%), Gaps = 80/364 (21%) Query: 98 IKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 I + +F + +R S L + T+ ++ S Sbjct: 113 PDGTINYGPFEETKIFKVSGNMIRNAENRESSAPQWVQNLINPGGTLDPDLVNEGSVKAY 172 Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 +E N + + A ++ + I LV+D SGSM Sbjct: 173 KPVVEEDPNIANKWTVKMLVAARDSVKTSK-----IVLVIDTSGSM-------------- 213 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 K +M KNA F +D + + +G++ + + V + + K+ Sbjct: 214 -----KDFGRMKGAKNAANAF---VDNVLDGSQSTQIGIVRFASNVSIVSDFTSNKAKLH 265 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + T + +KQA +L + GV + +L Sbjct: 266 SAINALSAE---GGTFTQAGVKQARTMLAGSGADKKYMVVLSDGVPTFNYKISHPNNYLI 322 Query: 338 DG-----------------------ENNNFKSNVNTIKI--------------------- 353 DG E+ N + Sbjct: 323 DGGPGSHTHEKQTGKQLPENEFLYNESATGSGNSMWKQYERVKIGGGIFPTYEYHYYNSG 382 Query: 354 -CD-----KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 C AK +++ T+ + G +L+ S+P V + L VF+ I+ Sbjct: 383 NCAIAESGFAKAANMRVFTVGLQTDATGSGVLRDIASAPGDFTEVTDVSQLTPVFEQIAA 442 Query: 408 LMVH 411 ++ Sbjct: 443 QILK 446 >gi|152993979|ref|YP_001359700.1| von Willebrand factor type A domain-containing protein [Sulfurovum sp. NBC37-1] gi|151425840|dbj|BAF73343.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 307 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 53/225 (23%), Positives = 84/225 (37%), Gaps = 39/225 (17%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + L +D SGSM S +D+ +TK S D + Sbjct: 75 QHKKGRDLVLAIDASGSMA----------QSGFDEKDRFKTKYET------TLDLSADFI 118 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 H +D MG++ + T ++ E + + + + T A+ QA + L Sbjct: 119 KHRFDD-NMGVVIFGTFAYTASPLTYDLEAMESMLKMTTVGIAGESTAIGDALMQAMRTL 177 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + +S K II LTDG +N +S+ KAKE IKI Sbjct: 178 SYGEAQS------------------KAIILLTDGYHNAGRSSPKAAVA--KAKEKGIKIY 217 Query: 366 TISI-NASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 TI + +S LL T S Y +A L V++ I +L Sbjct: 218 TIGVGKSSDYDAALLDTIAKESGGKSYAAASAAQLKEVYKEIDKL 262 >gi|228471029|ref|ZP_04055873.1| BatA protein [Porphyromonas uenonis 60-3] gi|228307249|gb|EEK16272.1| BatA protein [Porphyromonas uenonis 60-3] Length = 326 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 42/274 (15%), Positives = 75/274 (27%), Gaps = 63/274 (22%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 SY L F LL ++ E + L +DLSGSM Sbjct: 49 SYLRALPFALELLALASMILALARPQDSSHWEERSIQGIDLVLAMDLSGSMQAL------ 102 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D K + A ++ + +GL+ + + Sbjct: 103 ---------DLKPNRFEAARDVASEMI----AARPNDN---IGLVVFAGESFTLCPLTVD 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + Q + + T + A L S K Sbjct: 147 HDVILQMLDATEIGQLEDGTAIGLGLATAINTL------------------RGSDNKSKV 188 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS-------------------INAS- 372 II LTDG NN + + A++ I+I T++ + A Sbjct: 189 IILLTDGSNNAGDITPSMAA--ELAQQYGIRIYTVAAGTNGVAKFPVQTASGIEYVEADV 246 Query: 373 PNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 + L+ + +Y + L +++ I Sbjct: 247 QIDEGTLRHIAQQTGGKYYRATDETKLHEIYKEI 280 >gi|218190303|gb|EEC72730.1| hypothetical protein OsI_06342 [Oryza sativa Indica Group] Length = 585 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 73/216 (33%), Gaps = 34/216 (15%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D+SGSM + S+ ++K +++ LK A+ + +D Sbjct: 46 IDVVAVLDVSGSMG------DPAMASSDFEKNKPPSRLDVLKEAMKFIIRKLDDGD---- 95 Query: 251 DVYMGLIGYTTR-VEKNIEPSWGTEKVRQYVTRDMDS--LILKPTDSTPAMKQAYQILTS 307 + ++ + R V++ + + T PA+++A ++L Sbjct: 96 --RLSIVAFNDRPVKEYSTGLLNISGNGRRIAEKKVDWLEARGGTALMPALEEAIRVLDC 153 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 S + FI+ LTDG++ + + K + T Sbjct: 154 RPGDSRNSVG--------------FILLLTDGDDTSGFRWSRDVINGAVGK---YPVHTF 196 Query: 368 SINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHV 401 + A+ + + LL S + N D + Sbjct: 197 GLGAAHSSEALLHIAQESRGTYSFVDDENMDKIAGA 232 >gi|258647263|ref|ZP_05734732.1| BatA protein [Prevotella tannerae ATCC 51259] gi|260852912|gb|EEX72781.1| BatA protein [Prevotella tannerae ATCC 51259] Length = 334 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 47/281 (16%), Positives = 84/281 (29%), Gaps = 70/281 (24%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 L F+ +L ++ + E I + +D+S SM Sbjct: 53 IHLPFLLRILTFTCIIFALARPQTHNALSNKETEGINIMMAIDVSTSM------------ 100 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + D +++ K F+ + + D +GL + + Sbjct: 101 ---LTPDLPPSRIETAKQVAYEFI-------NNRPDDNIGLTVFGGEAYTQCPLTTDHSA 150 Query: 276 V-----RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + + + +I T + A L K +S Sbjct: 151 LLNMFKQVNCDLQKEGVISPGTAIGMGLSSAVSHLEQSKSKS------------------ 192 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI----------------------- 367 K II LTDGENN + + T + AK I+I TI Sbjct: 193 KVIILLTDGENNAGEISPLTAA--EMAKRLGIRIYTISVGTDAAVNQTVATLPNGETYEA 250 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +I + + + L S+ Y + L ++QNI +L Sbjct: 251 AIKQNTDPKTLEAIANSTGGKFYQARSKAKLRDIYQNIDRL 291 >gi|90410254|ref|ZP_01218271.1| hypothetical protein P3TCK_05786 [Photobacterium profundum 3TCK] gi|90329607|gb|EAS45864.1| hypothetical protein P3TCK_05786 [Photobacterium profundum 3TCK] Length = 370 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 78/224 (34%), Gaps = 25/224 (11%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K T++ A+K L F S ++ Sbjct: 95 DVMVVVDLSGSMA---------EKDFLTPYGEKHTRLDAVKQVLAQF-------SQHRDG 138 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + E + + + T A+ A ++ Sbjct: 139 DRLGLILFGDAAYLQAPFTADHETWLALLDETQVGMAGQSTHLGDAIGLAIKVFNDQADA 198 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + + +K +I LTDG N+ S V I A I+I I++ Sbjct: 199 AKQAATQQGSAVVTRPVKEKVVIVLTDG--NDTDSLVPPIDAAKVAASRDIRIHMIAMGD 256 Query: 372 SPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G++ L + + + ++ L V++ I +L Sbjct: 257 PRTVGEQALDMEVIEQVASLTGGQSFQALSPAELTRVYKKIGEL 300 >gi|312131680|ref|YP_003999020.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311908226|gb|ADQ18667.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 328 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 71/226 (31%), Gaps = 48/226 (21%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 I + +D+S SM D K ++ A K L F++ Sbjct: 100 AGVDIAIAIDISDSMLAE---------------DLKPNRLEAAKGMALRFIE-------G 137 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTS 307 + + L+ + + +++Y+ ++I T A+ L Sbjct: 138 RTTDRIALVAFAGETATLSPLTTDYTALKEYLASINTNIIRTSGTALGMALSSCVNKLRD 197 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 +S + I ++DG+N T + AK +++ TI Sbjct: 198 VAGKS------------------RIAIIISDGDNTAGTIPPETAV--ELAKSFGVRVYTI 237 Query: 368 SINASPN----GQRLLKTCVSSP-EYHYNVVNADSLIHVFQNISQL 408 +I + ++ L+ P + + SL +F I L Sbjct: 238 AIGKPGSEEGVDEKTLRMLAGGPNGRFFKAADNSSLSKIFDEIDHL 283 >gi|188994393|ref|YP_001928645.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] gi|188594073|dbj|BAG33048.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] Length = 327 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 37/239 (15%), Positives = 73/239 (30%), Gaps = 63/239 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+SGSM D K ++ A K+ + F+ + + Sbjct: 88 IDIMLAMDVSGSMQAM---------------DFKPNRLEAAKDVAISFI---NNRPNDN- 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G++ + + + V ++ T + A L K Sbjct: 129 ---IGMVTFAGESFTQCPLTTDHTVLLNMVQDLQMGVLDDGTAIGMGLATAVNRLKDSKA 185 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 +S + +I LTDG NN D A+ I++ T+ + Sbjct: 186 KS------------------RVVILLTDGSNNMGDITPRMAA--DIARTFGIRVYTVGVG 225 Query: 370 ---NAS----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 A + L S ++ V+ ++L +++ I +L Sbjct: 226 TRGEAPFPIQTEFGVRIQNVPVDIDEPTLDGIAEVSGGKYFRAVDNETLNEIYKEIDKL 284 >gi|154244802|ref|YP_001415760.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] gi|154158887|gb|ABS66103.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] Length = 345 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 50/269 (18%), Positives = 87/269 (32%), Gaps = 59/269 (21%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 L FI LL + + P + +E E + L VDLS SM P+ V S Sbjct: 62 TLAFIWTLLVIAAARPVYVGTPVAIPVEGRE-----MMLAVDLSASMSS-----PDLVQS 111 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 ++ +K F+ + +GLI ++TR + V Sbjct: 112 -----GVPANRLQVVKRVADDFIAR-------RTGDRIGLILFSTRAYVQAPLTLDRNVV 159 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 RQ + + + T A+ A + L + + +I L Sbjct: 160 RQLLAEASIGMTGRNTSIGDAIGLAVKTLRDRPAK------------------DRVLILL 201 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN----------------GQRLLK 380 TDG N + + A + ++I TI + A N + LK Sbjct: 202 TDGANTSGVLDPMEAA--AIAAKENVRIHTIGVGADSNFTDIQPGMLMNPSGDLDEEALK 259 Query: 381 TCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + ++ N L ++ +I +L Sbjct: 260 KIAGLTGGQYFRARNDKGLAAIYADIDRL 288 >gi|262196446|ref|YP_003267655.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262079793|gb|ACY15762.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 903 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 70/224 (31%), Gaps = 56/224 (25%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + E+P I LVVD SGSM K+ A K + + Sbjct: 449 SEKQREQPHVAIALVVDRSGSMSGL--------------------KIEAAKESARATAEV 488 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEP--SWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + + ++ + + + + ++ + R T+ PA++ Sbjct: 489 LSPSDLIT------VVAFDNQPTTIVRLQRASNRMRIATDIARLQAG---GGTNIYPALR 539 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 +AY+IL + K +I L+DG+ +C + + Sbjct: 540 EAYEILQGANAKV------------------KHVIVLSDGQAPYDG----IADLCQEMRS 577 Query: 360 NFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLIHVF 402 I + + I + LL + Y + +L +F Sbjct: 578 ARITVSAVGIG--DADRNLLNLITDNGDGRLYMTDDLAALPRIF 619 >gi|77459433|ref|YP_348940.1| von Willebrand factor, type A [Pseudomonas fluorescens Pf0-1] gi|77383436|gb|ABA74949.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 359 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 36/183 (19%), Positives = 68/183 (37%), Gaps = 37/183 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D+ ++++ +++ L FL+S ++ Sbjct: 91 DLLVAVDVSGSMD----------FPDMQWNDEDVSRLSLVQHLLGDFLES-------RDG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRVWLDEARIGIAGKNTAIGDAIGLALKRLRMRPAQ 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + +I +TDG NN + + T A +KI I I A Sbjct: 194 S------------------RVLILVTDGANNGGEIDPLTAA--KLAASEGVKIYPIGIGA 233 Query: 372 SPN 374 P Sbjct: 234 DPE 236 >gi|148680072|gb|EDL12019.1| mCG141954, isoform CRA_b [Mus musculus] Length = 972 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 71/212 (33%), Gaps = 50/212 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM T++ + A L+L I ++++ + Sbjct: 311 LVLDKSGSMR----------------LGSPITRLTLMNQAAELYLIQI-----IEKESLV 349 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQY--VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 GL+ + + Y ++ + T +K+ ++ +TS + + Sbjct: 350 GLVTFDSTATIQTNLI-RIINDSSYLAISTKLPQYPNGGTSICNGLKKGFEAITSSDQST 408 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINA 371 + I+ LTDGE+N S C + K + I TI++ Sbjct: 409 SGSE----------------IVLLTDGEDNRISS-------CFQEVKHSGAIIHTIAL-G 444 Query: 372 SPNGQRLLKTCVSSPE-YHYNVVNADSLIHVF 402 + L + Y + + LI F Sbjct: 445 PSAARELETLSDMTGGLRFYAKEDVNGLIDAF 476 >gi|291409921|ref|XP_002721255.1| PREDICTED: matrilin 4 [Oryctolagus cuniculus] Length = 346 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 66/173 (38%), Gaps = 24/173 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTD 293 + + L +G+I Y+++V+ G R+ + R + +L+ + T Sbjct: 57 RFLVGLVRSLDVGPNATRVGVIQYSSQVQSVFPL--GAFSRREDMERALRTLVPLAQGTM 114 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ A + S + G + P + + +TDG + ++ Sbjct: 115 TGLAIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEV 159 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQN 404 +A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 160 AAQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFGR 209 >gi|113969745|ref|YP_733538.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113884429|gb|ABI38481.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 338 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 66/235 (28%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM V L ++ Sbjct: 85 DLMLAVDLSGSMQIEDMVINGKVVDRFT-----------LIQHVVSEFIERRKGD----- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T ++ A + + Sbjct: 129 -RIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN + + A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAGNIDPDQAAQ--IAANRKVTIYTVGVGA 227 Query: 372 S-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + LK + ++ N+ L ++Q I +L Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELEQIYQEIDKL 282 >gi|327193254|gb|EGE60160.1| hypothetical protein RHECNPAF_1700073 [Rhizobium etli CNPAF512] Length = 457 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 58/401 (14%), Positives = 114/401 (28%), Gaps = 92/401 (22%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 ++ R F+ N +I+ ALS++ L+ +G + + + M+S +AA Sbjct: 19 LNSFGRLVRAFRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAA 78 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 ++A ++ + E Y+ Sbjct: 79 LIAAVKQINN--------------------------------SEDTDALKQKVYDWFHAQ 106 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + NS + + +T +N+ S F+ + + Sbjct: 107 VENSYALG---------EIEIDTTNHNITATASGTVPTTFM------KIANIDTVPVSVG 151 Query: 181 LRIEMGERPIFLIELVVDLSGSM----------------HCAM-----NSDPEDVNSAPI 219 ++ + +V+D S SM C ++ + Sbjct: 152 SAVKGPATSYLNVYIVIDRSPSMLLAATTSGQSTMYSGIGCQFACHTGDAHTVGKKTYAN 211 Query: 220 CQDKKRTK-----MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 D K +A+ LD ID E + +GL ++ + P+ T Sbjct: 212 NYDYSTEKNIKLRADVAGDAVREVLDMIDESDSNHERIKVGLYSLGDTTKEVLAPTLDTS 271 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 R+ ++ D L S +M Y + G S K ++ Sbjct: 272 NARKRLSDDSYGLT-----SATSMNYTY---FDVALAALQKIVGTGGDGTSSANPLKLVL 323 Query: 335 FLTDGENNN----FKSNVNTIK-------ICDKAKENFIKI 364 LTDG + K++ N K C K + Sbjct: 324 LLTDGVQSQRGWVVKNSSNLKKVAPLNPDWCGYVKNKSATM 364 >gi|307718398|ref|YP_003873930.1| hypothetical protein STHERM_c06990 [Spirochaeta thermophila DSM 6192] gi|306532123|gb|ADN01657.1| hypothetical protein STHERM_c06990 [Spirochaeta thermophila DSM 6192] Length = 458 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 46/241 (19%), Positives = 81/241 (33%), Gaps = 39/241 (16%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 +R + LV+D SGSM A++ P + D R ++ K A+ FL Sbjct: 78 SVRRGVNREEGISFLLVLDASGSMWDALDGTPTE--------DPDRMRITHAKRAIREFL 129 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + +GL + V + + P+ + Sbjct: 130 PLLSGRD------RVGLAVFNRTYRVIQPIVGDPSLVLEKLDAIE----------RPSRE 173 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN--NNFKSNVNTIKIC-DK 356 QAY L + + ++ ++ L+DGEN + + +T D Sbjct: 174 QAYTELYRSMEEALT--------DFGEEGRRRVLVVLSDGENFPVDPSESPSTPGTAIDL 225 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 A I I + RL+ S + ++ NA L V+ I Q V ++Y+ Sbjct: 226 AHRYGITCYVIHFGTEKD--RLIGDLASETGGRVFDARNALELASVYTAI-QEQVLQEYA 282 Query: 416 V 416 V Sbjct: 283 V 283 >gi|295092462|emb|CBK78569.1| von Willebrand factor type A domain. [Clostridium cf. saccharolyticum K10] Length = 2061 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 40/239 (16%), Positives = 81/239 (33%), Gaps = 53/239 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM + S D + +AL LF + + S ++ Sbjct: 1106 VMFVIDKSGSMDQSFGSGNSDARR------------EVVNSALELFFNQL---SDGDYNI 1150 Query: 253 YMGLIGYTTRVEKNIEPSWG---------TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 G ++ E+ WG T ++ + T + ++ A Sbjct: 1151 QFGGYKFSDSGERVNFNDWGWQDKYWETDTSNALSHL-KLTSWETDGSTYPSQTLRSAIS 1209 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE--NNNFKSNVNTIKICDKAKEN- 360 L + + ++++IFLTDGE N++ + + C A +N Sbjct: 1210 ALENVELGENG---------------KRYLIFLTDGEPGQNSYSFSKEEAENCYSAIKNL 1254 Query: 361 --FIKIVTISINASPNGQRLLKTCVSSPE-------YHYNVVNADSLIHVFQNISQLMV 410 I + A+ + +++ VS+ + +AD L F ++ + Sbjct: 1255 DSGTTFYAIQV-ANSDSHGFMESMVSNANFVDGVTAQKFVGNSADELNAAFSQMAAEIS 1312 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 74/226 (32%), Gaps = 32/226 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D S SM ++ D D+ ++ + +AL + + Sbjct: 1525 DLVFVIDKSLSMDYDIDGDEIKWWE-----DETESRKDIVNDALDEIIP-----DLCSQQ 1574 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + GY W E+ +Q + S T+ + A+ A +L Sbjct: 1575 YDIQIAGYQFSGSSTRVLDWSREE-QQVLNNLKISNTSYNTEPSQALADALDML------ 1627 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +K++IF+TDGE +S + K I TI ++ Sbjct: 1628 -----KTGSQAHQNQSNVKKYLIFMTDGE--PTESEELSYYAISKNPVPGASIYTIGVS- 1679 Query: 372 SPNGQRLLKTCVSSPE-------YHYNVVNADSLIHVFQNISQLMV 410 S L++ S+ E + +A + F I ++ Sbjct: 1680 SDASTDLMEGIRSTAEGNGMTAPATFKGTSAQLIKDAFTQIKDEII 1725 Score = 37.2 bits (84), Expect = 4.6, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 65/172 (37%), Gaps = 19/172 (11%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 I G++ I LV+DLS SM ++ D D +++ LK+ L + +S Sbjct: 633 ITGGDQQGLDIVLVIDLSNSMDDGISEDSSD------------SRLKVLKDTLGYYKESY 680 Query: 243 DLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST-PAMKQ 300 + D G I E++ + Y + ++D S +K+ Sbjct: 681 NSRPGKPTIDEKSGFI--DDLFEQSPNSRFSIVTYSTYASTELDWTEYGMNGSGQQTIKE 738 Query: 301 AYQILTSDKKRSFFT---NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 A L ++ ++ + + +IFL+DG+ + S+V+ Sbjct: 739 AIGELQANGGTNYEAGLYQAMEVLKERGNSSNIPVVIFLSDGKPTYYYSDVD 790 >gi|3560547|gb|AAC35003.1| chloride channel CaCC [Mus musculus] Length = 901 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 64/370 (17%), Positives = 130/370 (35%), Gaps = 66/370 (17%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + +KN++E+ + + G + +V N R + +K L + FI + Sbjct: 170 VDQPFYMSRKNTIEATRCSTRITG-TNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPD 228 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQ 159 I+ + G S +F + + + A N + N +++ ++ Sbjct: 229 KIQTA--GASIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDV---------IK 277 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 N + + P + R + + LV+D SGSM Sbjct: 278 TSADFQNAPPMRGTEAPPPPTFSLLKSRRRV--VCLVLDKSGSMD--------------- 320 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQ 278 K ++ + A L+L I V+++ +GL+ + + +N + Q Sbjct: 321 ----KEDRLIRMNQAAELYLTQI-----VEKESMVGLVTFDSAAHIQNYLIKITSSSDYQ 371 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +T ++ T ++ +Q +TS + + + I+ LTD Sbjct: 372 KITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE----------------IVLLTD 415 Query: 339 GENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNAD 396 GE+N +S C +A + I TI++ + L + Y + + Sbjct: 416 GEDNGIRS-------CFEAVSRSGAIIHTIAL-GPSAARELETLSDMTGGLRFYANKDLN 467 Query: 397 SLIHVFQNIS 406 SLI F IS Sbjct: 468 SLIDAFSRIS 477 >gi|313203640|ref|YP_004042297.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442956|gb|ADQ79312.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 327 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 43/273 (15%), Positives = 82/273 (30%), Gaps = 63/273 (23%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + F+ +L ++ A I + +D+S +M Sbjct: 54 HIPFVLRILVIACLIIALARPQASNSWRTQNTEGIDIMMALDISSTMLAG---------- 103 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 D K T++ A K+ F+ L +GL+ + + + Sbjct: 104 -----DIKPTRLEAAKSVATEFI----LSRPNDN---IGLVIFARESFTQCPLTTDHAVL 151 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + +I T + A + K +S K II L Sbjct: 152 VNLFNGVNNGMIEDGTAIGLGLANAVNRIKDGKSKS------------------KVIILL 193 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTIS----------INASPNGQ---------- 376 TDG NN+ T + AK I+I TI ++ Q Sbjct: 194 TDGSNNSGDIAPITAA--EIAKTFGIRIYTIGVGTHGVINIPVSTPMGIQYQRVQSEFDA 251 Query: 377 RLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + L+ + + ++ + L +++Q I +L Sbjct: 252 KSLENIANLTGGKYFGATDNSKLRNIYQEIDKL 284 >gi|260592520|ref|ZP_05857978.1| BatA protein [Prevotella veroralis F0319] gi|260535566|gb|EEX18183.1| BatA protein [Prevotella veroralis F0319] Length = 318 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 45/267 (16%), Positives = 76/267 (28%), Gaps = 58/267 (21%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 L L +++ + + I L +D+S SM Sbjct: 53 IHLPMFLRCLVYTLVVIVLARPQTYNSWDNKDAEGIDIMLTMDISASM------------ 100 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS----- 270 + +D ++ K F+ +GL + + Sbjct: 101 ---LTEDVFPNRIEVAKEVASDFI----SGRPNDN---IGLTIFAGEAFTQCPMTVDHAA 150 Query: 271 --WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 VR + LI T + + L K +S Sbjct: 151 LLNLLHNVRT--DLVVKGLIQDGTAIGMGLANSVSRLKDSKAKS---------------- 192 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG------QRLLKTC 382 K II LTDG NN + T AK+ I+I TI + G + L+ Sbjct: 193 --KVIILLTDGSNNVGSISPMTAA--SIAKKYGIRIYTIGLGKESEGDLGAIDYKTLQNI 248 Query: 383 V-SSPEYHYNVVNADSLIHVFQNISQL 408 S+ Y + L ++Q+I +L Sbjct: 249 AVSTNGEFYRAQSQAELSKIYQDIDKL 275 >gi|148680077|gb|EDL12024.1| mCG3350, isoform CRA_d [Mus musculus] Length = 902 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 64/370 (17%), Positives = 130/370 (35%), Gaps = 66/370 (17%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + +KN++E+ + + G + +V N R + +K L + FI + Sbjct: 170 VDQPFYMSRKNTIEATRCSTRITG-TNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPD 228 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQ 159 I+ + G S +F + + + A N + N +++ ++ Sbjct: 229 KIQTA--GASIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDV---------IK 277 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 N + + P + R + + LV+D SGSM Sbjct: 278 TSADFQNAPPMRGTEAPPPPTFSLLKSRRRV--VCLVLDKSGSMD--------------- 320 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQ 278 K ++ + A L+L I V+++ +GL+ + + +N + Q Sbjct: 321 ----KEDRLIRMNQAAELYLTQI-----VEKESMVGLVTFDSAAHIQNYLIKITSSSDYQ 371 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +T ++ T ++ +Q +TS + + + I+ LTD Sbjct: 372 KITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE----------------IVLLTD 415 Query: 339 GENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNAD 396 GE+N +S C +A + I TI++ + L + Y + + Sbjct: 416 GEDNGIRS-------CFEAVSRSGAIIHTIAL-GPSAARELETLSDMTGGLRFYANKDLN 467 Query: 397 SLIHVFQNIS 406 SLI F IS Sbjct: 468 SLIDAFSRIS 477 >gi|327270780|ref|XP_003220166.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 906 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 43/282 (15%), Positives = 94/282 (33%), Gaps = 64/282 (22%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL--------IELVVDLSGSMHCA 206 + + + ++ F+ + R P + LV+D+SGSM Sbjct: 258 NMQNKMCDYRSTWEVIMDSADFVNSSPRSAPPPHPTLSLLQTRDRVVCLVLDVSGSM--- 314 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 + ++ LK A LFL I ++ + G++ + ++ Sbjct: 315 ----------------RDFNRIHRLKQAAELFLLQI-----IETGSWAGIVVFNSKASTK 353 Query: 267 IEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 T VRQ ++ + ++ T +++ +Q+ + Sbjct: 354 AFLQQITSDSVRQTLSDHLPTVADGGTSICSGIREGFQVFLQKYSSTEGCE--------- 404 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVS 384 I+ LTDGE+++ S C + + + I TI++ + L Sbjct: 405 -------IVLLTDGEDSSVSS-------CFAEVERSGSTIHTIAL-GPSAAKELEMLANM 449 Query: 385 SPEYHYNVV---NADSLIHVFQNISQ---LMVHRKYSVILKG 420 + ++ +++ LI F IS + + + KG Sbjct: 450 TGGLTFSATDSLDSNGLIDAFSGISSGSGDISQQSIQLESKG 491 >gi|86741605|ref|YP_482005.1| von Willebrand factor, type A [Frankia sp. CcI3] gi|86568467|gb|ABD12276.1| von Willebrand factor, type A [Frankia sp. CcI3] Length = 534 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 50/290 (17%), Positives = 95/290 (32%), Gaps = 64/290 (22%) Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL-RIEMGERPIFL 192 +RL ++ + V + + + L Q + A L R IFL Sbjct: 287 QHRLQTATSRRPALPGVALDARFPTRTLTELPFPASRQVADQLLSAYLDRFRRPSHAIFL 346 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH----- 247 +D+SGSM +++AAL+ AL + D LS Sbjct: 347 ----LDVSGSMAG--------------------SRIAALQAALRGLTGADDTLSGRFARF 382 Query: 248 -VKEDVYMGLIGYTTRVEKNIEPSWG--------TEKVRQYVTRDMDSLILKPTDSTPAM 298 +E + M I + R ++ + V +V + T A+ Sbjct: 383 RGREKITM--ITFAGRANDPVDFAVNDPRPGSADLAGVNTFVDGLR---LQDGTAIYSAL 437 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN--FKSNVNTIKICDK 356 + Y+ + + + I+ +TDGENN+ ++ + Sbjct: 438 EAGYRAAGAAVEADPG--------------YLTSIVLMTDGENNSGISAADFRSSYQRLP 483 Query: 357 AKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 A ++ TI+ A P R + + ++ + SL F++I Sbjct: 484 AAARAVRTFTIAFGEADPAALRDIS--ADTGGAVFDARTS-SLADAFKDI 530 >gi|74209191|dbj|BAE24978.1| unnamed protein product [Mus musculus] Length = 902 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 64/370 (17%), Positives = 130/370 (35%), Gaps = 66/370 (17%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + +KN++E+ + + G + +V N R + +K L + FI + Sbjct: 170 VDQPFYMSRKNTIEATRCSTRITG-TNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPD 228 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQ 159 I+ + G S +F + + + A N + N +++ ++ Sbjct: 229 KIQTA--GASIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDV---------IK 277 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 N + + P + R + + LV+D SGSM Sbjct: 278 TSADFQNAPPMRGTEAPPPPTFSLLESRRRV--VCLVLDKSGSMD--------------- 320 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQ 278 K ++ + A L+L I V+++ +GL+ + + +N + Q Sbjct: 321 ----KEDRLIRMNQAAELYLTQI-----VEKESMVGLVTFDSAAHIQNYLIKITSSSDYQ 371 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +T ++ T ++ +Q +TS + + + I+ LTD Sbjct: 372 KITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE----------------IVLLTD 415 Query: 339 GENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNAD 396 GE+N +S C +A + I TI++ + L + Y + + Sbjct: 416 GEDNGIRS-------CFEAVSRSGAIIHTIAL-GPSAARELETLSDMTGGLRFYANKDLN 467 Query: 397 SLIHVFQNIS 406 SLI F IS Sbjct: 468 SLIDAFSRIS 477 >gi|255037594|ref|YP_003088215.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254950350|gb|ACT95050.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 339 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 70/234 (29%), Gaps = 58/234 (24%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L++D+S SM I +D ++ A K F+ + Sbjct: 103 IDIMLLLDISDSM---------------IEKDLSPNRLEAAKRMARQFIK-------GRL 140 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDK 309 +GLI + + E + ++ SL T A+ A + Sbjct: 141 QDRIGLIVFAGEAVSLCPLTTDYELLYGFLDEVTPSLIPTPGTAIGSALAVAVNRMRDTA 200 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 S K I ++DG+N + T A +K+ TIS+ Sbjct: 201 GES------------------KVAILISDGDNTSGNLGPTTSAQLANA--FGVKVYTISV 240 Query: 370 NASPNGQRL--------------LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + + L+ ++ + +L VF+ I QL Sbjct: 241 GKPKSASKADTTASAGALMDEGELQNIAGIGNGKYFRATDNTALESVFKQIDQL 294 >gi|32964827|ref|NP_034029.2| chloride channel calcium activated 1 [Mus musculus] gi|3925281|gb|AAC79982.1| calcium-sensitive chloride conductance protein-1 [Mus musculus] gi|74183411|dbj|BAE36582.1| unnamed protein product [Mus musculus] gi|124376304|gb|AAI32343.1| Chloride channel calcium activated 1 [Mus musculus] Length = 902 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 64/370 (17%), Positives = 130/370 (35%), Gaps = 66/370 (17%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + +KN++E+ + + G + +V N R + +K L + FI + Sbjct: 170 VDQPFYMSRKNTIEATRCSTRITG-TNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPD 228 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQ 159 I+ + G S +F + + + A N + N +++ ++ Sbjct: 229 KIQTA--GASIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDV---------IK 277 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 N + + P + R + + LV+D SGSM Sbjct: 278 TSADFQNAPPMRGTEAPPPPTFSLLKSRRRV--VCLVLDKSGSMD--------------- 320 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQ 278 K ++ + A L+L I V+++ +GL+ + + +N + Q Sbjct: 321 ----KEDRLIRMNQAAELYLTQI-----VEKESMVGLVTFDSAAHIQNYLIKITSSSDYQ 371 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +T ++ T ++ +Q +TS + + + I+ LTD Sbjct: 372 KITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE----------------IVLLTD 415 Query: 339 GENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNAD 396 GE+N +S C +A + I TI++ + L + Y + + Sbjct: 416 GEDNGIRS-------CFEAVSRSGAIIHTIAL-GPSAARELETLSDMTGGLRFYANKDLN 467 Query: 397 SLIHVFQNIS 406 SLI F IS Sbjct: 468 SLIDAFSRIS 477 >gi|302188504|ref|ZP_07265177.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. syringae 642] Length = 352 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 44/248 (17%), Positives = 82/248 (33%), Gaps = 66/248 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 131 RKGDRVGLILFGTQAFVQAPLTYDRRTVRIWLDEARIGIAGKNTALGDAIGLALKRLRLR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 191 PATS------------------RVLVLVTDGANNAGQIDPITAAR--LAAEEGVKIYPIG 230 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL------IHVFQNI 405 I + P+ L LK S S ++ + D L + + + Sbjct: 231 IGSDPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDALEPV 290 Query: 406 SQLMVHRK 413 +Q + Sbjct: 291 AQQPTQAR 298 >gi|256419476|ref|YP_003120129.1| hypothetical protein Cpin_0430 [Chitinophaga pinensis DSM 2588] gi|256034384|gb|ACU57928.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588] Length = 336 Score = 53.0 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 47/310 (15%), Positives = 96/310 (30%), Gaps = 69/310 (22%) Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 I R + ++ + + + + L + S Sbjct: 31 IARQQRRQVAMEMSSLQGLKGLPVSWKVRLRPLLLVLRLVAFAALVVALARPQTSNTSES 90 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + I L +D+SGSM QD + ++ A K + F+D Sbjct: 91 I-----DSEGIDIVLAMDISGSMLA---------------QDLQPDRLEAAKRVAMNFVD 130 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 S +S +GL+ ++ + ++ + + ++ T + Sbjct: 131 S--RISD-----RIGLVIFSGESFTQCPITTDHGVLKNQIAQVKSGMLQDGTAIGMGLAT 183 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L + K +S K II LTDG NN + T + AK Sbjct: 184 SVERLRTSKAKS------------------KVIILLTDGVNNTGLIDPLTAL--EIAKAF 223 Query: 361 FIKIVTISI----NAS-----------------PNGQRLLKT-CVSSPEYHYNVVNADSL 398 I++ TI + A + L+K V + ++ + L Sbjct: 224 KIRVYTIGVGTIGKAPFPMTMPDGSIQMQMQDVQLDEPLMKKISVETGGKYFRATSNKEL 283 Query: 399 IHVFQNISQL 408 +++ I +L Sbjct: 284 ENIYGEIDKL 293 >gi|167758708|ref|ZP_02430835.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] gi|167663904|gb|EDS08034.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] Length = 1865 Score = 53.0 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 57/163 (34%), Gaps = 30/163 (18%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + G++ + LV+D SGSM N + M LK + +D Sbjct: 535 VTGKRGKKAGVDVLLVIDKSGSMGLNDNGRTDSNYFNL---------MPTLKKTVPTLVD 585 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEK--NIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPA 297 +I L + I +++ +I W + R ++ L T+ A Sbjct: 586 TI--LPDSDSVNRVAAISFSSDDYTGNDISTDWVDYNGKSGFNRKIEGLGTKGGTNWQLA 643 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 M+ A + L + +K ++FL+DGE Sbjct: 644 MRNADKKLKPRAESQN----------------KKVVVFLSDGE 670 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 50/276 (18%), Positives = 85/276 (30%), Gaps = 75/276 (27%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 I R E I LV+D S SM + K+ ++ Sbjct: 1049 IELSASTKGRDEGIAAQAASIVLVLDASASM------------------QENGKKLKDIQ 1090 Query: 233 NALLLFLDSIDLLSHVKEDV---YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLI 288 +A F+++ S + E Y G G ++ + + + T + R + + Sbjct: 1091 DAAKAFVNTTKEKSPISEIAVIWYQGSEGSSSTITDSGFYTLDTSDNVDAINRFISNKNA 1150 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++A IL+ S K+ + TDG SN Sbjct: 1151 SGGTPMGDALEEANSILSGRPNSS------------------KYALLFTDGMPGYNSSNN 1192 Query: 349 N-----TIKICDKAK--ENFIKIVTISI-----------------------NASPNGQRL 378 + ++AK + + K+ TI Sbjct: 1193 SFNCMVANHANNEAKEIKEYAKLYTIGYKLSGSFKWEEGHSQDSTNNHGSHKTETKAADF 1252 Query: 379 LK-TCVSSPE----YHYNVVNADSLIHVFQNISQLM 409 LK SSPE Y Y N D L +F++I+ + Sbjct: 1253 LKNYLASSPEGDRTYAYTTDNTDGLTKIFEDIAGQI 1288 >gi|34541234|ref|NP_905713.1| batA protein [Porphyromonas gingivalis W83] gi|34397550|gb|AAQ66612.1| batA protein [Porphyromonas gingivalis W83] Length = 327 Score = 53.0 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 37/239 (15%), Positives = 73/239 (30%), Gaps = 63/239 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+SGSM D K ++ A K+ + F+ + + Sbjct: 88 IDIMLAMDVSGSMQAM---------------DFKPNRLEAAKDVAISFI---NNRPNDN- 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G++ + + + V ++ T + A L K Sbjct: 129 ---IGMVTFAGESFTQCPLTTDHTVLLNMVQDLQMGVLDDGTAIGMGLATAVNRLKDSKA 185 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 +S + +I LTDG NN D A+ I++ T+ + Sbjct: 186 KS------------------RVVILLTDGSNNMGDITPRMAA--DIARTFGIRVYTVGVG 225 Query: 370 ---NAS----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 A + L S ++ V+ ++L +++ I +L Sbjct: 226 TRGEAPFPIQTEFGVRIQNVPVDIDEPTLDGIAEVSGGKYFRAVDNETLNEIYKEIDKL 284 >gi|325268973|ref|ZP_08135594.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] gi|324988594|gb|EGC20556.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] Length = 318 Score = 53.0 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 43/234 (18%), Positives = 68/234 (29%), Gaps = 54/234 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E I L +D+S SM + D +MA K F+ S Sbjct: 84 ETEGIDIMLTMDVSASM---------------LTDDVYPNRMAVAKEVASEFISS----R 124 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-----TDSTPAMKQA 301 +GL + + + + L+ T + A Sbjct: 125 PNDN---IGLTIFAGEAFTQCPMTLDHAALLNLLHNVRPDLVTSGLMKDGTAIGMGLANA 181 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 L K +S K +I LTDG NN + T AK+ Sbjct: 182 VSRLQDSKAKS------------------KIVILLTDGSNNVGSISPMTAA--AIAKKFG 221 Query: 362 IKIVTISINASPN------GQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 I++ TI R L+ S+ Y + L ++Q+I +L Sbjct: 222 IRVYTIGFGRETGEEIGAIDYRALQNIAVSTNGEFYRAQSQAELSRIYQDIDKL 275 >gi|150375951|ref|YP_001312547.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150030498|gb|ABR62614.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 334 Score = 53.0 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 47/258 (18%), Positives = 88/258 (34%), Gaps = 55/258 (21%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIF------------LIELVVDLSGSMHCAMNSDP 211 + QKI++ I LL + RP F + L +DLS SM SDP Sbjct: 55 VPRANLLQKIIAPICWLLVLTALARPQFVEPPIEKTEPQRDLMLALDLSQSMDTRDFSDP 114 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 Q + ++ A+K + F+D + +GL+ + + + Sbjct: 115 ---------QGNLQARVDAVKTVVADFVDR-------RPYDRLGLVAFGDAPYPLVPFTM 158 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 VR +T + + T A+ + Q + P K Sbjct: 159 DHATVRSMLTGALPGMAGPKTALGDALGL--------------SIKLFQQSQAPD----K 200 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG-------QRLLKTCVS 384 ++ LTDG + K + + A +N I+I T+ I + L K + Sbjct: 201 VLVVLTDGNDTASKMPPDKAA--EIASQNHIRIHTVGIGNPDAQGEEKLDTETLQKIATA 258 Query: 385 SPEYHYNVVNADSLIHVF 402 + ++ + +L ++ Sbjct: 259 TGGRYFFGQDQQALAEIY 276 >gi|297473020|ref|XP_002686329.1| PREDICTED: chloride channel accessory 1 [Bos taurus] gi|296489230|gb|DAA31343.1| chloride channel accessory 1 [Bos taurus] Length = 911 Score = 53.0 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 48/240 (20%), Positives = 87/240 (36%), Gaps = 69/240 (28%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL V++ ++ Sbjct: 311 LVLDKSGSMTIG-------------------NRLKRLNQAGKLFLLQ-----TVEQGSWV 346 Query: 255 GLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R +T+ + + T ++ A+ ++ K+ + Sbjct: 347 GMVTFDSAAYVQSELVQINSATERDTLTKSLPTTASGGTSICSGLRSAFTVI----KKKY 402 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ II LTDGE+N TI C D+ K++ I T+++ Sbjct: 403 PTDGAE-------------IILLTDGEDN-------TISACFDEVKQSGAIIHTVAL-GP 441 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNIS---QLMVHRKYSVILKG 420 Q L L+T S V N + L+ F +S + + R + +G Sbjct: 442 SAAQELEQMSKMTGGLQTYASD-----QVQN-NGLVDAFAALSSGNKAVSQRSIQLESRG 495 >gi|284046352|ref|YP_003396692.1| hypothetical protein Cwoe_4905 [Conexibacter woesei DSM 14684] gi|283950573|gb|ADB53317.1| conserved hypothetical protein [Conexibacter woesei DSM 14684] Length = 317 Score = 53.0 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 34/241 (14%), Positives = 75/241 (31%), Gaps = 63/241 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV D S SM D + ++ A K A FLD + Sbjct: 87 SVMLVTDHSRSMLAE---------------DVEPDRITAAKRAASRFLDQL---PPG--- 125 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + +G+ ++ + P++ + +R+ + + T + A++ A L Sbjct: 126 IRVGVTTFSDVPDGTQTPTYDHDLIRRTIEAQIAD---GGTATGDALQVALDTL------ 176 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 Q + ++ L+DG + V + +A+ I I T+++ Sbjct: 177 ----ERLEQNGERTPAA----MVLLSDGATTTGRDPVMVARAAGEAR---IPIYTVALGT 225 Query: 370 -----------------NASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVH 411 A L+ +S + + L +++ + + Sbjct: 226 RDATVPNPGPTGPPLLPVAPDP--ETLQAIADASGGRAFQAQDDQELSSIYETLGSRLGT 283 Query: 412 R 412 R Sbjct: 284 R 284 >gi|160875970|ref|YP_001555286.1| von Willebrand factor type A [Shewanella baltica OS195] gi|160861492|gb|ABX50026.1| von Willebrand factor type A [Shewanella baltica OS195] gi|315268165|gb|ADT95018.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 339 Score = 53.0 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 72/235 (30%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + K + +++ + F++ ++ Sbjct: 85 DLMMAVDLSGSM----------QIEDMVINGKTVDRFTLIQHVVSDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIGLAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAGNIEPEQAAQ--IAANRKVTIYTVGVGA 227 Query: 372 S-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + LK + ++ N+ L ++Q I +L Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYQEIDKL 282 >gi|329664002|ref|NP_001193105.1| calcium-activated chloride channel regulator 1 [Bos taurus] Length = 909 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 48/240 (20%), Positives = 87/240 (36%), Gaps = 69/240 (28%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL V++ ++ Sbjct: 309 LVLDKSGSMTIG-------------------NRLKRLNQAGKLFLLQ-----TVEQGSWV 344 Query: 255 GLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R +T+ + + T ++ A+ ++ K+ + Sbjct: 345 GMVTFDSAAYVQSELVQINSATERDTLTKSLPTTASGGTSICSGLRSAFTVI----KKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ II LTDGE+N TI C D+ K++ I T+++ Sbjct: 401 PTDGAE-------------IILLTDGEDN-------TISACFDEVKQSGAIIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNIS---QLMVHRKYSVILKG 420 Q L L+T S V N + L+ F +S + + R + +G Sbjct: 440 SAAQELEQMSKMTGGLQTYASD-----QVQN-NGLVDAFAALSSGNKAVSQRSIQLESRG 493 >gi|189347765|ref|YP_001944294.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341912|gb|ACD91315.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 325 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/236 (13%), Positives = 77/236 (32%), Gaps = 48/236 (20%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 +++ + + L +D+S SM +++ A+K+ F+ Sbjct: 88 VVKQTVAHSRGIDVILALDISESMQLKDAGGR--------------SRLDAVKSVAREFV 133 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAM 298 H + +G++ + + + + D + + T A+ Sbjct: 134 TR-----HSND--RIGVVVFKGKGYTLSPLTLDHRVTGMLIDNVSPDVIRDEGTAVGTAV 186 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A L + + QK II +DG +N + + T A Sbjct: 187 LIAVNRLRASQSD------------------QKVIILFSDGVSNAGEIDPVTAA--SFAA 226 Query: 359 ENFIKIVTISINASP------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I+I T ++ + L + +++ ++ + SL F++I +L Sbjct: 227 AQGIRIYTAGAGSASSASSALDEGELRRVALTAGGRYFRAGTSASLAEAFESIDRL 282 >gi|148680076|gb|EDL12023.1| mCG3350, isoform CRA_c [Mus musculus] Length = 907 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 64/370 (17%), Positives = 130/370 (35%), Gaps = 66/370 (17%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + +KN++E+ + + G + +V N R + +K L + FI + Sbjct: 175 VDQPFYMSRKNTIEATRCSTRITG-TNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPD 233 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQ 159 I+ + G S +F + + + A N + N +++ ++ Sbjct: 234 KIQTA--GASIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDV---------IK 282 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 N + + P + R + + LV+D SGSM Sbjct: 283 TSADFQNAPPMRGTEAPPPPTFSLLKSRRRV--VCLVLDKSGSMD--------------- 325 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQ 278 K ++ + A L+L I V+++ +GL+ + + +N + Q Sbjct: 326 ----KEDRLIRMNQAAELYLTQI-----VEKESMVGLVTFDSAAHIQNYLIKITSSSDYQ 376 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +T ++ T ++ +Q +TS + + + I+ LTD Sbjct: 377 KITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE----------------IVLLTD 420 Query: 339 GENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNAD 396 GE+N +S C +A + I TI++ + L + Y + + Sbjct: 421 GEDNGIRS-------CFEAVSRSGAIIHTIAL-GPSAARELETLSDMTGGLRFYANKDLN 472 Query: 397 SLIHVFQNIS 406 SLI F IS Sbjct: 473 SLIDAFSRIS 482 >gi|219518504|gb|AAI45058.1| Chloride channel calcium activated 1 [Mus musculus] Length = 902 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 64/370 (17%), Positives = 130/370 (35%), Gaps = 66/370 (17%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + +KN++E+ + + G + +V N R + +K L + FI + Sbjct: 170 VDQPFYMSRKNTIEATRCSTRITG-TNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPD 228 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQ 159 I+ + G S +F + + + A N + N +++ ++ Sbjct: 229 KIQTA--GASIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDV---------IK 277 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 N + + P + R + + LV+D SGSM Sbjct: 278 TSADFQNAPPMRGTEAPPPPTFSLLKSRRRV--VCLVLDKSGSMD--------------- 320 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQ 278 K ++ + A L+L I V+++ +GL+ + + +N + Q Sbjct: 321 ----KEDRLIRMNQAAELYLTQI-----VEKESMVGLVTFDSAAHIQNYLIKITSSSDYQ 371 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +T ++ T ++ +Q +TS + + + I+ LTD Sbjct: 372 KITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE----------------IVLLTD 415 Query: 339 GENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNAD 396 GE+N +S C +A + I TI++ + L + Y + + Sbjct: 416 GEDNGIRS-------CFEAVSRSGAIIHTIAL-GPSAARELETLSDMTGGLRFYANKDLN 467 Query: 397 SLIHVFQNIS 406 SLI F IS Sbjct: 468 SLIDAFSRIS 477 >gi|213963729|ref|ZP_03391979.1| BatA protein [Capnocytophaga sputigena Capno] gi|213953609|gb|EEB64941.1| BatA protein [Capnocytophaga sputigena Capno] Length = 333 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 50/301 (16%), Positives = 83/301 (27%), Gaps = 68/301 (22%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER--PIFLIE 194 + S+ F M Y Y F L I P + I Sbjct: 37 VTLSSTIAFKKMSSWKDYLYHSLFAFRLFAVTLI-IIALARPQTHSENAQTKITDGIDIV 95 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + +D+S SM QD K + ALK F+ + + + Sbjct: 96 MAIDVSSSMLS---------------QDLKPNRFEALKKVASQFVK-------DRPNDRI 133 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 GL+ Y + + + + T + A L K +S Sbjct: 134 GLVVYAGESYTKTPVTTDKGIILSSLAELTYGQVEDGTAIGMGLATAVNRLKESKAKS-- 191 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----N 370 + II LTDG NN + + A E IK+ T+ I Sbjct: 192 ----------------RVIILLTDGVNNTGVIDPLIAA--ELAAEYGIKVYTVGIGTNGM 233 Query: 371 A-------SPN-----------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 A + L+K + ++ N L ++ I++L Sbjct: 234 ALSPYALNPDGSIMYRMLQVEIDESLMKKIAQVTHGRYFRATNNQKLQQIYDEINKLETS 293 Query: 412 R 412 + Sbjct: 294 K 294 >gi|26328325|dbj|BAC27903.1| unnamed protein product [Mus musculus] Length = 902 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 64/370 (17%), Positives = 130/370 (35%), Gaps = 66/370 (17%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + +KN++E+ + + G + +V N R + +K L + FI + Sbjct: 170 VDQPFYMSRKNTIEATRCSTRITG-TNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPD 228 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQ 159 I+ + G S +F + + + A N + N +++ ++ Sbjct: 229 KIQTA--GASIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDV---------IK 277 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 N + + P + R + + LV+D SGSM Sbjct: 278 TSADFQNAPPMRGTEAPPPPTFSLLKSRRRV--VCLVLDKSGSMD--------------- 320 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQ 278 K ++ + A L+L I V+++ +GL+ + + +N + Q Sbjct: 321 ----KEDRLIRMNQAAELYLTQI-----VEKESMVGLVTFDSAAHIQNYLIKITSSSDYQ 371 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +T ++ T ++ +Q +TS + + + I+ LTD Sbjct: 372 KITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE----------------IVLLTD 415 Query: 339 GENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNAD 396 GE+N +S C +A + I TI++ + L + Y + + Sbjct: 416 GEDNGIRS-------CFEAVSRSGAIIHTIAL-GPSAARELETLSDMTGGLRFYANKDLN 467 Query: 397 SLIHVFQNIS 406 SLI F IS Sbjct: 468 SLIDAFSRIS 477 >gi|257056239|ref|YP_003134071.1| hypothetical protein Svir_22360 [Saccharomonospora viridis DSM 43017] gi|256586111|gb|ACU97244.1| Mg-chelatase subunit ChlD [Saccharomonospora viridis DSM 43017] Length = 326 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 77/239 (32%), Gaps = 51/239 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + LV+D+S SM D K T++ A ++A F Sbjct: 86 RNRATVMLVIDVSLSMEA---------------TDVKPTRLRAAQDAARSF------AEG 124 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + V +GLI + + T + V + + + T + + A Q + S Sbjct: 125 LTPGVNLGLISF---AGTATVLAAPTTEREGVVHAIENLKLAQSTATGEGIFAALQAIES 181 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN--VNTIKICDK-AKENFIKI 364 P + I+ +TDG+ + AK+ I I Sbjct: 182 FSAVIGGAE--------GPPPAR--IVLMTDGKQTVPQDEYAPRGAFTAAGVAKQKGIPI 231 Query: 365 VTISI-----NASPNGQRL--------LKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 TIS + +G R+ ++ S Y A+ L V+ ++ + + Sbjct: 232 TTISFGTSYGSVEIDGTRVPVEVDDASMREIARLSGGDFYKAATAEELKQVYDSLGEQI 290 >gi|24374613|ref|NP_718656.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24349233|gb|AAN56100.1|AE015746_4 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 338 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 70/235 (29%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM V L ++ Sbjct: 85 DLMLAVDLSGSMQIEDMVIDGKVVDRFT-----------LIQHVVSEFIERRKGD----- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T ++ A + + Sbjct: 129 -RIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + ++ LTDG NN N+ + A + I T+ + A Sbjct: 188 N------------------RVLVLLTDGSNNA--GNIEPQQAAQIAANRKVTIYTVGVGA 227 Query: 372 SPNGQRLL------------------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +R L + ++ ++ N++ L ++Q I +L Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLQQIADATHGRYFRARNSEELEQIYQEIDKL 282 >gi|32475925|ref|NP_868919.1| hypothetical protein RB9502 [Rhodopirellula baltica SH 1] gi|32446468|emb|CAD76304.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 368 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 85/240 (35%), Gaps = 44/240 (18%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 E P + L+VDLSGSM KK +++ A+K L FL Sbjct: 109 TKEIPTRDLLLLVDLSGSMA---------QEDFKNDAGKKVSRLDAVKEVLDGFLAK--- 156 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ +GL+ + + + ++ + + T A+ + Sbjct: 157 ----RKGDRVGLVVFGDAAYLQAPFTTDLQLSQELLGECEVGMAGPRTAFGDAIGLGVNL 212 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 D +R+ K II LTDG N+ KS V ++ A + IKI Sbjct: 213 FDEDTERA------------------KTIIALTDG--NDTKSKVPPVEAARVATQRDIKI 252 Query: 365 VTISINASPN------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 T++I ++ LK S + ++ + + L ++ + + + + I Sbjct: 253 YTVAIGDPTTVGEDKLDEQSLKDVASETGGKYFFAADREHLAGIYDELDK-IETQTIQTI 311 >gi|153812017|ref|ZP_01964685.1| hypothetical protein RUMOBE_02410 [Ruminococcus obeum ATCC 29174] gi|149831916|gb|EDM87002.1| hypothetical protein RUMOBE_02410 [Ruminococcus obeum ATCC 29174] Length = 2099 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 49/306 (16%), Positives = 103/306 (33%), Gaps = 84/306 (27%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMH-------------------------------- 204 + A + + + I LV+D SGSM Sbjct: 1265 LTASSKTTQSMKTPYDIVLVLDQSGSMSQKFVEYNKINGSSMFWRKTYYIKTQNGIYQQL 1324 Query: 205 -CAMNSDPEDVNSAP--------------ICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++ +S + Q +TK+ ALK+A F ++ +++ Sbjct: 1325 SWSWDNTWSYTDSYSGKTVTVDPNTTDVYVAQKSNQTKIDALKSAATTF---VNNVANKN 1381 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQY------VTRDMDSLILKPTDSTPAMKQAYQ 303 D +G++ ++ + + + + + +D L D+ PA Sbjct: 1382 SDCRVGIVTFSNDGY-IKPITNNSYTLAKVGTSKGDIINTIDGLKTGG-DTYPAKGLDKA 1439 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-----ENNNFKSNVNTIKICDKAK 358 + S Q + +K ++FLTDG NNF N+ + AK Sbjct: 1440 NEIFSENSSNSWETVEQ-----TDGRKKMVVFLTDGVPAPANTNNFDENLAGAGT-NSAK 1493 Query: 359 ---ENFIKIVTISI--NASPNGQ----------RLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + + + I A+ +G + +++ SS E + + D+L +F+ Sbjct: 1494 ILHDQGVATYALGIFGAANSDGTMDNASVQRIDKYMQSIASSHEKYMTADSVDNLSSLFE 1553 Query: 404 NISQLM 409 +I+ + Sbjct: 1554 SITNNI 1559 >gi|261409463|ref|YP_003245704.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285926|gb|ACX67897.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 421 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 87/234 (37%), Gaps = 52/234 (22%) Query: 182 RIEMGE-RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 ++ G +P+ I LV+D SGSM D + + A K Sbjct: 103 QLNPGSAKPVKDIVLVIDNSGSM---------------NETDPNQDRYTAAK-------- 139 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLIL--KPTDSTPA 297 +L++ + D + ++ + + + + + ++D L TD + A Sbjct: 140 --NLINRMDRDNRVSVMVFDHATTLLQPFTRVKNQETKDEIIAEIDGLATNDGGTDISLA 197 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 ++ + + +I L+DG + + + Sbjct: 198 LE----------------DTMSHIQESRDAGRSAMVIMLSDG-FSETDHDRVLAEY---- 236 Query: 358 KENFIKIVTISIN-ASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 K+ I + TI ++ +P+G +LL+T + + +Y+V +A+ L VFQ I + Sbjct: 237 KQQQIAVNTIGLSLVNPDGAQLLQTIAAETGGQYYDVQHAEDLSFVFQKIYDDV 290 >gi|153835956|ref|ZP_01988623.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|260880154|ref|ZP_05892509.1| von Willebrand factor type A [Vibrio parahaemolyticus AN-5034] gi|260895271|ref|ZP_05903767.1| von Willebrand factor type A [Vibrio parahaemolyticus Peru-466] gi|260900622|ref|ZP_05909017.1| von Willebrand factor type A [Vibrio parahaemolyticus AQ4037] gi|149750710|gb|EDM61455.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|308085798|gb|EFO35493.1| von Willebrand factor type A [Vibrio parahaemolyticus Peru-466] gi|308091801|gb|EFO41496.1| von Willebrand factor type A [Vibrio parahaemolyticus AN-5034] gi|308107055|gb|EFO44595.1| von Willebrand factor type A [Vibrio parahaemolyticus AQ4037] Length = 334 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 81/235 (34%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM ED+N D +++A+K L F+ ++ Sbjct: 98 DLMLVVDLSGSMQ------KEDMNLDGEYID----RLSAVKKVLSDFVAK-------RKG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + + V Q + + + L+ + T + Sbjct: 141 DRLGVVLFGDHAYLQTPLTADRQTVIQQIKQTVIGLVGQRTAIGDGI------------- 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 + S Q+ +I L+DG N + + AK+ I T+ + Sbjct: 188 -----GLGTKTFVDSDAPQRVMILLSDGSNTAGVLDPIEAA--EIAKKYNATIYTVGVGA 240 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 A + Q L K + ++ +A+ L ++ I++L Sbjct: 241 GEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAEQLEKIYDTINKL 295 >gi|28901309|ref|NP_800964.1| hypothetical protein VPA1454 [Vibrio parahaemolyticus RIMD 2210633] gi|308125557|ref|ZP_05775735.2| von Willebrand factor type A [Vibrio parahaemolyticus K5030] gi|28809856|dbj|BAC62797.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308112309|gb|EFO49849.1| von Willebrand factor type A [Vibrio parahaemolyticus K5030] Length = 328 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 81/235 (34%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM ED+N D +++A+K L F+ ++ Sbjct: 92 DLMLVVDLSGSMQ------KEDMNLDGEYID----RLSAVKKVLSDFVAK-------RKG 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + + V Q + + + L+ + T + Sbjct: 135 DRLGVVLFGDHAYLQTPLTADRQTVIQQIKQTVIGLVGQRTAIGDGI------------- 181 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 + S Q+ +I L+DG N + + AK+ I T+ + Sbjct: 182 -----GLGTKTFVDSDAPQRVMILLSDGSNTAGVLDPIEAA--EIAKKYNATIYTVGVGA 234 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 A + Q L K + ++ +A+ L ++ I++L Sbjct: 235 GEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAEQLEKIYDTINKL 289 >gi|149918750|ref|ZP_01907237.1| aerotolerance-related membrane protein [Plesiocystis pacifica SIR-1] gi|149820351|gb|EDM79767.1| aerotolerance-related membrane protein [Plesiocystis pacifica SIR-1] Length = 350 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 70/227 (30%), Gaps = 46/227 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E I + +DLS SM M+ ++ K + F+ Sbjct: 117 EHEGIDIVIALDLSDSMSNPMDGRRGLGLD----------RLTVAKQVIDEFIRR----- 161 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQIL 305 + + L+G+ + +R + + ++ T + + L Sbjct: 162 --RPHDRIALVGFGAHASTIAPLTLDHAVLRNLIVQVRLGVVDGQETAIGAGLGVSLNRL 219 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S K I+ LTDG +N + +T+ A E + I Sbjct: 220 ------------------KESQAATKIIVLLTDGVHNADGMDPDTVAQ--TAAERGVVIY 259 Query: 366 TISINASPNGQ-------RLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 T+ + G +L + ++ Y Y + +L FQ++ Sbjct: 260 TV-LMGQQTGDRSSVDAGQLERLAGATDGYAYLAEDTQTLETSFQDL 305 >gi|149414665|ref|XP_001516049.1| PREDICTED: similar to integrin alpha 11 subunit [Ornithorhynchus anatinus] Length = 1194 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 60/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 191 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 246 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + AY I + + K +K +I +TDGE + + ++ D + Sbjct: 247 -RTAYGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEQVIDAS 293 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 +++ I +++ +K S P+ + +NV + +L + + Sbjct: 294 EKDNITRYAVAVLGYYNRRGINPEAFLSEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 353 Query: 408 LM 409 + Sbjct: 354 RI 355 >gi|331012285|gb|EGH92341.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 352 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 75/227 (33%), Gaps = 60/227 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ + L Sbjct: 131 RKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRLRLR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 191 PATS------------------RVLVLVTDGANNAGQIDPITAAR--LAAEEGVKIYPIG 230 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL 398 I A P+ L LK S S ++ + D L Sbjct: 231 IGADPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQL 277 >gi|330989218|gb|EGH87321.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. lachrymans str. M301315] Length = 352 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 75/227 (33%), Gaps = 60/227 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ + L Sbjct: 131 RKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRLRLR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 191 PATS------------------RVLVLVTDGANNAGQIDPITAAR--LAAEEGVKIYPIG 230 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL 398 I A P+ L LK S S ++ + D L Sbjct: 231 IGADPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQL 277 >gi|257482758|ref|ZP_05636799.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 265 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 75/227 (33%), Gaps = 60/227 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 4 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 43 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ + L Sbjct: 44 RKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRLRLR 103 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 104 PATS------------------RVLVLVTDGANNAGQIDPITAAR--LAAEEGVKIYPIG 143 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL 398 I A P+ L LK S S ++ + D L Sbjct: 144 IGADPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQL 190 >gi|71737462|ref|YP_275714.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. phaseolicola 1448A] gi|71558015|gb|AAZ37226.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. phaseolicola 1448A] gi|320329710|gb|EFW85699.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] gi|330882170|gb|EGH16319.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] Length = 352 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 75/227 (33%), Gaps = 60/227 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ + L Sbjct: 131 RKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRLRLR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 191 PATS------------------RVLVLVTDGANNAGQIDPITAAR--LAAEEGVKIYPIG 230 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL 398 I A P+ L LK S S ++ + D L Sbjct: 231 IGADPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQL 277 >gi|56797869|emb|CAG27568.1| matrilin-4 [Danio rerio] Length = 685 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 72/195 (36%), Gaps = 29/195 (14%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K + +D +D+ + +GL+ Y++RV S K Sbjct: 465 RPQNFELVKQFVNQVVDQLDV---SAKGTRVGLVQYSSRVRTEFPLSMYHSKDEIKKAEM 521 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + K T + A+K + S+ + + IP + +F TDG Sbjct: 522 NVEYMEKGTMTGLALKHMVENSFSEAEGARPAEK-----NIPRVGL----VF-TDG---R 568 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 + ++ + KAKE I + + + + + L+ S P ++ + + Sbjct: 569 SQDDIQ--EWAKKAKEAGITMYAVGVGKAVEDE--LREIASDPVEKHFFYSAD------- 617 Query: 402 FQNISQLMVHRKYSV 416 F ISQ+ + K +V Sbjct: 618 FTAISQIAENLKLNV 632 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 62/169 (36%), Gaps = 25/169 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 +D I L +G++ Y+++V+ K Q V + + L T + Sbjct: 45 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLK-AFSKTEQMVKAINEIIPLAQGTMTGL 103 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S ++ P++P I+ TDG + + V Sbjct: 104 AIRYAMNVAFSAEE-----------GARPNVPHVAVIV--TDGRPQDRVAEVAAAAR--- 147 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 E+ I+I + + L+ S P ++ + V + LIH F Sbjct: 148 --ESGIEIYAVGVA--RADMTSLRAMASPPFEDHVFLVESF-DLIHQFG 191 >gi|332884779|gb|EGK05035.1| hypothetical protein HMPREF9456_03188 [Dysgonomonas mossii DSM 22836] Length = 327 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/276 (13%), Positives = 85/276 (30%), Gaps = 63/276 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y F+ ++ +++ ++ ++ + I + +D+SG+M Sbjct: 51 YMRHFPFLLRVIAIALVIIVLARPQSVNSSDISKSEGIDIIMALDISGTMMA-------- 102 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D T++ A K F+ + ++ +GL+ + + Sbjct: 103 -------NDFSPTRLEAAKKVASEFI-------NDRQSDRIGLVIFAGESFTQCPLTTDH 148 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + ++ +I T + + L + +S + + Sbjct: 149 RVLLNLLSEVKFGMIEDGTAIGLGLANSVNRLKDSQSKS------------------RVV 190 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS----------INASPNGQRL----- 378 I LTDG NN + T + A I++ TI + Q + Sbjct: 191 ILLTDGSNNAGQIAPLTAA--ELAASYGIRVYTIGIGSRGTSVARVMTPYGMQSMNVSGD 248 Query: 379 -----LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 L S + ++ + SL ++ I Q+ Sbjct: 249 FDERTLTEIASKTGGSYFRATDNTSLSGIYDEIDQM 284 >gi|254458660|ref|ZP_05072084.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084426|gb|EDZ61714.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 308 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 42/256 (16%), Positives = 90/256 (35%), Gaps = 40/256 (15%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRI--EMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + L + I++ + + E+ + + I L++D S SM Sbjct: 48 SSKLLLFLKWLSISMLIIALMSPVKDEPYEIEPKKGYEIALILDASESM----------K 97 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 +++ T+ +K + F+ S + MG++ + ++ + Sbjct: 98 AKGFDEKNRDLTRFDVVKEIVSNFISSRKNDN-------MGIVVFGAYSFIASPLTYDSN 150 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ V+ + K T ++ Q +L + K ++ K I Sbjct: 151 ILKGVVSNLYIGMAGKFTALFESLAQGVNLLKTSKSKT------------------KIAI 192 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVS-SPEYHYNV 392 LTDG N I D A + +K+ I I + Q++L+ + + Sbjct: 193 LLTDGYNTPDSEFPFDAAI-DFANKQGVKVYPIGIGKSDEYNQKMLEKIAEQTGGVAFGA 251 Query: 393 VNADSLIHVFQNISQL 408 NA L V+ I++L Sbjct: 252 SNASELAIVYAKINEL 267 >gi|171913221|ref|ZP_02928691.1| hypothetical protein VspiD_18615 [Verrucomicrobium spinosum DSM 4136] Length = 868 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 70/221 (31%), Gaps = 53/221 (23%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + E+ + LV+D SGSM K+ K+A + + + Sbjct: 406 DEEEKQSSALALVIDRSGSMSGE--------------------KLEMAKSAAIATAEVLT 445 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAY 302 +G+ + + + + T V + L T+ PA +A Sbjct: 446 RNDS------IGVYAFDSEAHVVVPMTRLTS--SSAVAGQIAGLTSGGGTNLHPAFTEAR 497 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 L K + K +I LTDG + + C + + Sbjct: 498 NALQRTKAKI------------------KHMIILTDG-QTSGQGYEALASQC---RAEGV 535 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVF 402 I T++I LL+ S Y ++A +++ +F Sbjct: 536 TISTVAI-GDGAHVGLLQAIASLGGGKSYTTLDAANIVRIF 575 >gi|21232653|ref|NP_638570.1| hypothetical protein XCC3224 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66767265|ref|YP_242027.1| hypothetical protein XC_0933 [Xanthomonas campestris pv. campestris str. 8004] gi|21114459|gb|AAM42494.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572597|gb|AAY48007.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 335 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 75/238 (31%), Gaps = 54/238 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM + ++ A K L FLD Sbjct: 96 REARQMMLAVDLSGSMS----------EPDMVLGGNVVDRLTAAKAVLSDFLDR------ 139 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +E +GL+ + R + VR + + L + T A+ + + L Sbjct: 140 -REGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVVGLAGRETAIGDAIALSVKRL-- 196 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 Q+ ++ LTDG N N +K + AK +++ TI Sbjct: 197 ----------------REQRQGQRVVVLLTDGVNTAGVLNP--LKAAELAKAEGVRVHTI 238 Query: 368 SINASPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + S + L+ + + + + L ++ + +L Sbjct: 239 AFGGSGSYSLFGVPIPAGGGDDIDEDGLRKIAEQTGGRFFRARDTEELAGIYAELDRL 296 >gi|163848731|ref|YP_001636775.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|163670020|gb|ABY36386.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] Length = 845 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/240 (17%), Positives = 78/240 (32%), Gaps = 50/240 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +R I ++D S SM +K K A +L L ++ Sbjct: 391 QRAPVSILFIIDRSASMSATFG----------------ISKFDMAKEAAILSLTTLQPGD 434 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAYQI 304 +G++ + T + E V +D + + T+ A+ Sbjct: 435 ------RVGVLAFDTETIWTVPFRTVGEGVSLVELQDQIATMSLGGGTNIERALSVGLPA 488 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L ++ + + LTDG + + ++ + A+ I + Sbjct: 489 LANEPYSTRHA------------------VLLTDG-RSYSNNYPRYQQLVETARAAQITL 529 Query: 365 VTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIH-VFQN----ISQLMVHRKYSVIL 418 TI+I S + LL S +Y V +A L FQ ++L V + VIL Sbjct: 530 STIAI-GSDSDTELLNQLASWGNGRYYFVADATDLPRITFQESEIATAELTVEQPVPVIL 588 >gi|291398583|ref|XP_002715574.1| PREDICTED: chloride channel accessory 1-like [Oryctolagus cuniculus] Length = 911 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 44/225 (19%), Positives = 83/225 (36%), Gaps = 66/225 (29%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL + + ++ Sbjct: 309 LVLDKSGSMSSD-------------------NRLNRLNQAGKLFL--LQTIEQGS---WV 344 Query: 255 GLIGYTTRVEKNIEPSWGTEKV-RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + + E R+ +T+ + ++ T ++ A+ ++ K+ + Sbjct: 345 GMVTFDSAAQVRSELRQIKSGTDREALTKSLPTVPSGGTSICSGLRVAFSVI----KKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TIK+C D+ +++ I T+++ Sbjct: 401 PTDGSE-------------IVLLTDGEDN-------TIKVCFDEVRQSGAIIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 L L+T S V N + LI F +S Sbjct: 440 SAALELEELSKMTGGLQTYASD-----QVQN-NGLIDAFGALSSE 478 >gi|323488845|ref|ZP_08094085.1| hypothetical protein GPDM_05856 [Planococcus donghaensis MPA1U2] gi|323397543|gb|EGA90349.1| hypothetical protein GPDM_05856 [Planococcus donghaensis MPA1U2] Length = 857 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 78/213 (36%), Gaps = 50/213 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E P + +V+D SGSM KM K A ++ + Sbjct: 402 ELPSLGLMIVMDRSGSMMGL--------------------KMELAKEAAARSVELLRSDD 441 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +G+I + + + I P+ + ++ + + T+ +++QAY L Sbjct: 442 T------LGVIAFDDQPWE-ILPTGKVDDPKKAADKILSITPGGGTEIYRSLEQAYTELE 494 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + +K II LTDG+++ I + K++ I + T Sbjct: 495 DLELQ------------------RKHIILLTDGQSSTSNDYDALI---ENGKDHNITLST 533 Query: 367 ISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 +SI + LL+ + Y+V +A ++ Sbjct: 534 VSI-GQDADRNLLEQLAGTGSGRFYDVTDATTI 565 >gi|315923825|ref|ZP_07920054.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC 23263] gi|315622858|gb|EFV02810.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC 23263] Length = 969 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 73/222 (32%), Gaps = 21/222 (9%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK-EDVYMGLIGYTTRVEKNI 267 D + + ++T++ K+A +D + + V + L+ + T Sbjct: 145 GDNYQYHEYTGKRYSQKTRLDVAKSATNTMIDQLLANNATNPGSVRISLVSFDTFASDAT 204 Query: 268 EPSWGTEKVRQYVT------RDMDSLILKPTDSTPAMKQAYQILTSDKKRSF-----FTN 316 S +E + V T+ A+++A + N Sbjct: 205 AWSTSSENLHSIVNGYKTPQSSHLGGHRGGTNWEDALQKADGTQPRADAQKHVIFVSDGN 264 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI-------VTISI 369 + I P ++ G+++ + S+ N D AK++ KI T+ Sbjct: 265 PTFRISSINGNPDDQYNDVHGHGDDDYYHSHPNYNY--DAAKDDAKKIVDGGAAFYTVGT 322 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + L T + + +Y + +L F+NI + H Sbjct: 323 FGDAARMQNLATEAGASDNYYKADDEAALKAAFKNIVASITH 364 >gi|126303381|ref|XP_001379571.1| PREDICTED: similar to matrilin-4 [Monodelphis domestica] Length = 623 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 66/172 (38%), Gaps = 24/172 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTD 293 ++ I L +G+I Y+++V+ G R+ + R + +++ + T Sbjct: 57 RFLVNIIRGLDIGPNATRVGVIQYSSQVQSVFPL--GAFSRREDMERAIHAIVPLAQGTM 114 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ A + S + G + + + +TDG + ++ Sbjct: 115 TGLAIQYAMNVAFSVAE----------GARPSQARVPRVAVIVTDG---RPQD--RVTEV 159 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 +A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 160 AAQARNRGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQQFG 208 Score = 43.7 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 69/177 (38%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTE-KVRQYVTRDMDSLILKPTDSTPAMK 299 +D L E +GL+ Y++RV +GT +V+Q V + K T + A++ Sbjct: 415 VDFLDVSPEGTRVGLVQYSSRVRTEFPLGRYGTADEVKQAV--LAVEYMEKGTMTGLALR 472 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + + + + TDG + +++ +AKE Sbjct: 473 HLVEHSFSE----------AQGARPRAQNVPRVGLVFTDG---RSQDDISV--WAARAKE 517 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + L+ S P + + D F ++ ++ + K ++ Sbjct: 518 EGIIMYAVGVG--KAVEEELREIASDPPEQHVSYSPD-----FSTMTHMLENLKVNI 567 >gi|160889563|ref|ZP_02070566.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|317480055|ref|ZP_07939167.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] gi|156861080|gb|EDO54511.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|316903797|gb|EFV25639.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] Length = 327 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/276 (15%), Positives = 85/276 (30%), Gaps = 63/276 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y + F+ ++ +++ + E I L +D+S SM Sbjct: 51 YLLHVPFVLRIIALILIILVLARPQTTDSWQNSEIEGIDIMLAMDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K+ F++ +G+ + + Sbjct: 104 --------DLKPNRLEAAKDVAAEFIN----GRPNDN---IGITLFAGESFTQCPLTVDH 148 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + LI T + A L K +S K I Sbjct: 149 AVLLNLLKDMKCGLIEDGTAIGMGIANAVTRLKDSKAKS------------------KVI 190 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------- 372 I LTDG NN K +++ + + AK I++ TI + A Sbjct: 191 ILLTDGVNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVE 248 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ ++ + L V++ I +L Sbjct: 249 IDEKTLTQIAGTTEGNYFRATSNSKLKEVYEEIDKL 284 >gi|289547502|ref|NP_001166098.1| integrin, alpha 11a [Danio rerio] Length = 1190 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 62/186 (33%), Gaps = 31/186 (16%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TD 293 ++ + + +G++ Y +V + V V T+ Sbjct: 187 DFLINILRKFYVGPGQIQVGVLQYGEKVVSEFQL-NDFRSVEDVVKAARKIGQRGGEETN 245 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + + A + +K +I +TDGE ++ +++ ++ Sbjct: 246 TALGINVARSEAF---------------KQGGRRGAKKVMIVITDGE-SHDSADLQ--QV 287 Query: 354 CDKAKENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQ 403 ++++++ I I++ +K S P+ + +NV + +L + Sbjct: 288 IEESEKDGITRYAIAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVD 347 Query: 404 NISQLM 409 + + + Sbjct: 348 ALGERI 353 >gi|55793317|gb|AAV65699.1| integrin alpha 11 subunit [Danio rerio] Length = 1168 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 62/186 (33%), Gaps = 31/186 (16%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TD 293 ++ + + +G++ Y +V + V V T+ Sbjct: 165 DFLINILRKFYVGPGQIQVGVLQYGEKVVSEFQL-NDFRSVEDVVKAARKIGQRGGEETN 223 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + + A + +K +I +TDGE ++ +++ ++ Sbjct: 224 TALGINVARSEAF---------------KQGGRRGAKKVMIVITDGE-SHDSADLQ--QV 265 Query: 354 CDKAKENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQ 403 ++++++ I I++ +K S P+ + +NV + +L + Sbjct: 266 IEESEKDGITRYAIAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVD 325 Query: 404 NISQLM 409 + + + Sbjct: 326 ALGERI 331 >gi|149202124|ref|ZP_01879097.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] gi|149144222|gb|EDM32253.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] Length = 584 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 26/57 (45%), Gaps = 2/57 (3%) Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNAD 396 N K + IC +AK I I TI+ + P G +K C SS +HYN + Sbjct: 509 NRSKKDERLDDICREAKSEGIVIYTIAFEMGSQPTGADKIKKCASSVNHHYNATTVN 565 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 64/308 (20%), Positives = 120/308 (38%), Gaps = 72/308 (23%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M +R Y ++ E+ +++ + FL++ G I ++ + S+++ + A Sbjct: 1 MRETTRAAEYAERFAKDEEGTVTVLAFAIFVMFLVMGGIGIDMMRQEMARASLQATLDRA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +LAGA+ + +N RA+I+D F K+ + L+ A +I Sbjct: 61 VLAGATAV--------------NNATARAVIEDY--FAKSGQSDYLAAQEAG----DIDI 100 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 +NSS+ +T A LD+ Y RL ++ L + + V+ Sbjct: 101 RLNSSK--VTARATQTLDT--------------YLMRLAGVDTLTSAGNSTAEVTI---- 140 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 P I + +D+SGSM A ++ ALK A + F+D Sbjct: 141 --------PKLEIAMALDVSGSMIGA--------------------RIDALKPAAIEFVD 172 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEP--SWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 SI L S D + ++ ++ V + E + + +Y + ++ TD+T Sbjct: 173 SI-LDSTEPNDAVISVVPFSWGVTPSKEIYEALTVNETHKY-SSCLELNDSHFTDTTIDP 230 Query: 299 KQAYQILT 306 AY L Sbjct: 231 NTAYNQLI 238 >gi|260463263|ref|ZP_05811464.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030853|gb|EEW32128.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 661 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/259 (14%), Positives = 89/259 (34%), Gaps = 56/259 (21%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ + N++++ A++++ + + I + + +K + +A +AA A A ++ Sbjct: 6 RQFRRDRRGNYALMTAVAMIPLMGGLALAIDFTEMNREKQMVTNALDAANFATARRLT-- 63 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 E ++ RA D F+N + ++ + + T Sbjct: 64 --------EGATDDQLRAYALD-------------------FFNANLNDL---NPANATL 93 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY---NQKIVSFIPALLRIEMGER 188 +++ + + Y L+ + NQ I + + +R+ + Sbjct: 94 NLTLPSNTAGGGLLKMTARLNYKPYFYPAFAQLVGKSATDANQSISFDVTSQVRL----K 149 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA----LLLFLDSIDL 244 + LV+D SGSM + + ++ LK A + + Sbjct: 150 NTLEVALVLDNSGSMT-------------TLGTGSGQKRIDLLKTASKQLVDTLAQQAVM 196 Query: 245 LSHVKEDVYMGLIGYTTRV 263 + V V GL+ + V Sbjct: 197 IKQVDRPVQFGLVPFAASV 215 Score = 36.8 bits (83), Expect = 6.1, Method: Composition-based stats. Identities = 35/235 (14%), Positives = 76/235 (32%), Gaps = 28/235 (11%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 A+ + S T + ++ + +IDL+ + + RV Sbjct: 427 ALPTGNGPNYSCTTNAITPLTDV-SVTDGATAIKAAIDLMQPNGGTNVPEGMAWGWRVVS 485 Query: 266 NI-EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + + G + + + + L +Y +D K ++ + + Sbjct: 486 SGEPFTQGRLETEKGNDKVVIVLTDGANTYYTPSSLSYSD-PADSKSTYASYGYLNPGYN 544 Query: 325 PSLPFQKFIIFLT-----DGENNNFKS--NVNTIKICDKAKENFIKIVTISI-----NAS 372 + + F+ T D N N+ + N +C+ AK I ++T+++ AS Sbjct: 545 GTSVGRMFMGTSTAIGQFDYSNGNYTNALNEQMATLCNNAKAANIMVMTVALDLSTTKAS 604 Query: 373 PN-GQRLLKTCVSS------------PEYHYNVVNADSLIHVFQNISQLMVHRKY 414 LK+C S P + SL + F+ I + + + Sbjct: 605 DKLAIDALKSCSSESRFRKDPTDPSKPAKLFWNATGASLSNDFKEIGNELSNLRV 659 >gi|256823198|ref|YP_003147161.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256796737|gb|ACV27393.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 348 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 79/237 (33%), Gaps = 57/237 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + +D+SGSM + +DK+ ++ A+K L F+ ++ Sbjct: 87 DLMISIDISGSME----------MPDMVIEDKEVDRLVAVKALLTDFIAR-------RKG 129 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKK 310 +G+I + + ++ + V+ + L T + A + L Sbjct: 130 DRVGMILFGEQAYLQTPLTFDLKTVQTMLDETTIGLAGSSRTAIGDGIGLAVKRL----- 184 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + +I LTDG+NN N ++ + A+ I I TI + Sbjct: 185 -------------RERDANNRVLILLTDGQNNTGALNP--LQAAELAEHAGITIYTIGVG 229 Query: 371 ASP-------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 A + + L+ + ++ + + ++Q I +L Sbjct: 230 ADEMIVKNRFFGNRRINPSLELDEESLIAVAEKTGGRYFRARDTKEMEEIYQIIDEL 286 >gi|330830423|ref|YP_004393375.1| FlpL [Aeromonas veronii B565] gi|328805559|gb|AEB50758.1| FlpL [Aeromonas veronii B565] Length = 460 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 15/126 (11%), Positives = 47/126 (37%), Gaps = 8/126 (6%) Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE--NNNFKSNVN 349 T++ + +++L+ + + + +K ++ +DGE + Sbjct: 338 TNTAEGVMWGWRLLSPEWQGRWRQGAAALPRPYELQDNRKIMVLFSDGEHMTEAALRDRK 397 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD---SLIHVFQNIS 406 + +C + K I+I T++ + C S + ++ L + +I+ Sbjct: 398 QLLLCREMKRKGIQIYTVAFEGDTR---FVAQCASDRSLAFKATKSNIRTVLTRLASSIN 454 Query: 407 QLMVHR 412 +++ + Sbjct: 455 DVVLTK 460 >gi|119720657|ref|YP_921152.1| von Willebrand factor, type A [Thermofilum pendens Hrk 5] gi|119525777|gb|ABL79149.1| von Willebrand factor, type A [Thermofilum pendens Hrk 5] Length = 327 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 78/216 (36%), Gaps = 47/216 (21%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LVVD+SGSM +S P V K+ + A L ++ + V + Sbjct: 104 LVVDVSGSME---DSIPGGV------------KIEVARRAATLLVERM------PGGVDV 142 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 GL+ ++ R+ ++ P T R+ + T T ++ A L K + Sbjct: 143 GLLAFSDRIVLSLPP---TGDRRRVLDAIESLKPGGGTMYTYPLQAALSWLKPYKLFNAS 199 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS-- 372 T ++F++DG + + + I + T+ Sbjct: 200 T----------------LVVFVSDG---LPADAATYRTLLSEFRSLGIPVYTV-YIGPGG 239 Query: 373 PNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQ 407 G+R LK S+ Y +A+ L+ F+ +++ Sbjct: 240 DEGERELKLIAGSTGGEEYTAGSAEELLKAFKTLAE 275 >gi|89889805|ref|ZP_01201316.1| BatA, aerotolerance operon [Flavobacteria bacterium BBFL7] gi|89518078|gb|EAS20734.1| BatA, aerotolerance operon [Flavobacteria bacterium BBFL7] Length = 337 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 48/287 (16%), Positives = 85/287 (29%), Gaps = 70/287 (24%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVS----FIPALLRIEMGERPIFLIELVVDLSGSMHC 205 V TS+ L+ + +L I++ + + + I L VD+S SM Sbjct: 50 VETSWLANLRPLLFILRLIVLALIITALARPQTTDVTTKTKKTEGIDIVLAVDVSASMLA 109 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 D K ++ A K F+ + + +G++ Y Sbjct: 110 E---------------DLKPNRLEATKKVAADFIK-------GRPNDRIGVVVYAGESYT 147 Query: 266 NIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + + + D ++ T + A L Sbjct: 148 KTPITTDEMISLRAINEIAFDGVLENGTAIGMGLATAVNRL------------------K 189 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NASPN------ 374 S K II +TDG NN+ + + A E IK+ TI I NA Sbjct: 190 DSEALSKVIILMTDGVNNSGFIDPKIAS--ELALEYDIKVYTIGIGTNGNAPSPVAQIGR 247 Query: 375 ------------GQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 + L+K + ++ N L ++ I +L Sbjct: 248 NKFRMAMMPVEIDEELMKQIAVDTGGKYFRATNNKKLEEIYGEIDKL 294 >gi|189461337|ref|ZP_03010122.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] gi|189431866|gb|EDV00851.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] Length = 332 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 47/312 (15%), Positives = 87/312 (27%), Gaps = 73/312 (23%) Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + T ++ N + + Y F+ ++ +++ Sbjct: 25 RRKKTEPTMQVSSTRMYMNAPKSWKI-----YLLHAPFVLRIVAFVMIVLVLARPQTTDN 79 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + E I L VD+S SM D K ++ A K F++ Sbjct: 80 WQNTEIEGIDIMLAVDVSTSMLAE---------------DLKPNRLEAAKQVAAEFIN-- 122 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-----MDSLILKPTDSTPA 297 +GL + + + +I T Sbjct: 123 --GRPNDN---IGLTVFAGEAFTQCPLTVDHGVLLNLFNSIKGDIAQRGMIEDGTAIGMG 177 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A L K +S K II LTDG NN + T + A Sbjct: 178 LANAISRLKDSKAKS------------------KVIILLTDGSNNRGDISPLTAA--EIA 217 Query: 358 KENFIKIVTIS-------------------INAS--PNGQRLLKTCVSSPEYHYNVVNAD 396 K+ I++ TI +N + Q L + ++ ++ + Sbjct: 218 KQFGIRVYTIGVGTNGTAPYPMQTYAGVQYVNVPVEIDEQTLTQIAGTTNGNYFRATSNS 277 Query: 397 SLIHVFQNISQL 408 L V++ I +L Sbjct: 278 KLEEVYREIDKL 289 >gi|270296687|ref|ZP_06202886.1| conserved hypothetical protein [Bacteroides sp. D20] gi|270272674|gb|EFA18537.1| conserved hypothetical protein [Bacteroides sp. D20] Length = 327 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 43/276 (15%), Positives = 85/276 (30%), Gaps = 63/276 (22%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y + F+ ++ +++ + E I L +D+S SM Sbjct: 51 YLLHVPFVLRIIALILIILVLARPQTTDSWQNSEIEGIDIMLAMDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K+ F++ +G+ + + Sbjct: 104 --------DLKPNRLEAAKDVAAEFIN----GRPNDN---IGITLFAGESFTQCPLTVDH 148 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + LI T + A L K +S K I Sbjct: 149 AVLLNLLKDMKCGLIEDGTAIGMGIANAVTRLKDSKAKS------------------KVI 190 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS----------------- 372 I LTDG NN K +++ + + AK I++ TI + A Sbjct: 191 ILLTDGVNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVE 248 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ ++ + L V++ I +L Sbjct: 249 IDEKTLTQIAGTTEGNYFRATSNSKLKEVYEEIDKL 284 >gi|6465945|gb|AAF12731.1|AF108501_1 Ca(2+)-sensitive chloride channel 2 [Mus musculus] Length = 902 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 65/370 (17%), Positives = 128/370 (34%), Gaps = 66/370 (17%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + +KN++E+ +A + G K+V R + +K L + FI + Sbjct: 170 VDRPFYISRKNTIEATRCSASITG-KKVVHECQRGSCVTRACRRDSKTRLYEPKCTFIPD 228 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQ 159 I+ + G S +F + + + A N + N +++ ++ Sbjct: 229 KIQTA--GASIMFMQNLNSVVEFCTENNHNAEAPNLQNKMCNRRSTWDV---------IK 277 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 N + + P + R + + LV+D SGSM Sbjct: 278 ASADFQNSPPMRGTEAPPPPTFSLLKSRRRV--VCLVLDKSGSMD--------------- 320 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQ 278 K ++ + A L+L I V+++ +GL+ + + +N + Q Sbjct: 321 ----KEDRLIRMNQAAELYLTQI-----VEKESMVGLVTFDSAAHIQNYLIKITSSSDYQ 371 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +T ++ T ++ +Q +TS + + + I+ LTD Sbjct: 372 KITANLPQQATGGTSICHGLQAGFQAITSSDQSTSGSE----------------IVLLTD 415 Query: 339 GENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNAD 396 GE+N S C +A + I TI++ + L + Y + Sbjct: 416 GEDNGISS-------CFEAVSRSGAIIHTIAL-GPSAARELETLSDMTGGLRFYANKHVS 467 Query: 397 SLIHVFQNIS 406 SLI F IS Sbjct: 468 SLIDAFSRIS 477 >gi|225012026|ref|ZP_03702463.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] gi|225003581|gb|EEG41554.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] Length = 334 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 47/260 (18%), Positives = 80/260 (30%), Gaps = 66/260 (25%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 I + I + +D+S SM QD K +++ALK Sbjct: 74 IARPQTVDISTRTKTNKGIDIVMAIDVSSSMLA---------------QDLKPDRLSALK 118 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKP 291 F+D D LS +GL+ Y + V+ + LI Sbjct: 119 RVASAFVD--DRLSD-----RIGLVVYAGESYTLTPITSDKGIVKGSLREISYQGLIEDG 171 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + L + +S K II LTDG NN+ + Sbjct: 172 TAIGMGLATSVNRLKDSRAKS------------------KVIILLTDGVNNSGFIDPKIA 213 Query: 352 KICDKAKENFIKIVTISI------NASPN----------------GQRLLKTCVSSPEY- 388 + A E IK TI + A + LL+ ++ Sbjct: 214 T--ELAVEFGIKTYTIGLGSNGTARAPVGILPNGSFQYAMTKVEIDEALLQEIATATGGI 271 Query: 389 HYNVVNADSLIHVFQNISQL 408 ++ + L +++ I++L Sbjct: 272 YFRATDNKKLEEIYEEINKL 291 >gi|297579701|ref|ZP_06941628.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297535347|gb|EFH74181.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 318 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 78/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 85 DLMLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSEFIAK-------REG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + + LI T + A + Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVASQLNQAVLKLIGTQTAIGEGIGLATKTFID---- 183 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S Q+ +I L+DG N + + AK+ I T+ + A Sbjct: 184 --------------SDAPQRVMILLSDGSNTAGVLDPLEAA--NIAKQYHTTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L ++ +++ N L +++ I+QL Sbjct: 228 GEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQL 282 >gi|328469247|gb|EGF40193.1| hypothetical protein VP10329_10201 [Vibrio parahaemolyticus 10329] Length = 334 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 81/235 (34%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM ED+N D +++A+K L F+ ++ Sbjct: 98 DLMLVVDLSGSMQ------KEDMNLDGEYID----RLSAVKKVLSDFVAK-------RKG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + + V Q + + + L+ + T + Sbjct: 141 DRLGVVLFGDHAYLQTPLTADRQTVIQQIKQTVIGLVGQRTAIGDGI------------- 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 + S Q+ +I L+DG N + + AK+ I T+ + Sbjct: 188 -----GLGTKTFVDSDAPQRVMILLSDGSNTAGVLDPIEAA--EIAKKYNATIYTVGVGA 240 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 A + Q L K + ++ +A+ L ++ I++L Sbjct: 241 GEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAEQLEKIYNTINKL 295 >gi|13447394|ref|NP_085104.1| chloride channel calcium activated 2 [Mus musculus] gi|12043705|gb|AAG47626.1|AF115852_1 endothelial chloride channel [Mus musculus] gi|14198178|gb|AAH08147.1| Chloride channel calcium activated 2 [Mus musculus] gi|74208910|dbj|BAE21205.1| unnamed protein product [Mus musculus] gi|148680073|gb|EDL12020.1| mCG120735 [Mus musculus] Length = 902 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 65/370 (17%), Positives = 128/370 (34%), Gaps = 66/370 (17%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + +KN++E+ +A + G K+V R + +K L + FI + Sbjct: 170 VDRPFYISRKNTIEATRCSASITG-KKVVHECQRGSCVTRACRRDSKTRLYEPKCTFIPD 228 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQ 159 I+ + G S +F + + + A N + N +++ ++ Sbjct: 229 KIQTA--GASIMFMQNLNSVVEFCTENNHNAEAPNLQNKMCNRRSTWDV---------IK 277 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 N + + P + R + + LV+D SGSM Sbjct: 278 ASADFQNSPPMRGTEAPPPPTFSLLKSRRRV--VCLVLDKSGSMD--------------- 320 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQ 278 K ++ + A L+L I V+++ +GL+ + + +N + Q Sbjct: 321 ----KEDRLIRMNQAAELYLTQI-----VEKESMVGLVTFDSAAHIQNYLIKITSSSDYQ 371 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +T ++ T ++ +Q +TS + + + I+ LTD Sbjct: 372 KITANLPQQATGGTSICHGLQAGFQAITSSDQSTSGSE----------------IVLLTD 415 Query: 339 GENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNAD 396 GE+N S C +A + I TI++ + L + Y + Sbjct: 416 GEDNGISS-------CFEAVSRSGAIIHTIAL-GPSAARELETLSDMTGGLRFYANKHVS 467 Query: 397 SLIHVFQNIS 406 SLI F IS Sbjct: 468 SLIDAFSRIS 477 >gi|188990358|ref|YP_001902368.1| hypothetical protein xccb100_0962 [Xanthomonas campestris pv. campestris str. B100] gi|167732118|emb|CAP50310.1| putative membrane protein [Xanthomonas campestris pv. campestris] Length = 335 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 75/238 (31%), Gaps = 54/238 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L VDLSGSM + ++ A K L FLD Sbjct: 96 REARQMMLAVDLSGSMS----------EPDMVLGGNVVDRLTAAKAVLSDFLDR------ 139 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +E +GL+ + R + VR + + L + T A+ + + L Sbjct: 140 -REGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVVGLAGRETAIGDAIALSVKRL-- 196 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 Q+ ++ LTDG N N +K + AK +++ TI Sbjct: 197 ----------------REQRHGQRVVVLLTDGVNTAGVLNP--LKAAELAKAEGVRVHTI 238 Query: 368 SINASPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + S + L+ + + + + L ++ + +L Sbjct: 239 AFGGSGSYSLFGVPIPAGGGDDIDEDGLRKIAEQTGGRFFRARDTEELAGIYAELDRL 296 >gi|15822539|gb|AAG23712.1| calcium-activated chloride channel CLCA4 [Mus musculus] gi|148680071|gb|EDL12018.1| mCG141954, isoform CRA_a [Mus musculus] gi|162317876|gb|AAI56643.1| Chloride channel calcium activated 4 [synthetic construct] Length = 909 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 71/212 (33%), Gaps = 50/212 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM T++ + A L+L I ++++ + Sbjct: 311 LVLDKSGSMR----------------LGSPITRLTLMNQAAELYLIQI-----IEKESLV 349 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQY--VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 GL+ + + Y ++ + T +K+ ++ +TS + + Sbjct: 350 GLVTFDSTATIQTNLI-RIINDSSYLAISTKLPQYPNGGTSICNGLKKGFEAITSSDQST 408 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINA 371 + I+ LTDGE+N S C + K + I TI++ Sbjct: 409 SGSE----------------IVLLTDGEDNRISS-------CFQEVKHSGAIIHTIAL-G 444 Query: 372 SPNGQRLLKTCVSSPE-YHYNVVNADSLIHVF 402 + L + Y + + LI F Sbjct: 445 PSAARELETLSDMTGGLRFYAKEDVNGLIDAF 476 >gi|86144576|ref|ZP_01062908.1| hypothetical protein MED222_09203 [Vibrio sp. MED222] gi|85837475|gb|EAQ55587.1| hypothetical protein MED222_09203 [Vibrio sp. MED222] Length = 330 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 81/235 (34%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ D +++A+K+ L F++ ++ Sbjct: 94 DLMLVVDLSYSMS------QEDMQFNGEYID----RLSAVKHVLSDFIER-------RKG 136 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + Q + + + LI T + A + Sbjct: 137 DRVGLVLFADHAYLQTPLTLDRDTLSQQLNQAVLKLIGTQTAIGDGIGLATKTFVD---- 192 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S Q+ +I L+DG N + D AK+ I T+ + A Sbjct: 193 --------------SDAPQRVMILLSDGSNTAGVLDPLEAA--DIAKKYNATIYTVGVGA 236 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L++ + ++ ++ L ++ I+QL Sbjct: 237 GEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRARDSKELATIYDTINQL 291 >gi|298207017|ref|YP_003715196.1| aerotolerance-related membrane protein [Croceibacter atlanticus HTCC2559] gi|83849651|gb|EAP87519.1| aerotolerance-related membrane protein [Croceibacter atlanticus HTCC2559] Length = 334 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 43/263 (16%), Positives = 77/263 (29%), Gaps = 66/263 (25%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + + + I + +D+S SM D + ++ Sbjct: 71 ITAMARPRTVDVSTKTKTTKGIDIVMAIDVSASMLAR---------------DLRPNRLE 115 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR-DMDSLI 288 ALK+ F+ + + +G++ Y + V + ++ Sbjct: 116 ALKDVASEFIQ-------GRPNDRVGIVLYAGESYTKTPITSDKSIVLGALNDVKFSEVL 168 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + + L K K II LTDG NN+ + Sbjct: 169 ENGTAIGMGLATSVNRLKDSKA------------------LSKVIILLTDGVNNSGTIDP 210 Query: 349 NTIKICDKAKENFIKIVTISI----NA------SPNGQ------------RLLKTCVS-S 385 + A E IK TI I A NGQ LLK + + Sbjct: 211 KLAS--ELAVEYGIKTYTIGIGSNGMALSPIGIKSNGQFQYGNQKVEIDEDLLKQIATVT 268 Query: 386 PEYHYNVVNADSLIHVFQNISQL 408 ++ N L +++ I++L Sbjct: 269 GGQYFRATNNQKLEAIYEEINKL 291 >gi|254225237|ref|ZP_04918850.1| conserved hypothetical protein [Vibrio cholerae V51] gi|125622336|gb|EAZ50657.1| conserved hypothetical protein [Vibrio cholerae V51] Length = 318 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 78/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 85 DLMLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSEFIAK-------REG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + + LI T + A + Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFID---- 183 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S Q+ +I L+DG N + + AK+ I T+ + A Sbjct: 184 --------------SDAPQRVMILLSDGSNTAGVLDPLEAA--NIAKQYHTTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L ++ +++ N L +++ I+QL Sbjct: 228 GEMVVKDFLFSRKVNTSQDLDEKTLQSIATTTGGHYFRARNQQDLQNIYDTINQL 282 >gi|320352592|ref|YP_004193931.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121094|gb|ADW16640.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 798 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/237 (11%), Positives = 70/237 (29%), Gaps = 55/237 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + LV+D SGSM + ++ LK A F +S Sbjct: 323 THDTVRVVLVLDESGSM-----------------NAETPKRIERLKVAAKNF------VS 359 Query: 247 HVKEDVYMGLIGYTTRVEKNI-----EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + +G++ Y + + + T+ +++A Sbjct: 360 LAENGTELGIVSYASDAAVASGRTEVAIAPLGANRAAWNNAIDGLGPSTRTNIGAGLQKA 419 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF----KSNVNTIKICDKA 357 ++T+ + +I+ ++DG NN ++ + Sbjct: 420 RDLITAA----------------GGVTANTYIVLMSDGLNNEPAPQANADADLNGKIAML 463 Query: 358 KENFIKIVTISINASPNGQRLLKTC----VSSPEYHYNVVNADSLIHVFQNISQLMV 410 + I + + + + L C + ++ + ++ L F + + +V Sbjct: 464 LADGIPVY---VTCTGSDLGLASQCSEIGTGTGGHYVDSADSARLPEAFADFHERIV 517 >gi|293361345|ref|XP_236596.5| PREDICTED: collagen type VI alpha 4 [Rattus norvegicus] Length = 2327 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 68/178 (38%), Gaps = 24/178 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTD 293 + L + V +GL+ Y+ + + ++ V + +L + T+ Sbjct: 256 NFLHSIVSGLDVRSDQVQVGLVQYSDNIYPAFQLK--QSSLKSVVLEQIRNLPYNMGSTN 313 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ + ++ T K + +I +TDGE+N+ + Sbjct: 314 TGSALEF--------IRANYLTEMSGSRAKDG---VPQIVILVTDGESNDEVQDAA---- 358 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 D+ K + + + + IN + L+ S P E+ + N + L + ++SQ + Sbjct: 359 -DQLKRDGVFVYVVGINIQDVQE--LQKIASEPFEEFLFTTENFNILQELSGSLSQAV 413 >gi|293349452|ref|XP_002727145.1| PREDICTED: similar to procollagen, type VI, alpha 3 isoform 4 [Rattus norvegicus] Length = 2114 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 68/178 (38%), Gaps = 24/178 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTD 293 + L + V +GL+ Y+ + + ++ V + +L + T+ Sbjct: 256 NFLHSIVSGLDVRSDQVQVGLVQYSDNIYPAFQLK--QSSLKSVVLEQIRNLPYNMGSTN 313 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ + ++ T K + +I +TDGE+N+ + Sbjct: 314 TGSALEF--------IRANYLTEMSGSRAKDG---VPQIVILVTDGESNDEVQDAA---- 358 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 D+ K + + + + IN + L+ S P E+ + N + L + ++SQ + Sbjct: 359 -DQLKRDGVFVYVVGINIQDVQE--LQKIASEPFEEFLFTTENFNILQELSGSLSQAV 413 >gi|75076662|sp|Q4R7B7|ANTRL_MACFA RecName: Full=Anthrax toxin receptor-like; Flags: Precursor gi|67969305|dbj|BAE01005.1| unnamed protein product [Macaca fascicularis] Length = 557 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 73/221 (33%), Gaps = 48/221 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + F + ++D SGS++ L + + Sbjct: 70 QGSFDLYFILDKSGSVNNNWI---------------------DLYMWVEETVARFQSSD- 107 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + M I Y+T + + + +++ + + + T ++A Q + Sbjct: 108 ----IRMCFITYSTDGQTVLPLTSDKNRIKNGLDQLRKIVPDGHTFMQAGFRKAIQQI-- 161 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 F G K+PS+ II +TDGE + +T++ KA++ + T+ Sbjct: 162 --------ETFNSGNKVPSM-----IIAMTDGE-LVAHAFQDTLREAQKARKLGANVYTV 207 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVN----ADSLIHVFQN 404 + Q + SPE+ + V N + + Sbjct: 208 DVADYKLDQ--ITAIADSPEHVFAVENGFKAMRDTVDALTS 246 >gi|15600942|ref|NP_232572.1| hypothetical protein VCA0172 [Vibrio cholerae O1 biovar eltor str. N16961] gi|229510539|ref|ZP_04400019.1| protein BatA [Vibrio cholerae B33] gi|229517329|ref|ZP_04406774.1| protein BatA [Vibrio cholerae RC9] gi|229605140|ref|YP_002875844.1| protein BatA [Vibrio cholerae MJ-1236] gi|254286663|ref|ZP_04961618.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|254850438|ref|ZP_05239788.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255746016|ref|ZP_05419963.1| protein BatA [Vibrio cholera CIRS 101] gi|262162145|ref|ZP_06031160.1| protein BatA [Vibrio cholerae INDRE 91/1] gi|9657562|gb|AAF96085.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor str. N16961] gi|150423247|gb|EDN15193.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|229345365|gb|EEO10338.1| protein BatA [Vibrio cholerae RC9] gi|229352984|gb|EEO17924.1| protein BatA [Vibrio cholerae B33] gi|229371626|gb|ACQ62048.1| protein BatA [Vibrio cholerae MJ-1236] gi|254846143|gb|EET24557.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255735770|gb|EET91168.1| protein BatA [Vibrio cholera CIRS 101] gi|262028220|gb|EEY46878.1| protein BatA [Vibrio cholerae INDRE 91/1] Length = 318 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 78/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 85 DLMLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSEFIAK-------REG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + + LI T + A + Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQTVLKLIGTQTAIGEGIGLATKTFID---- 183 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S Q+ +I L+DG N + + AK+ I T+ + A Sbjct: 184 --------------SDAPQRVMILLSDGSNTAGVLDPLEAA--NIAKQYNTTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L ++ +++ N L +++ I+QL Sbjct: 228 GEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQL 282 >gi|260769474|ref|ZP_05878407.1| protein BatA [Vibrio furnissii CIP 102972] gi|260614812|gb|EEX39998.1| protein BatA [Vibrio furnissii CIP 102972] gi|315182004|gb|ADT88917.1| von Willebrand factor type A domain protein [Vibrio furnissii NCTC 11218] Length = 322 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 76/235 (32%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM D+ +++A+K L F+ ++ Sbjct: 86 DMMLVVDLSYSMS----------QQDMKSGDQFIDRLSAVKQVLSDFIAK-------RQG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + Q + + + LI T + A + Sbjct: 129 DRLGLIFFADHAYLQTPLTLDRQTIAQQLNQAVLRLIGTQTAIGEGIGLATKTFID---- 184 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 S Q+ +I L+DG N + + AK+ I T+ + Sbjct: 185 --------------SDAPQRVMILLSDGSNTSGVLDPMEAA--KIAKKYHTTIYTVGVGA 228 Query: 370 ---------------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 A ++ L+ + ++ N L H++ I+QL Sbjct: 229 GEMMVKEFFMTRKINTAEDLDEKTLQAIADETGGQYFRARNQQDLQHIYDTINQL 283 >gi|291087628|ref|ZP_06346959.2| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291074491|gb|EFE11855.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 473 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 66/190 (34%), Gaps = 33/190 (17%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDV----YMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + A K + F+++++ S E +GL+ ++ N S V + Sbjct: 60 SALEAAKKGIKAFIETLERESAQPEGYAGEKRVGLVSFSDTATVNSMLS----PVVEQAA 115 Query: 282 RDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 R + L ++ A++ A ++L P +K + +TDG+ Sbjct: 116 RAAEGLTAGGKSNQAEAIRAAVKLLDMK------------------TPGEKMLFLITDGQ 157 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKTCVS--SPEYHYNVVNADS 397 + +A++ + + I I A + L++ S S + + Sbjct: 158 TPFRSQTDSAAA---EARQAGVTVYCIGIAAPDGVNREALRSWASGPSDSHIIEIRELGE 214 Query: 398 LIHVFQNISQ 407 F+ + + Sbjct: 215 AQTAFERLMK 224 >gi|330975134|gb|EGH75200.1| von Willebrand factor, type A [Pseudomonas syringae pv. aptata str. DSM 50252] Length = 352 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 44/248 (17%), Positives = 82/248 (33%), Gaps = 66/248 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 131 RKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRLRMR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 191 PATS------------------RALVLVTDGANNAGQIDPITAAR--LAAEEGVKIYPIG 230 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL------IHVFQNI 405 I + P+ L LK S S ++ + D L + + + Sbjct: 231 IGSDPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDALEPV 290 Query: 406 SQLMVHRK 413 +Q + Sbjct: 291 AQQPTQAR 298 >gi|187736265|ref|YP_001878377.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] gi|187426317|gb|ACD05596.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] Length = 328 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 41/232 (17%), Positives = 75/232 (32%), Gaps = 50/232 (21%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I + DLS SM + ++ A K+ + F+DS + Sbjct: 90 IDIMIAFDLSYSME----------TPDMVLNRMPINRLVAAKHVITQFVDS-------RP 132 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDK 309 D +G++G+ + + + V + +I T A+ A L K Sbjct: 133 DDRIGIVGFAGKTKSFCPLTLDHALVNSIIRDFHPRMIQADGTAIGSAIAAAATRLDDRK 192 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + K II +TDG +N+ + + + A + IKI TI++ Sbjct: 193 ETK-----------------SKIIILVTDGASNSGQISPLVAA--ENAAKLGIKIYTIAV 233 Query: 370 NASPN------------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + L+ + H+ N S F +I +L Sbjct: 234 GTEEGTLANGMVVQSEFDEPTLRKIAQLTGGEHFRATNMASFNKAFTSIGKL 285 >gi|26349121|dbj|BAC38200.1| unnamed protein product [Mus musculus] Length = 280 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 45/245 (18%), Positives = 81/245 (33%), Gaps = 49/245 (20%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + + + L E + ++D S S +N+ Sbjct: 32 PSRGRHVRMYPQTALLESSCENKRADLAFIIDSSRS----VNTYHYAKVKE--------- 78 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 LD + L + +GL+ Y + V+ T K + V R + Sbjct: 79 ----------FILDILQFLDIGPDVTRVGLLQYGSTVKNEFSLK--TFKRKSEVERAVKR 126 Query: 287 L--ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + A++ A I S+ + G + + I+ +TDG Sbjct: 127 MRHLSTGTMTGLAIQYALNIAFSEAE----------GARPLRENVPRIIMIVTDG---RP 173 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLI 399 + +V + KA+ I I I + LK S P ++ + V N +SL Sbjct: 174 QDSVAEVA--AKARNTGILIFAIGVG--QVDLNTLKAIGSEPHKDHVFLVANFSQIESLT 229 Query: 400 HVFQN 404 VFQN Sbjct: 230 SVFQN 234 >gi|320323259|gb|EFW79347.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. B076] Length = 352 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 75/227 (33%), Gaps = 60/227 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ + L Sbjct: 131 RKGDRVGLILFGTQAFVQAPLTYDRRTVRFWLDEAKIGIAGKNTALGDAIGLGLKRLRLR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 191 PATS------------------RVLVLVTDGANNAGQIDPITAAR--LAAEEGVKIYPIG 230 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL 398 I A P+ L LK S S ++ + D L Sbjct: 231 IGADPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQL 277 >gi|241113476|ref|YP_002973311.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861684|gb|ACS59350.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 329 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 44/291 (15%), Positives = 89/291 (30%), Gaps = 50/291 (17%) Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI---PA 179 + S+ +++ + V+T + E L + P Sbjct: 27 RETSASVRLPFFSQVAKAAGVRPTEGSVVTRRTWPQLVCETLAWCLVVLALARPQFVEPP 86 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + ++E I L +DLS SM D P K ++ A++ + F+ Sbjct: 87 IEKVEPQRD----ILLALDLSQSM---------DARDFPGADGKPLARVEAVRQVVADFV 133 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + +GL+ + + E VR + + + T A+ Sbjct: 134 GK-------RPGDRIGLVAFGDAPYPLAPFTMDHELVRTMIADTVPGMAGPRTSLGDALG 186 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + + +K +I LTDG N+ S + +K + AK Sbjct: 187 LAI-KMFGK-----------------TTAPEKVLIVLTDG--NDTASRMPPLKAAEIAKS 226 Query: 360 NFIKIVTISINASPNGQR-------LLKTCVSSPEYHYNVVNADSLIHVFQ 403 + T+ I L K S+ ++ + L +++ Sbjct: 227 KGVIFHTVGIGDPAATGEDKLDTATLQKIAASTGGRYFFGGDQSQLAAIYE 277 >gi|218675994|ref|YP_002394813.1| hypothetical protein VS_II0212 [Vibrio splendidus LGP32] gi|218324262|emb|CAV25554.1| Conserved hypothetical protein [Vibrio splendidus LGP32] Length = 347 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 81/235 (34%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ D +++A+K+ L F++ ++ Sbjct: 111 DLMLVVDLSYSMS------QEDMQFNGEYID----RLSAVKHVLSDFIER-------RKG 153 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + Q + + + LI T + A + Sbjct: 154 DRVGLVLFADHAYLQTPLTLDRDTLSQQLNQAVLKLIGTQTAIGDGIGLATKTFVD---- 209 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S Q+ +I L+DG N + D AK+ I T+ + A Sbjct: 210 --------------SDAPQRVMILLSDGSNTAGVLDPLEAA--DIAKKYNATIYTVGVGA 253 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L++ + ++ ++ L ++ I+QL Sbjct: 254 GEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRARDSKELATIYDTINQL 308 >gi|148974032|ref|ZP_01811565.1| hypothetical protein VSWAT3_12932 [Vibrionales bacterium SWAT-3] gi|145965729|gb|EDK30977.1| hypothetical protein VSWAT3_12932 [Vibrionales bacterium SWAT-3] Length = 330 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 81/235 (34%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ D +++A+K+ L F++ ++ Sbjct: 94 DLMLVVDLSYSMS------QEDMQFNGEYID----RLSAVKHVLSDFIER-------RKG 136 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + Q + + + LI T + A + Sbjct: 137 DRVGLVLFADHAYLQTPLTLDRDTLSQQLNQAVLRLIGNQTAIGDGIGLATKTFVD---- 192 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S Q+ ++ L+DG N + D AK+ I T+ + A Sbjct: 193 --------------SDAPQRVMVLLSDGSNTAGVLDPLEAA--DIAKKYNATIYTVGVGA 236 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L++ + ++ ++ L ++ I+QL Sbjct: 237 GEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRARDSKELATIYDTINQL 291 >gi|325695689|gb|EGD37588.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK150] Length = 460 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 53/328 (16%), Positives = 108/328 (32%), Gaps = 51/328 (15%) Query: 49 KKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 + SM+ + I S + + S+ + ++ + + + ++N++ Sbjct: 52 MRTSMQ-YVDRTIGKATSIFILDDSKYQGSEKGLTKEWSYIGLSEDGKKVRNYV------ 104 Query: 109 YSAVFYNTEIQNIVNSSRISMTHMAN---NRLDSSNNTIFYNMD---VMTSYDYRLQFIE 162 ++ N +I ++ S +M + L N I YN+ TS + Sbjct: 105 WNKSKQNWDITDLGTKSLYNMKLDLEFKADDLYKDNRLISYNLTGKYPDTSNKLSIDTAI 164 Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPI-----FLIELVVDLSGSMHCAMNSDPEDVNSA 217 LN + V+ + I PI I V D SGSM M + Sbjct: 165 SALNSKQVFSKVAKGKKGIAIAYRNDPIEGQMNVSISFVFDKSGSMAWDMLGKEVEET-- 222 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + ++M LK + + + + +V ++ + E E GT + Sbjct: 223 -----NRPSRMKILKEKSIAMMKDLKSIGNVSVNLVAFSTLGSYVQEDFSELDKGTTTIE 277 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + T+ ++ L + + K+++ LT Sbjct: 278 TSINKLDEG---GYTNPGDGLRYGMVSLQKNPAQL------------------KYVVLLT 316 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIV 365 DG N F + N D + N IK+ Sbjct: 317 DGVPNAFTAKTN-----DVSSRNRIKVY 339 >gi|163786711|ref|ZP_02181159.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] gi|159878571|gb|EDP72627.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] Length = 335 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 45/263 (17%), Positives = 80/263 (30%), Gaps = 66/263 (25%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 +V + + I + +D+S SM D ++ Sbjct: 72 ITALVRPRTVDVSTKTKTTRGIDIVMSIDVSASMLAK---------------DLLPNRLE 116 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDMDSLI 288 ALK F++ + + +GL+ Y + V R +++I Sbjct: 117 ALKKVAADFIE-------GRPNDRIGLVEYAGEAYTKTPITSDKSIVLRSMRDIKYNTII 169 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + + L + +S K II LTDG NN + Sbjct: 170 EGGTAIGMGLATSVNRLKDSRAKS------------------KVIILLTDGVNNGGFIDP 211 Query: 349 NTIKICDKAKENFIKIVTIS----------INASPNG------------QRLLKTCVS-S 385 + A E IK+ TI + +PNG + LLK + Sbjct: 212 KIAS--ELAVEYGIKVYTIGLGTNGTALSPVRINPNGSFQYGRQKVEIDEDLLKEIADVT 269 Query: 386 PEYHYNVVNADSLIHVFQNISQL 408 ++ N L ++ I++L Sbjct: 270 GGKYFRATNNKKLAQIYDEINKL 292 >gi|124006869|ref|ZP_01691699.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123987550|gb|EAY27259.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 351 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/256 (14%), Positives = 77/256 (30%), Gaps = 66/256 (25%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 I L +D+S SM + +D ++ A K F+ Sbjct: 106 QTSEGIDILLTLDISESM---------------LIEDFTPNRLEAAKLVAKNFV------ 144 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQI 304 H ++ +GL+ ++ + + +++Y+ D + T A+ Sbjct: 145 -HGRKYDRIGLVIFSGEAYSVSPLTTDYKLLKRYIEDIREDMIQENGTAIGSAL------ 197 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 S K +I ++DG+N + T A IKI Sbjct: 198 ------------GMGTIRMQESASRSKVVILISDGDNTAGNLDPITASRLATAHN--IKI 243 Query: 365 VTISI----NASPN--------------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 TI + + +L+ Y + +L +VF I Sbjct: 244 YTILVGRSGKVPYGRDMFGQPQYVNNTVDESVLREIAKIGEGKFYRASDNQALKNVFAEI 303 Query: 406 SQL----MVHRKYSVI 417 ++L ++ ++ I Sbjct: 304 NRLEKTEIIENRFKSI 319 >gi|77920224|ref|YP_358039.1| von Willebrand factor type A domain-containing protein [Pelobacter carbinolicus DSM 2380] gi|77546307|gb|ABA89869.1| von Willebrand factor type A domain protein [Pelobacter carbinolicus DSM 2380] Length = 442 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/257 (14%), Positives = 74/257 (28%), Gaps = 45/257 (17%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L I + A +RP + LV+D SGSM Sbjct: 39 LAGPAQKTVIKIALDAPRAPRTAQRPPVNLALVLDRSGSMSGN----------------- 81 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K+A + A + + + L+ Y VE + + + R Sbjct: 82 ---KIAKAREAAIEAVRRLSDGDLFS------LVVYDDSVETLVP-AQPVSDIGDIEARI 131 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENN 342 T A+ + K P+ ++ L+DG N Sbjct: 132 RRIRPGGSTALFGAVS----------------QGAAEVRKHSDAPYVNRVVLLSDGLANV 175 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + ++ + I + T+ + N + + S HY V ++ L +F Sbjct: 176 GPSRPADLARLGAALLKEGISVTTVGVGTDFNEDLMTQLAERSDGNHYFVESSRDLPRIF 235 Query: 403 -QNISQLMVHRKYSVIL 418 + ++ V++ Sbjct: 236 AAELGDVLSVVARKVVI 252 >gi|117919904|ref|YP_869096.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117612236|gb|ABK47690.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 338 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 64/235 (27%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM V L ++ Sbjct: 85 DLMLAVDLSGSMQIEDMVINGKVVDRFT-----------LIQHVVSEFIERRKGD----- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T ++ A + + Sbjct: 129 -RIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAGNIEPEQAAQ--IAANRKVTIYTVGVGA 227 Query: 372 S-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + LK + ++ N+ L ++Q I +L Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELEQIYQEIDKL 282 >gi|114046974|ref|YP_737524.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113888416|gb|ABI42467.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 338 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 64/235 (27%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM V L ++ Sbjct: 85 DLMLAVDLSGSMQIEDMVINGKVVDRFT-----------LIQHVVSEFIERRKGD----- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T ++ A + + Sbjct: 129 -RIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAGNIEPEQAAQ--IAANRKVTIYTVGVGA 227 Query: 372 S-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + LK + ++ N+ L ++Q I +L Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELEQIYQEIDKL 282 >gi|47205231|emb|CAG06181.1| unnamed protein product [Tetraodon nigroviridis] Length = 427 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/244 (14%), Positives = 77/244 (31%), Gaps = 45/244 (18%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 +R + S + + +VD S S+ + Sbjct: 28 GRRNHAIDPSSTVSPASGSSCRNGPIDLVFIVDSSRSV--------------------RP 67 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 T+ K L +DS+++ + +GL+ Y + V K Sbjct: 68 TEFEKAKEFLQDLVDSLEV---GLDSTRVGLVNYASTVRMEFPLKAHFSKPALKGALARV 124 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + A++ A + + + G ++ S + + +TDG + Sbjct: 125 EPLASGTMTGLAIRTAVEKAFAAEA----------GARLNSTKVARVAVVVTDG---RPQ 171 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIH 400 V + A+E+ I+I + + + L+ S P ++ + V + L Sbjct: 172 DEVERVS--AAARESGIEIYAVGV--DRADRTSLRLMASQPHEDHVFYVETYGVIEKLTS 227 Query: 401 VFQN 404 F+ Sbjct: 228 RFRE 231 >gi|91223292|ref|ZP_01258558.1| hypothetical protein V12G01_05596 [Vibrio alginolyticus 12G01] gi|91192105|gb|EAS78368.1| hypothetical protein V12G01_05596 [Vibrio alginolyticus 12G01] Length = 334 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 77/235 (32%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM + ++ A+K L F+ ++ Sbjct: 98 DLMLVVDLSGSM----------QQEDMELNGEYIDRLTAVKKVLSDFVAK-------RKG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + + V Q + + + L+ + T + Sbjct: 141 DRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGI------------- 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 + S Q+ +I L+DG N + ++ + AK+ I T+ + Sbjct: 188 -----GLGTKTFVDSDAPQRVMILLSDGSNTAGV--LEPLEAAEIAKKYNATIYTVGVGA 240 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 A + Q L K + ++ + + L ++ I+QL Sbjct: 241 GEMMVKEFFMTRKVNTAADLDEQTLTKVAEVTGGQYFRARDTEELEKIYDTINQL 295 >gi|261415941|ref|YP_003249624.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261372397|gb|ACX75142.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302326143|gb|ADL25344.1| von Willebrand factor type A domain protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 228 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 55/187 (29%), Gaps = 38/187 (20%) Query: 195 LVVDLSGSMHCA-MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 LV+D SGSM ++ E +N T + A + A++ F + Sbjct: 24 LVLDTSGSMEGQPISELNEGINCFYDAVRSDETALYAAEIAVVTF--------GGSAVLK 75 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 + + + T AM A +L K Sbjct: 76 TDFSTLEHQPDSP------------------NFFANGGTPMGEAMNMALDLLEKRK---- 113 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI--KICDKAKENFIKIVTISINA 371 + +Q +I+ +TDG+ N S + C+ K + I I I Sbjct: 114 -----GEYKASGVDYYQPWIVLMTDGKPNGDSSEYARAVQRTCEMIKNRKLTIFPIGIGE 168 Query: 372 SPNGQRL 378 + L Sbjct: 169 DADMNAL 175 >gi|153830331|ref|ZP_01982998.1| conserved hypothetical protein [Vibrio cholerae 623-39] gi|148874174|gb|EDL72309.1| conserved hypothetical protein [Vibrio cholerae 623-39] Length = 318 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 47/280 (16%), Positives = 85/280 (30%), Gaps = 60/280 (21%) Query: 151 MTSYDYRLQFIEHLL----NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 TS ++ LL + I + LVVDLS SM Sbjct: 41 STSGSRPATWLPRLLAVGVWFLLITAAARPVWYGDPISTSTSHR-DLMLVVDLSYSMS-- 97 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 ED+ S D ++ A+K L F+ +E +GLI + Sbjct: 98 ----QEDMQSGQQMVD----RLTAVKQVLSEFIAK-------REGDRIGLILFADHAYLQ 142 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + V + + + LI T + A + Sbjct: 143 TPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSNA---------------- 186 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------- 373 Q+ +I L+DG N + + AK+ I T+ + A Sbjct: 187 --PQRVMILLSDGSNTAGVLDPLEAA--NIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVN 242 Query: 374 -----NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L ++ +++ N L +++ I+QL Sbjct: 243 TAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQL 282 >gi|316933619|ref|YP_004108601.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] gi|315601333|gb|ADU43868.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] Length = 483 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 81/261 (31%), Gaps = 65/261 (24%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 N ++IF ++++ L +G + + +M+SA ++ L A + S Sbjct: 23 GNIAVIFGIALLPLLGFVGAAVDYSRASRARTAMQSALDSTALMVAKDLTSG-------- 74 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 K + S + F + S ++ T+ ++ Sbjct: 75 -------------------KITAENVQSAANTYFTSLYKNTDAPSIDVTATYTPKTSSEN 115 Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 + T+ + T ++N S + L +D+ Sbjct: 116 AKLTVGGTGSINTE-------FMKVMNISQMSLGASSTTTWGGT------RLRVALALDV 162 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 +GSM K++A+K A +D++ S KEDVY+ ++ + Sbjct: 163 TGSMD-------------------SAGKLSAMKTAAKQLIDTLKATSTTKEDVYISIVPF 203 Query: 260 TTRV------EKNIEPSWGTE 274 V + W T Sbjct: 204 NVMVNVGPGNKNATWLDWDTS 224 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 48/143 (33%), Gaps = 25/143 (17%) Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN------- 342 T+ M A+ L + T K + I+ L+DG N Sbjct: 348 GGTNQGIGMFWAWMTLQA-------TAPLYTPAKDSEYKYTDAIVLLSDGMNTKNRWYGN 400 Query: 343 ----NFKSNVNTIKICD--KAKENFI---KIVTISINAS-PNGQRLLKTCVSSPEYHYNV 392 + + + +CD K N + I TI +N S +LK C S+ ++ Sbjct: 401 GSNWSPQVDDRQKILCDNITTKVNGVPETTIYTIQVNTSGDPESSVLKYCGSTGG-FFST 459 Query: 393 VNADSLIHVFQNISQLMVHRKYS 415 A + FQ + + + + Sbjct: 460 TTASGIQSAFQEVGASLTKLRIA 482 >gi|149922245|ref|ZP_01910682.1| hypothetical protein PPSIR1_07355 [Plesiocystis pacifica SIR-1] gi|149816878|gb|EDM76364.1| hypothetical protein PPSIR1_07355 [Plesiocystis pacifica SIR-1] Length = 370 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 42/238 (17%), Positives = 90/238 (37%), Gaps = 50/238 (21%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 S + +R G RP ++V+D+ GS S+ D KR ++ A K Sbjct: 122 SAAISAVRPAPGGRPT---QVVIDIDGS-------------SSMRRSDPKRERVRAAKR- 164 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 F++++ + + G+I + T VE+ ++ + T Sbjct: 165 ---FVETLSRVDKRNQF---GVIEFNTTVEERAPMG---SGMKATSDAIQAVDAVGSTAL 215 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 ++ ++ L K +++ I+ LTDG++ V T+ Sbjct: 216 YTSLIRSIDALEGSGKTG----------------YRRAILVLTDGKDTASSHGVATVI-- 257 Query: 355 DKAKENFIKIVTISINASPNGQRL-----LKTCVSSPEYHY-NVVNADSLIHVFQNIS 406 ++AK ++I +S+ + + + L ++ + + +V AD L+ F I+ Sbjct: 258 NRAKAAKVRIYVVSLGGAGDQKGLGYVGPMQRLTTETGGVFTHVDRADDLVARFDAIA 315 >gi|153802375|ref|ZP_01956961.1| conserved hypothetical protein [Vibrio cholerae MZO-3] gi|124122094|gb|EAY40837.1| conserved hypothetical protein [Vibrio cholerae MZO-3] Length = 318 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 44/235 (18%), Positives = 78/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 85 DLMLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSEFIAK-------REG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + + LI T + A + Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFID---- 183 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S Q+ +I L+DG N + D AK+ I T+ + A Sbjct: 184 --------------SDAPQRVMILLSDGSNTAGVLDPLEAA--DIAKQYHTTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L ++ +++ N L +++ I+QL Sbjct: 228 GEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQL 282 >gi|261250853|ref|ZP_05943427.1| protein BatA [Vibrio orientalis CIP 102891] gi|260937726|gb|EEX93714.1| protein BatA [Vibrio orientalis CIP 102891] Length = 322 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 43/232 (18%), Positives = 77/232 (33%), Gaps = 55/232 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+DLS SM ED+ D +++A+KN + F+ +E Sbjct: 86 DLMLVLDLSYSMS------QEDMQEGDQYID----RLSAVKNVVSDFVKQ-------REG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + E + V + LI T + A D + Sbjct: 129 DRLGLVLFADHAYLQTPLTLDRETISDQVNSLVLRLIGDKTAIGEGIGLA-TKTFVDSEA 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 Q+ ++ L+DG N + + ++ AK+ I TI I Sbjct: 188 -----------------PQRVMVLLSDGSNTSGV--LEPLEAARIAKKYNATIYTIGIGA 228 Query: 370 ---------------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 A ++ LK + ++ NAD L ++ I Sbjct: 229 GEMMVKEFFMTRKVNTAKDLDEKTLKQIADLTGGQYFRARNADELATIYDTI 280 >gi|190339201|gb|AAI63867.1| Matn4 protein [Danio rerio] Length = 944 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 75/197 (38%), Gaps = 33/197 (16%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K + +D +D+ + +GL+ Y++RV S + + + Sbjct: 724 RPQNFELVKQFVNQVVDQLDV---SAKGTRVGLVQYSSRVRTEFPLS--MYHSKDEIKKA 778 Query: 284 M--DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + K T + A+K + S+ + + IP + +F TDG Sbjct: 779 VMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEK-----NIPRVGL----VF-TDG-- 826 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + ++ + KAKE I + + + + + L+ S P ++ + + Sbjct: 827 -RSQDDIQ--EWAKKAKEAGITMYAVGVGKAVEDE--LREIASDPVEKHFFYSAD----- 876 Query: 400 HVFQNISQLMVHRKYSV 416 F ISQ+ + K +V Sbjct: 877 --FTAISQIAENLKLNV 891 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 62/169 (36%), Gaps = 25/169 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 +D I L +G++ Y+++V+ K Q V + + L T + Sbjct: 58 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLK-AFSKTEQMVKAINEIIPLAQGTMTGL 116 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S ++ P++P I+ TDG + + V Sbjct: 117 AIRYAMNVAFSAEE-----------GARPNVPHVAVIV--TDGRPQDRVAEVAAAAR--- 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 E+ I+I + + L+ S P ++ + V + LIH F Sbjct: 161 --ESGIEIYAVGVA--RADMTSLRAMASPPFEDHVFLVESF-DLIHQFG 204 >gi|56797871|emb|CAG27569.1| matrilin-4 [Danio rerio] Length = 644 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 75/197 (38%), Gaps = 33/197 (16%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K + +D +D+ + +GL+ Y++RV S + + + Sbjct: 424 RPQNFELVKQFVNQVVDQLDV---SAKGTRVGLVQYSSRVRTEFPLS--MYHSKDEIKKA 478 Query: 284 M--DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + K T + A+K + S+ + + IP + +F TDG Sbjct: 479 VMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEK-----NIPRVGL----VF-TDG-- 526 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + ++ + KAKE I + + + + + L+ S P ++ + + Sbjct: 527 -RSQDDIQ--EWAKKAKEAGITMYAVGVGKAVEDE--LREIASDPVEKHFFYSAD----- 576 Query: 400 HVFQNISQLMVHRKYSV 416 F ISQ+ + K +V Sbjct: 577 --FTAISQIAENLKLNV 591 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 62/169 (36%), Gaps = 25/169 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 +D I L +G++ Y+++V+ K Q V + + L T + Sbjct: 45 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLK-AFSKTEQMVKAINEIIPLAQGTMTGL 103 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S ++ P++P I+ TDG + + V Sbjct: 104 AIRYAMNVAFSAEE-----------GARPNVPHVAVIV--TDGRPQDRVAEVAAAAR--- 147 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 E+ I+I + + L+ S P ++ + V + LIH F Sbjct: 148 --ESGIEIYAVGVA--RADMTSLRAMASPPFEDHVFLVESF-DLIHQFG 191 >gi|56797865|emb|CAG27566.1| matrilin-4 [Danio rerio] Length = 726 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 75/197 (38%), Gaps = 33/197 (16%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K + +D +D+ + +GL+ Y++RV S + + + Sbjct: 506 RPQNFELVKQFVNQVVDQLDV---SAKGTRVGLVQYSSRVRTEFPLS--MYHSKDEIKKA 560 Query: 284 M--DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + K T + A+K + S+ + + IP + +F TDG Sbjct: 561 VMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEK-----NIPRVGL----VF-TDG-- 608 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + ++ + KAKE I + + + + + L+ S P ++ + + Sbjct: 609 -RSQDDIQ--EWAKKAKEAGITMYAVGVGKAVEDE--LREIASDPVEKHFFYSAD----- 658 Query: 400 HVFQNISQLMVHRKYSV 416 F ISQ+ + K +V Sbjct: 659 --FTAISQIAENLKLNV 673 Score = 43.3 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 61/169 (36%), Gaps = 25/169 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 +D I L +G + Y+++V+ K Q V + + L T + Sbjct: 45 MIDIIHELDIGLAATRIGAVQYSSQVQNVFSLK-AFSKTEQMVKAINEIIPLAQGTMTGL 103 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S ++ P++P I+ TDG + + V Sbjct: 104 AIRYAMNVAFSAEE-----------GARPNVPHVAVIV--TDGRPQDRVAEVAAAAR--- 147 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 E+ I+I + + L+ S P ++ + V + LIH F Sbjct: 148 --ESGIEIYAVGVA--RADMTSLRAMASPPFEDHVFLVESF-DLIHQFG 191 >gi|56797863|emb|CAG27565.1| matrilin-4 [Danio rerio] Length = 944 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 75/197 (38%), Gaps = 33/197 (16%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K + +D +D+ + +GL+ Y++RV S + + + Sbjct: 724 RPQNFELVKQFVNQVVDQLDV---SAKGTRVGLVQYSSRVRTEFPLS--MYHSKDEIKKA 778 Query: 284 M--DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + K T + A+K + S+ + + IP + +F TDG Sbjct: 779 VMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEK-----NIPRVGL----VF-TDG-- 826 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + ++ + KAKE I + + + + + L+ S P ++ + + Sbjct: 827 -RSQDDIQ--EWAKKAKEAGITMYAVGVGKAVEDE--LREIASDPVEKHFFYSAD----- 876 Query: 400 HVFQNISQLMVHRKYSV 416 F ISQ+ + K +V Sbjct: 877 --FTAISQIAENLKLNV 891 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 62/169 (36%), Gaps = 25/169 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 +D I L +G++ Y+++V+ K Q V + + L T + Sbjct: 58 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLK-AFSKTEQMVKAINEIIPLAQGTMTGL 116 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S ++ P++P I+ TDG + + V Sbjct: 117 AIRYAMNVAFSAEE-----------GARPNVPHVAVIV--TDGRPQDRVAEVAAAAR--- 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 E+ I+I + + L+ S P ++ + V + LIH F Sbjct: 161 --ESGIEIYAVGVA--RADMTSLRAMASPPFEDHVFLVESF-DLIHQFG 204 >gi|47087209|ref|NP_998714.1| matrilin-2 [Danio rerio] gi|45827653|gb|AAS78465.1| matrilin-4-like protein [Danio rerio] Length = 821 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 75/197 (38%), Gaps = 33/197 (16%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K + +D +D+ + +GL+ Y++RV S + + + Sbjct: 601 RPQNFELVKQFVNQVVDQLDV---SAKGTRVGLVQYSSRVRTEFPLS--MYHSKDEIKKA 655 Query: 284 M--DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + K T + A+K + S+ + + IP + +F TDG Sbjct: 656 VMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEK-----NIPRVGL----VF-TDG-- 703 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + ++ + KAKE I + + + + + L+ S P ++ + + Sbjct: 704 -RSQDDIQ--EWAKKAKEAGITMYAVGVGKAVEDE--LREIASDPVEKHFFYSAD----- 753 Query: 400 HVFQNISQLMVHRKYSV 416 F ISQ+ + K +V Sbjct: 754 --FTAISQIAENLKLNV 768 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 62/169 (36%), Gaps = 25/169 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 +D I L +G++ Y+++V+ K Q V + + L T + Sbjct: 58 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLK-AFSKTEQMVKAINEIIPLAQGTMTGL 116 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S ++ P++P I+ TDG + + V Sbjct: 117 AIRYAMNVAFSAEE-----------GARPNVPHVAVIV--TDGRPQDRVAEVAAAAR--- 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 E+ I+I + + L+ S P ++ + V + LIH F Sbjct: 161 --ESGIEIYAVGVA--RADMTSLRAMASPPFEDHVFLVESF-DLIHQFG 204 >gi|125535226|gb|EAY81774.1| hypothetical protein OsI_36948 [Oryza sativa Indica Group] Length = 633 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 72/210 (34%), Gaps = 36/210 (17%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D+SGSM N +A + + +++ LK ++ + +D Sbjct: 70 IDVVAVLDVSGSM----NDPVAAAAAASPESNLQASRLDVLKASMKFIIRKLDDGD---- 121 Query: 251 DVYMGLIGYTT---RVEKNIEPSW---GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + ++ + + + G + + R T PA+++A +I Sbjct: 122 --RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQA---RGGTALMPALEEAVKI 176 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L RQG + FI+ LTDG++ AK + Sbjct: 177 L-----------DERQGGSRNHVG---FILLLTDGDDTTGFRWTRDAIHGAVAK---YPV 219 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVN 394 T + AS + + LL S + V + Sbjct: 220 HTFGLGASHDPEALLHIAQGSRGTYSFVDD 249 >gi|229514670|ref|ZP_04404131.1| protein BatA [Vibrio cholerae TMA 21] gi|229348650|gb|EEO13608.1| protein BatA [Vibrio cholerae TMA 21] Length = 318 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 77/235 (32%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 85 DLMLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSEFIAK-------REG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + + LI T + A + Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSNA- 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ +I L+DG N + + AK+ I T+ + A Sbjct: 187 -----------------PQRVMILLSDGSNTAGVLDPLEAA--NIAKQYHTTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L ++ +++ N L +++ I+QL Sbjct: 228 GEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQL 282 >gi|229527849|ref|ZP_04417240.1| protein BatA [Vibrio cholerae 12129(1)] gi|229334211|gb|EEN99696.1| protein BatA [Vibrio cholerae 12129(1)] gi|327485392|gb|AEA79798.1| BatA aerotolerance operon [Vibrio cholerae LMA3894-4] Length = 318 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 77/235 (32%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 85 DLMLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSEFIAK-------REG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + + LI T + A + Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSNA- 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ +I L+DG N + + AK+ I T+ + A Sbjct: 187 -----------------PQRVMILLSDGSNTAGVLDPLEAA--NIAKQYHTTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L ++ +++ N L +++ I+QL Sbjct: 228 GEMVVKDFLFSRKLNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQL 282 >gi|121586746|ref|ZP_01676529.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121728206|ref|ZP_01681240.1| conserved hypothetical protein [Vibrio cholerae V52] gi|147672023|ref|YP_001215942.1| hypothetical protein VC0395_1106 [Vibrio cholerae O395] gi|153816797|ref|ZP_01969464.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|227811796|ref|YP_002811806.1| hypothetical protein VCM66_A0168 [Vibrio cholerae M66-2] gi|229506663|ref|ZP_04396172.1| protein BatA [Vibrio cholerae BX 330286] gi|262167807|ref|ZP_06035508.1| protein BatA [Vibrio cholerae RC27] gi|298500027|ref|ZP_07009833.1| conserved hypothetical protein [Vibrio cholerae MAK 757] gi|121549043|gb|EAX59080.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121629529|gb|EAX61953.1| conserved hypothetical protein [Vibrio cholerae V52] gi|126512600|gb|EAZ75194.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|146314406|gb|ABQ18946.1| conserved hypothetical protein [Vibrio cholerae O395] gi|227010938|gb|ACP07149.1| conserved hypothetical protein [Vibrio cholerae M66-2] gi|227014797|gb|ACP11006.1| conserved hypothetical protein [Vibrio cholerae O395] gi|229357014|gb|EEO21932.1| protein BatA [Vibrio cholerae BX 330286] gi|262023715|gb|EEY42415.1| protein BatA [Vibrio cholerae RC27] gi|297542008|gb|EFH78059.1| conserved hypothetical protein [Vibrio cholerae MAK 757] Length = 318 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 77/235 (32%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 85 DLMLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSEFIAK-------REG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + + LI T + A + Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSNA- 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ +I L+DG N + + AK+ I T+ + A Sbjct: 187 -----------------PQRVMILLSDGSNTAGVLDPLEAA--NIAKQYHTTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L ++ +++ N L +++ I+QL Sbjct: 228 GEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQL 282 >gi|1706571|sp|P54281|ECLC_BOVIN RecName: Full=Epithelial chloride channel protein; AltName: Full=Calcium-activated chloride channel gi|1184066|gb|AAC48511.1| calcium-activated chloride channel [Bos taurus] Length = 903 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 79/237 (33%), Gaps = 59/237 (24%) Query: 177 IPALLRIEMGERPIFL--------IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + +P F + LV+D SGSM + Sbjct: 285 TSPMTEMNPPTQPTFSLLKSKQRVVCLVLDKSGSMSSEDRLFRMNQA------------- 331 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSL 287 A L + I+ S V G++ + + E + T + V + +T ++ Sbjct: 332 -----AELFLIQIIEKGSLV------GMVTFDSVAEIRNNLTKITDDNVYENITANLPQE 380 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +K +Q + ++ + + II LTDGE+N Sbjct: 381 ANGGTSICRGLKAGFQAIIQSQQSTSGSE----------------IILLTDGEDNE---- 420 Query: 348 VNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEY-HYNVVNADSLIHVF 402 I C ++ K++ + I TI++ + L + + Y + + L + F Sbjct: 421 ---IHSCIEEVKQSGVIIHTIAL-GPSAAKELETLSDMTGGHRFYANKDINGLTNAF 473 >gi|159901412|ref|YP_001547659.1| hypothetical protein Haur_4901 [Herpetosiphon aurantiacus ATCC 23779] gi|159894451|gb|ABX07531.1| conserved hypothetical membrane protein [Herpetosiphon aurantiacus ATCC 23779] Length = 330 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 41/261 (15%), Positives = 92/261 (35%), Gaps = 45/261 (17%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI---FLIELVVDLSGSMHCA 206 V S+ R++ + L +V + + ER + I+L +D+S SM Sbjct: 44 VKPSWRIRMRPVLISLRAAAVGLLVVVLTRPQYAQSSERVVREGIDIQLALDISLSMKAG 103 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 + DP+D ++ K + F+ ++D +GL+ ++ Sbjct: 104 -DFDPKD-------------RITVAKEVIAEFVK-------GRKDDRIGLVVFSGHAFTQ 142 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + + ++ + + T A+ + L + S Sbjct: 143 VPLTLDYDFLQNLLGQVQTVRRPDGTAIGLALAHSVNGLRN------------------S 184 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 K +I LTDG NN ++ + + A+ +++ TI + NG+ + Sbjct: 185 TTKSKVVILLTDGSNNR--GDIEPAQAAEIARALDVRVYTILVGKPGNGEYPVHDPWRDE 242 Query: 387 EYHYNVVNADSLIHVFQNISQ 407 Y A+ + ++I++ Sbjct: 243 TYLIPAPTAEDEV-ALRDIAE 262 >gi|301764008|ref|XP_002917404.1| PREDICTED: matrilin-4-like [Ailuropoda melanoleuca] Length = 594 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 65/170 (38%), Gaps = 24/170 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 + + L +G+I Y+++V+ G R+ + + +L+ + T + Sbjct: 59 LVGLLRGLDVGPNATRVGVIQYSSQVQSVFPL--GAFARREDMEHAIRALVPLAQGTMTG 116 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + S + G + P + + +TDG + ++ Sbjct: 117 LAIQYAMNVAFSVAE----------GARPPEARVPRIAVIVTDG---RPQD--RVAEVAA 161 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 +A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 162 QARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 208 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 374 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 431 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 432 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 476 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + L+ S P + + D F ++ L+ + + S+ Sbjct: 477 EGIVMYAVGVG--KAVEEELRQIASEPAELHVSYSPD-----FGTMTHLLDNLRGSI 526 >gi|281345581|gb|EFB21165.1| hypothetical protein PANDA_005644 [Ailuropoda melanoleuca] Length = 581 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 65/170 (38%), Gaps = 24/170 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 + + L +G+I Y+++V+ G R+ + + +L+ + T + Sbjct: 35 LVGLLRGLDVGPNATRVGVIQYSSQVQSVFPL--GAFARREDMEHAIRALVPLAQGTMTG 92 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + S + G + P + + +TDG + ++ Sbjct: 93 LAIQYAMNVAFSVAE----------GARPPEARVPRIAVIVTDG---RPQD--RVAEVAA 137 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 +A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 138 QARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 184 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 391 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 448 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 449 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 493 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + L+ S P + + D F ++ L+ + + S+ Sbjct: 494 EGIVMYAVGVG--KAVEEELRQIASEPAELHVSYSPD-----FGTMTHLLDNLRGSI 543 >gi|329848392|ref|ZP_08263420.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328843455|gb|EGF93024.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 434 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 61/467 (13%), Positives = 139/467 (29%), Gaps = 100/467 (21%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K+ + N +I L++ L IG + K ++ A + A + + Sbjct: 5 LKRFFRDTRGNVIMIIGLALPVVFLAIGGAVDFSRVMQLKKELQDAADVASVGSVA---- 60 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 ++++A +A F + S V + ++ NI + + Sbjct: 61 -----------VNSYAYKANTKGHSSFKTGENQALAIFNSNVKKHNDLNNI--KVKAKIK 107 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + N + T Y ++ L+ + + P Sbjct: 108 KQSTNLVSEIGVTADYR-----------PYLLGLMGMNTMPITIKSTSSSTFP-----PY 151 Query: 191 FLIELVVDLSGSMHCAMNSDPEDV---NSAPIC-------------------QDKKRTKM 228 L++D S SM + D N++ C + K T++ Sbjct: 152 IDFYLLLDNSPSMGVGATTKDIDTMVANTSDKCAFACHQMDKAGNDYYALAKKLKVTTRI 211 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ A + + + + M + + ++ V+ +L Sbjct: 212 DVVRQATQNLMTTAKNTQTLTDQYRMAIYHFGMAADQIDS----KNPAPYEVSALTTNLS 267 Query: 289 LKPTDST--PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF-------QKFIIFLTDG 339 +++ M YQ SD++ +F + IPS Q+ + F++DG Sbjct: 268 TSASNAAKIDLMTIPYQNYNSDRQTNFPSYLLGMNKVIPSSGDGSSSSKPQQVLFFVSDG 327 Query: 340 ENNNFK---SNVNT--------IKICDKAKENFIKI---VTISINASPNG---------- 375 N+ + SN + C K +KI T + N Sbjct: 328 ANDGYDCAYSNGASCRRISPLDTPQCKAMKARGVKIAVLYTTYLPLPTNAFYNSHLAKYV 387 Query: 376 ------QRLLKTCVSSPEYHYNVVNADSLIHVFQNI-SQLMVHRKYS 415 ++ C + ++ V + + + ++++ + S Sbjct: 388 SPTSQLAAKMQECAT-EGLYFEVGPNEGISEAMNALFAKVISTVRIS 433 >gi|260426945|ref|ZP_05780924.1| von Willebrand factor type A [Citreicella sp. SE45] gi|260421437|gb|EEX14688.1| von Willebrand factor type A [Citreicella sp. SE45] Length = 334 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 77/226 (34%), Gaps = 43/226 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E + L +D+SGSM + P+ ++ ++A +++ + F++ Sbjct: 90 ETAARDVVLAIDISGSMDSRDFTAPDGT---------RKQRLAGVRDVVRGFVE------ 134 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ M LI + + + + + + R + T ++ A + Sbjct: 135 -GRDGDRMALIVFGSAAYLQAPLTADLDAILALLDRTQVGMAGPHTALGDSIGLAIRTFE 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + Q+ +I L+DG + + + + A ++I T Sbjct: 194 SSEID------------------QRLLILLSDGSDTASRMDPVNAA--EIAAGRDVEIFT 233 Query: 367 ISINASPN------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 + + L+ + ++ + +L V+++I Sbjct: 234 VGVGDPDATGENRVDLDTLRAIADRTGGAYFFAADEAALTAVYESI 279 >gi|170591769|ref|XP_001900642.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158591794|gb|EDP30397.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 381 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 63/165 (38%), Gaps = 19/165 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMK 299 + ++ + V + LI Y+ + + + + V R ++ L T + A+ Sbjct: 219 VRNINVHPDAVRLALITYSGQAYIHFKFNDPQIGNNTSVIRHLNGLKSIKGTTSTHIALH 278 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 QAY++LT + +K II TDG + ++ K+ Sbjct: 279 QAYKLLTDTDNENGVREG-----------VKKMIIIFTDGHSQRSPQDMALR-----LKD 322 Query: 360 NFIKIVTISIN-ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++I I++ A + L + + ++ + VN LI ++ Sbjct: 323 KGVEIFAITLTPAPYADEGELLSITQNTDHIFTPVNLKVLITTYK 367 >gi|126731725|ref|ZP_01747530.1| Von Willebrand domain containing protein [Sagittula stellata E-37] gi|126707891|gb|EBA06952.1| Von Willebrand domain containing protein [Sagittula stellata E-37] Length = 321 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 59/183 (32%), Gaps = 38/183 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + +DLSGSM V + T++ A+ + Sbjct: 92 DLAIALDLSGSM----------VRDDFYLDGQPITRLEAV-TTVGAEFARRRAGD----- 135 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + LI + + ++ TE + + + + + T+ + A+ A Sbjct: 136 -RVALIVFGSEAYYAAPFTFDTEAIARRIEEATIGISGRATNISDALGLAL--------- 185 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + +I L+DG NN +N + A + +++ TI+ Sbjct: 186 ---------KRMAGSDADTRVVILLSDGANNAGATNPRGVAQ--LAAQMGVRVHTIA-MG 233 Query: 372 SPN 374 + Sbjct: 234 PKS 236 >gi|53713710|ref|YP_099702.1| hypothetical protein BF2419 [Bacteroides fragilis YCH46] gi|60681981|ref|YP_212125.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|253565658|ref|ZP_04843113.1| BatA [Bacteroides sp. 3_2_5] gi|265764034|ref|ZP_06092602.1| BatA [Bacteroides sp. 2_1_16] gi|4838138|gb|AAD30858.1|AF116251_1 BatA [Bacteroides fragilis] gi|52216575|dbj|BAD49168.1| conserved hypothetical protein BatA [Bacteroides fragilis YCH46] gi|60493415|emb|CAH08201.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|251945937|gb|EES86344.1| BatA [Bacteroides sp. 3_2_5] gi|263256642|gb|EEZ27988.1| BatA [Bacteroides sp. 2_1_16] gi|301163419|emb|CBW22970.1| aerotolerance-related membrane protein [Bacteroides fragilis 638R] Length = 327 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/239 (16%), Positives = 73/239 (30%), Gaps = 63/239 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D K ++ A K+ F++ Sbjct: 88 IDIMLAIDVSTSMLAE---------------DLKPNRLEAAKDVAAEFIN----GRPNDN 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G+ + + + +I T + A L K Sbjct: 129 ---IGITLFAGESFTQCPLTVDHAVLLNLFQGIQCDIIEDGTAVGMGIANAVTRLKDSKA 185 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 +S K II LTDG NN K +++ + + AK I++ TI + Sbjct: 186 KS------------------KVIILLTDGTNN--KGDISPLTAAEIAKSFGIRVYTIGVG 225 Query: 370 ---NAS-----------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 A + + L + ++ ++ + L V++ I +L Sbjct: 226 TNGMAPYPVRVGGTTQYINTPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDKL 284 >gi|262202333|ref|YP_003273541.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262085680|gb|ACY21648.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 325 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 72/244 (29%), Gaps = 69/244 (28%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+D+S SM D +++ A ++A F + E Sbjct: 89 TVILVMDVSRSM---------------NATDVAPSRIRAAQSAAKKF------ADDLTEG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + +GLI + S ++ PT A K+A L K Sbjct: 128 INLGLISF----------------------AGTPSTLVSPTPDHTATKKAVDKLVLADKT 165 Query: 312 SFFTNFFRQGVKI---------PSLPFQKFIIFLTDGENNNFKS--NVNTIKICD-KAKE 359 + F +I P I+ L+DG+ + KAKE Sbjct: 166 ATGEGIFAALDQIRTLNAVLGGPEAAPPAHIVLLSDGKQTVPDEPTDPRGAFTAARKAKE 225 Query: 360 NFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + TIS + L + S + + D L V++ + Sbjct: 226 EGIPVSTISFGTAYGTVELDGDRVPVPVDDPSLKQIANLSGGNFFTASSLDELNEVYEKL 285 Query: 406 SQLM 409 + Sbjct: 286 QSEI 289 >gi|238060187|ref|ZP_04604896.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237881998|gb|EEP70826.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 779 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 32/235 (13%), Positives = 73/235 (31%), Gaps = 45/235 (19%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + A+LR R I L VD SGSM+ + RT+ A Sbjct: 575 VVDAILRGWSTLRKKANILLAVDTSGSMNARVGG---------------RTRFQVATTAA 619 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEK------NIEPSWGTEKVRQYVTRDMDSLIL 289 + ++ + L +++ ++ + E G + R + Sbjct: 620 DRAVGLLNSAD------RVALWSFSSETDQRRGKPYSEEIRLGPYDRAAFTRRLTGLRVG 673 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV- 348 T ++ A++ L + ++ LTDG+N + N Sbjct: 674 GNTALYATVRAAHRRLLDNHDPDRINA----------------VVVLTDGKNEYPRDNDL 717 Query: 349 -NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + +K+ ++ + + L + +S ++ + ++ F Sbjct: 718 DRLLADIELDPNRPVKVFCVAFDRESDLAALDRIAGASAGKAFDATDPATIDEAF 772 >gi|152993961|ref|YP_001359682.1| von Willebrand factor type A domain-containing protein [Sulfurovum sp. NBC37-1] gi|151425822|dbj|BAF73325.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 325 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 87/270 (32%), Gaps = 48/270 (17%) Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSF--IPALLRIEMGERPIFLIELVVDLSGSMHC 205 + ++ +F+ +L + I + ++ +R + + VDLSGSM Sbjct: 51 VSGISRATKGEKFLLFILYVLVLSALAKPNLIGEPVTKDVSQRELL---ISVDLSGSM-- 105 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 + K ++ A+K L FL ++ +GLI + Sbjct: 106 -------MTKDFVNKEGKAIDRLEAVKMVLRDFLKE-------RKGEKIGLILFGNAAFV 151 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + + + + + T ++ A ++ Sbjct: 152 QAPFTQDLDALEHLLDSLRVGMAGPQTAMGDSIGLAVKMFRESNVT-------------- 197 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN------GQRLL 379 + +I ++DG++ K T + A +N + + TI I N L Sbjct: 198 ----DRMLIVMSDGDDTGSKVPPKTSA--ELAAKNGVNVFTIGIGDPKNAGEHPIDTDTL 251 Query: 380 KTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 K + + Y N D L +++ I +L Sbjct: 252 KEIAAITGGKFYYAWNLDDLQDIYKQIDKL 281 >gi|332982109|ref|YP_004463550.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] gi|332699787|gb|AEE96728.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] Length = 948 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 71/232 (30%), Gaps = 53/232 (22%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + P + LV+D SGSM TK+ K A + +++ Sbjct: 398 KKADIPSLGLVLVIDKSGSM---------------TDGQYGITKLEMAKEAAIRSTEALR 442 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 V + + + + + + + T+ PA+ AY+ Sbjct: 443 PTDSVGVICFDDAASWVVGMRQADDLA-------EIQDSIGTIRPGGGTNMYPALDLAYK 495 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + K II LTDG++ + I + E+ I Sbjct: 496 ALEEADTKL------------------KHIIVLTDGQSATGDFDG----IAHRMAEDGIT 533 Query: 364 IVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + ++++ + LL +Y F NI +++ Y Sbjct: 534 LSSVAVGM-DADKNLLSRLAEIGNGRYYY-------TDEFSNIPKILTKETY 577 >gi|148976298|ref|ZP_01813022.1| hypothetical protein VSWAT3_18848 [Vibrionales bacterium SWAT-3] gi|145964392|gb|EDK29647.1| hypothetical protein VSWAT3_18848 [Vibrionales bacterium SWAT-3] Length = 401 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 44/296 (14%), Positives = 96/296 (32%), Gaps = 53/296 (17%) Query: 156 YRLQFIE-HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV--DLSGSMHCAMNSDPE 212 Y + + H +RY+Q + + + + ELV+ D+SGSM + S Sbjct: 100 YSVTPLLPHFNYERYDQNVTATGGGFKGVVESKHSAIPTELVLVLDVSGSMSPNIQSLKS 159 Query: 213 DVNSAPIC---QDKKRTKMAALKNALLLF------------------------------- 238 +++A Q + ++ +++ F Sbjct: 160 ILSNALNTIQSQSNNANDLDSVSISIVPFDSGVATHRPPWLSEETAGIYCIDGLSYRNGD 219 Query: 239 ------LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 +D++ L + + + + + T + T Sbjct: 220 FSASLTVDNLATLHSERPVKFTPPSKWLSDCNQESPMLPLTNVFSRVQNSINSLTANGGT 279 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP-FQKFIIFLTDGENNNFKSNV-NT 350 S + + L ++++ G+K+ S+P ++ ++ TDG + N Sbjct: 280 RSYQGLVWGVRQLIPSWQQAW-------GMKVSSVPETRRKLVLFTDGADEGDAFNQLVN 332 Query: 351 IKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 C A K+ I++ I SP+ + C +P ++ N L F +I Sbjct: 333 AGFCTTAIKQYGIEMNFIGYGVSPSRITQFENCAGNPLRVFSATNTTQLNEYFSDI 388 >gi|187470892|sp|A6NF34|ANTRL_HUMAN RecName: Full=Anthrax toxin receptor-like; Flags: Precursor Length = 565 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 71/207 (34%), Gaps = 44/207 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + F + ++D SGS++ L + + + Sbjct: 72 QGSFDLYFILDKSGSVNNNWI---------------------DLYMWVEETVARFQSPN- 109 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + M I Y+T + + + +++ + + + T ++A Q + S Sbjct: 110 ----IRMCFITYSTDGQTVLPLTSDKNRIKNGLDQLQKIVPDGHTFMQAGFRKAIQQIES 165 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 G K+PS+ II +TDGE + +T++ KA++ + T+ Sbjct: 166 FN----------SGNKVPSM-----IIAMTDGE-LVAHAFQDTLREAQKARKLGANVYTL 209 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVN 394 + Q + SP + + V N Sbjct: 210 GVADYNLDQ--ITAIADSPGHVFAVEN 234 >gi|56460106|ref|YP_155387.1| von Willebrand factor type A (vWA) domain-containing protein [Idiomarina loihiensis L2TR] gi|56179116|gb|AAV81838.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Idiomarina loihiensis L2TR] Length = 327 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 84/234 (35%), Gaps = 55/234 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L VDLSGSM A + + ++ +K+ L F++ +E Sbjct: 88 IMLAVDLSGSMEIA----------DMQLEGRSVNRLTMVKHVLSDFIER-------REGD 130 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + ++ V+Q + + L+ + T A+ + + D+K + Sbjct: 131 RLGLILFADTAYLQTPMTYDRNTVKQMLNESVLGLVGERTAIGDAIALSVKRFRDDEKSN 190 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA- 371 + ++ LTDG+N N+ + + A+ + I I++ A Sbjct: 191 ------------------RVLVLLTDGQNTA--GNLPPEQALELAQAYDVTIYPIAVGAE 230 Query: 372 --------------SPNGQR--LLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 L+++ + ++ + + L ++Q + +L Sbjct: 231 EVVVDSFFGQRRVNPSRDLDVPLMQSIAKQTGGKYFRARSTNELEEIYQRLDKL 284 >gi|254228714|ref|ZP_04922137.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Vibrio sp. Ex25] gi|262396564|ref|YP_003288417.1| protein BatA [Vibrio sp. Ex25] gi|151938661|gb|EDN57496.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Vibrio sp. Ex25] gi|262340158|gb|ACY53952.1| protein BatA [Vibrio sp. Ex25] Length = 334 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 77/235 (32%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM + ++ A+K L F+ ++ Sbjct: 98 DLMLVVDLSGSM----------QQEDMELNGEYIDRLTAVKKVLSDFVAK-------RKG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + + V Q + + + L+ + T + Sbjct: 141 DRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGI------------- 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 + S Q+ +I L+DG N + ++ + AK+ I T+ + Sbjct: 188 -----GLGTKTFVDSDAPQRVMILLSDGSNTAGV--LEPLEAAEIAKKYNATIYTVGVGA 240 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 A + Q L K + ++ + D L ++ I+QL Sbjct: 241 GEMMVKEFFMTRKVNTAADLDEQTLTKVAEMTGGQYFRARDTDQLEKIYDTINQL 295 >gi|299135165|ref|ZP_07028356.1| conserved hypothetical protein [Afipia sp. 1NLS2] gi|298590142|gb|EFI50346.1| conserved hypothetical protein [Afipia sp. 1NLS2] Length = 601 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 56/165 (33%), Gaps = 25/165 (15%) Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 S ++ + T+ + +Q L + + F+ K P+ Sbjct: 445 TPMSNQWSTLKSQINAM---TPSGNTNQAVGLFWGWQTLNTTN------DPFKAPAKDPN 495 Query: 327 LPFQKFIIFLTDGENNN-------FKSNVNTIKICDKAKE------NFIKIVTISINASP 373 ++ +I+ L+DG N + +C K+ N I + +I +N S Sbjct: 496 WVYKDYIVLLSDGLNTQNRWTQTVSDIDARQELLCKNIKDPAQNGGNQITVFSIQVNISS 555 Query: 374 NG--QRLLKTCVSSPEYHYN-VVNADSLIHVFQNISQLMVHRKYS 415 ++L+ C + ++ + + F N+ + + S Sbjct: 556 KDPTSKVLQDCATPGAGYFQMITQSSQTADAFNNVLATIAKLRIS 600 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 41/274 (14%), Positives = 88/274 (32%), Gaps = 66/274 (24%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 ++R ++ + N +IIFA+ + + L+G + + +++SA ++A L Sbjct: 6 ITRLTRLARRFQTDARGNVAIIFAIVSIPLVALVGAAVDYTRAVSDRTALQSALDSAALM 65 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 IS A + ++ + A IQN Sbjct: 66 ------------------ISKDAATMSASQITTRARQYVDSLYTATDA-----PIQNFTA 102 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + N+ ++ + + F+ +L +N V+ Sbjct: 103 TY-------------TPNSGSGASILLSANGTMPTYFM-RVLGSNFNTLPVATSSTTKWG 148 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LV+D +GSM + KMAAL++A + + Sbjct: 149 ST----RMRVALVLDNTGSMA-------------------QNGKMAALQSAATDMITKLS 185 Query: 244 LLSHVKEDVYMGLIGY------TTRVEKNIEPSW 271 + DVY+ ++ + +T +W Sbjct: 186 AFNTTTGDVYISIVPFAKDVNVSTSNVSASWLNW 219 >gi|254784286|ref|YP_003071714.1| matrixin family protein [Teredinibacter turnerae T7901] gi|237683907|gb|ACR11171.1| matrixin family protein [Teredinibacter turnerae T7901] Length = 877 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 44/220 (20%), Positives = 76/220 (34%), Gaps = 37/220 (16%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+D SGSM+ + DP +KM ALK A +F+D +DL Sbjct: 421 DVVLVMDRSGSMNLSSAPDP------------SVSKMDALKYAANVFMDFLDL----DAG 464 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 GL+ + V P++ + V S A + A +T+ Sbjct: 465 HRAGLVQF-HEVVVPFSPAFNLQPVNA--------------ASLSAAQTAINSMTAGGMT 509 Query: 312 SF---FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + Q ++ ++ LTDG +N T I + + + ++ Sbjct: 510 NIIDGVNEGIAQLTTAVDPSDRQIMLLLTDGLHNRPVGTSVT-DITAPLLASEVTLYSVG 568 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLI--HVFQNIS 406 S N L +S+ H + L F +I+ Sbjct: 569 FGTSTNEAELTPLALSTGGVHLENKDVSDLQLRKHFLSIA 608 >gi|305663382|ref|YP_003859670.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] gi|304377951|gb|ADM27790.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] Length = 323 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 42/248 (16%), Positives = 81/248 (32%), Gaps = 52/248 (20%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L + + I + + + ++VD SGSM Sbjct: 76 LAGISIVEYVRIPIESSSISSLSFNARTPVVIIVDTSGSMADN----------------- 118 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 M ++K AL +S + +GL+ ++ ++ I P+ R Y+ Sbjct: 119 ----MDSVKYALRTM------VSLFNNTIDIGLVEFSHSIKSAIPPTPN----RSYIDMV 164 Query: 284 MDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 +D + T + A+ A L ++ + F +F+TDG Sbjct: 165 IDRMEAGGGTMYSFALSTALSWLRPYRELNVSA----------------FTVFITDGLPG 208 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHV 401 + + + D+ + I I T+ I P G K S + + V + D L Sbjct: 209 DPQDYRPLL---DEYNKLGIPIYTVFIGEDPRGIDETKLIASKTGGEQFTVESIDRLSDT 265 Query: 402 FQNISQLM 409 I+ + Sbjct: 266 LNTIASKI 273 >gi|149914292|ref|ZP_01902823.1| hypothetical protein RAZWK3B_19866 [Roseobacter sp. AzwK-3b] gi|149811811|gb|EDM71644.1| hypothetical protein RAZWK3B_19866 [Roseobacter sp. AzwK-3b] Length = 597 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 15/71 (21%), Positives = 28/71 (39%), Gaps = 3/71 (4%) Query: 347 NVNTIKICDKAKENFIKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + IC K I + TI+ + L+ C SSP H+N + + F + Sbjct: 527 DDRMADICGATKSKGIVVYTIAFEMGEFDSAADRLENCASSPSQHFNATTLN-ISQAFGS 585 Query: 405 ISQLMVHRKYS 415 I+ + + + Sbjct: 586 IAANVQKLRLT 596 >gi|221135318|ref|ZP_03561621.1| von Willebrand factor, type A [Glaciecola sp. HTCC2999] Length = 342 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 36/230 (15%), Positives = 71/230 (30%), Gaps = 55/230 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + + ++ +K L F+ + Sbjct: 98 DLMVAVDLSGSM----------QTQDMVVNGNEVDRLVMVKTVLGDFIQR-------RVG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ V Q ++ + L+ T A+ A + + Sbjct: 141 DRIGLILFADTAYLQAPLTFDRTTVEQLLSETVIGLVGDSTAIGDAIGLAAKRFSDKPN- 199 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K ++ LTDG+N + + A + IKI I + A Sbjct: 200 -----------------VNKVLVLLTDGQNTAGNITPD--QALSLAVDQNIKIYPIGVGA 240 Query: 372 SPN------GQR------------LLKTCVSSPEYHYNVVNADSLIHVFQ 403 GQR L + + ++ + L +++ Sbjct: 241 DAMTVNSLFGQRQVNPSADLDEGLLTRLAKDTGGQYFRARDTQELEQIYR 290 >gi|332877593|ref|ZP_08445337.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684442|gb|EGJ57295.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 333 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 47/281 (16%), Positives = 84/281 (29%), Gaps = 66/281 (23%) Query: 156 YRLQFIEHLLNQRYNQKIVS-FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 Y L F+ ++ ++ + I + +D+S SM Sbjct: 56 YHLLFVLRMIAIALIVVALARPQTHSENAKTKITDGIDIVMAIDVSASMLS--------- 106 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 QD K + ALK F+ + + +GL+ Y + Sbjct: 107 ------QDLKPNRFEALKKVASQFVK-------DRPNDRIGLVIYAGESYTKTPVTTDKL 153 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + ++ I T + A L K +S + II Sbjct: 154 IILNALSEITYGQIEDGTAIGMGLATAVNRLKESKAKS------------------RVII 195 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI------------NASPN-------- 374 LTDG NN + T + A E IK+ T+ I NA + Sbjct: 196 LLTDGVNNTGFIDPQTAA--ELAAEYGIKVYTVGIGTNGMALSPYALNADGSIIYRMQQV 253 Query: 375 --GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 + L+K + ++ N L ++ I+Q+ + Sbjct: 254 DIDEPLMKKIAQVTKGRYFRATNNQKLQQIYDEINQMETTK 294 >gi|261251589|ref|ZP_05944163.1| hypothetical protein VIA_001610 [Vibrio orientalis CIP 102891] gi|260938462|gb|EEX94450.1| hypothetical protein VIA_001610 [Vibrio orientalis CIP 102891] Length = 396 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 48/308 (15%), Positives = 106/308 (34%), Gaps = 40/308 (12%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP---ALLRIEMGERPIFL 192 ++D + I + S Y L I L +VS A + ++ P Sbjct: 77 KIDKVSGQIGTHSGCNVSLSYSLSTIFTSLTLSDASFVVSSTANEKAYVTEDVASEP-LE 135 Query: 193 IELVVDLSGSMHCAMNSDPE----DVNSAPICQDKKRTKMAALKNALLLFLD--SIDLLS 246 + LV+D+SGSM ++ + S Q+ +K +K +++ F D S++ Sbjct: 136 LILVLDISGSMASDLDDLKAILKRGLASLKEQQNNALSK-DHIKVSIVPFSDGVSVNNAP 194 Query: 247 HVKED--------VYMGL------------IGYT-TRVEKNIEPSWGTEKVRQYVTRDMD 285 + E G I + T V+ W + VT + Sbjct: 195 WLNETGTFCVEGITESGGKFSAAHTVANLDITHDQTPVKTFQPDKWLMDCSAMSVTLPLT 254 Query: 286 SLILKPTDSTPAMKQ-----AYQILT-SDKKRSFFTNFFRQGVKIPSLP-FQKFIIFLTD 338 + + + T++ +++ +YQ L ++ + + + ++ ++ +TD Sbjct: 255 ADLNQVTNAVDSLRTEGGTASYQGLIWGLRQLTPNWQKAWEVGPNRNFDKVERKLVLMTD 314 Query: 339 G-ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 G + + + +CD+AK+ + + + C ++ N Sbjct: 315 GADYGSHFDELINAGLCDRAKDYGVALNFVGFGVYGARLEQFTRCAGDANGVFSASNTQE 374 Query: 398 LIHVFQNI 405 L F + Sbjct: 375 LDSYFSQL 382 >gi|218461471|ref|ZP_03501562.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 459 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 53/148 (35%), Gaps = 30/148 (20%) Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP-SLPFQKFIIFLTDGENNN 343 + T + + L+ + + Q P S K ++F+TDGE N Sbjct: 310 NLTSEGSTRLDAGVVAGWYTLSPK----WQGVWGDQSSPAPVSDSVHKVMVFMTDGEMN- 364 Query: 344 FKSNVNTI----------------------KICDKAKENFIKIVTISINASPNGQRLLKT 381 K + N C K++ I+I T+S +A + ++ Sbjct: 365 TKYDPNDKFDWICSQTQSSACNAFATAARQTACTAMKKSGIEIYTLSYSADADVVN-IRN 423 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLM 409 C ++ + + + ++ V++ I+ + Sbjct: 424 CATNTAHFFTA-SPATIKTVYETIAAAI 450 >gi|7145102|gb|AAA36225.2| MHC serum complement factor B [Homo sapiens] Length = 677 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 74/226 (32%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A K L+ ++ + Sbjct: 174 IVLDPSGSMNIYLVLDGSDSIGASNFTG--------AKKCLVNLIEKLASYGVKPRY--- 222 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 223 GLVTYATXXXIWVKVSEAVSSNADWVTKQLNEINYEDHKLKSGTNTEEALQAVYSMM--- 279 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-------KSNVNTIKICDKAKEN- 360 ++ + II +TDG +N I K Sbjct: 280 -------SWPDDVPPEGWNRTRHVIILMTDGLHNMGGDPITVIDXXXXXXYIGKDRKNPR 332 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 333 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 378 >gi|260433775|ref|ZP_05787746.1| von Willebrand factor type A [Silicibacter lacuscaerulensis ITI-1157] gi|260417603|gb|EEX10862.1| von Willebrand factor type A [Silicibacter lacuscaerulensis ITI-1157] Length = 327 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 75/226 (33%), Gaps = 43/226 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ + L VD+SGSM D + ++ A+K+ L F+ + Sbjct: 90 EKSARDVVLAVDISGSM---------DQRDFKAADGTPKQRLEAVKDVLRAFIAA----- 135 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ M LI + TR + + + ++ + + T A+ + Sbjct: 136 --RDGDRMALIIFGTRAFVQAPFTEDLQSLNGFLEQTAVGMAGPNTALGDAIGLGIRTFE 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + Q+ +I L+DG + + + A + + I T Sbjct: 194 SSEVD------------------QRMMIVLSDGADTSSRMTPVIAA--SIAADKGVVIYT 233 Query: 367 ISINASPN------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 I + LK + + ++ + +L V++ I Sbjct: 234 IGVGDPDATGEDRVDLDALKDIANKTQGQYFFADDEAALTEVYRQI 279 >gi|190894968|ref|YP_001985261.1| hypothetical protein RHECIAT_PC0000634 [Rhizobium etli CIAT 652] gi|190700629|gb|ACE94711.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 444 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 57/401 (14%), Positives = 113/401 (28%), Gaps = 92/401 (22%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 ++ R F+ N +I+ ALS++ L+ +G + + + M+S +AA Sbjct: 6 LNSFGRLVRAFRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAA 65 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 ++A ++ ++ AL + ++ S + Sbjct: 66 LIAAVKQI-------------NNSEDTDALKQKVSDWFHAQVENSYALGEIEID------ 106 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 T +N+ S F+ + + Sbjct: 107 ----------------------TTNHNITATASGTVPTTFM------KIANIDTVPVSVG 138 Query: 181 LRIEMGERPIFLIELVVDLSGSM----------------HCAM-----NSDPEDVNSAPI 219 ++ + +V+D S SM C ++ + Sbjct: 139 SAVKGPATSYLNVYIVIDRSPSMLLAATTSGQSTMYSGIGCQFACHTGDAHTVGKKTYAN 198 Query: 220 CQDKKRTK-----MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 D K +A+ LD ID E + +GL ++ + P+ T Sbjct: 199 NYDYSTEKNIKLRADVAGDAVREVLDMIDESDSNHERIKVGLYSLGDTTKEVLAPTLDTS 258 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 R+ ++ D L S +M Y + G S K ++ Sbjct: 259 NARKRLSDDSYGLT-----SATSMNYTY---FDVALAALQKIVGTGGDGTSSANPLKLVL 310 Query: 335 FLTDGENNN----FKSNVNTIK-------ICDKAKENFIKI 364 LTDG + K++ N K C K + Sbjct: 311 LLTDGVQSQRGWVVKNSSNLKKVAPLNPDWCGYVKNKSATM 351 >gi|258620051|ref|ZP_05715090.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258587409|gb|EEW12119.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 308 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 46/235 (19%), Positives = 78/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 75 DLMLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSEFITK-------REG 117 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 MGLI + + + V + + + LI T + A + Sbjct: 118 DRMGLILFADHAYLQTPLTLDRQTVISQLNQAVLKLIGTQTAIGEGIGLATKTFID---- 173 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 S Q+ +I L+DG N + + AK+ I T+ + Sbjct: 174 --------------SDAPQRVMILLSDGSNTAGVLDPLEAA--NIAKQYQTTIYTVGVGA 217 Query: 370 ---------------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 A ++ L+T S + ++ N L ++ I+QL Sbjct: 218 GEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRARNQQDLQSIYDTINQL 272 >gi|149199796|ref|ZP_01876826.1| hypothetical protein LNTAR_23599 [Lentisphaera araneosa HTCC2155] gi|149137084|gb|EDM25507.1| hypothetical protein LNTAR_23599 [Lentisphaera araneosa HTCC2155] Length = 333 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 41/224 (18%), Positives = 78/224 (34%), Gaps = 43/224 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + T++ ++K L FL +E Sbjct: 94 DLLLAVDLSGSME---------TKDFKNKSGENVTRLDSVKEVLSEFLAE-------REG 137 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + E ++ + + T A+ + + + Sbjct: 138 DRVGLVFFGSAAFIQMPFTEDLEICQELMDEAQVRMAGPQTMLGDAIGLSI-SIFDQSEL 196 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K +I LTDG N+ S V K A++ I I T+++ Sbjct: 197 E-----------------DKVLILLTDG--NDTGSLVAPEKAAQIARDKGIVIHTVAVGD 237 Query: 372 S--PNGQRL----LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 Q L L++ S + +Y N + L ++ I ++ Sbjct: 238 PAAAGEQALDEATLRSISSLTKGKYYWAGNREELAGIYDEIDKI 281 >gi|254506100|ref|ZP_05118244.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus 16] gi|219550918|gb|EED27899.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus 16] Length = 415 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 59/447 (13%), Positives = 127/447 (28%), Gaps = 79/447 (17%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + I + + +++FA+ + + K +E A+ A+LA A+ Sbjct: 2 RKIKKQSGHAALLFAMIIPGLFGIFTLATDGARALQTKARIEDASEIAVLAIAAH----- 56 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + + + + I V Sbjct: 57 NDDNQDSQGAGSGS----------RVNRQIATDYLNAYLRDSTQLTGLKVKKYNCDQIAE 106 Query: 133 ANNRLDSSNNTIF-YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 L F Y ++V + D + + S A + + Sbjct: 107 CRAGLARGEPRFFQYEIEVSSVQDTWFPGNDSIEGFGD---TFSAKGAAVARKYQSE-AV 162 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL--LSHVK 249 I V D SGSM + K L+N + D + + Sbjct: 163 DIIFVSDYSGSMAWNWSGGRNR-------------KYIDLRNIIQEVTDELQKFNDLNNT 209 Query: 250 EDVYMGLIGYTTRVEKNIEPSWG------------------------TEKVRQYVTRDMD 285 ++ +GL + + EK +Y D Sbjct: 210 DNNTVGLTAFNYYTKTVPSNRSNHCFMTQLVNPNGRFSASQTVRNIFVEKNNRYCVNHGD 269 Query: 286 SLILKP---TDSTPAMKQAYQILTSDKKRSFFTNFFR--QGVKIPSLPFQKFIIFLTDGE 340 S + TD+ + + + + + F R Q ++ P ++ +I L+DGE Sbjct: 270 SSRFQDLPLTDNYSSFNNSVRSFYPNHGTASFQGIIRGAQMLRKGRNP-RRLLIVLSDGE 328 Query: 341 NNNFKSNVNTIK--ICDKA------------KENFIKIVTISINASPNGQRLLKTCVSSP 386 + + ++ + +C + ++ + + N R L+ CV + Sbjct: 329 DGDPSRHMQLVNAGMCSTIVNTLSGDLTPDGHKVKARLAVVGFDYDVNKNRALQKCVGAE 388 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHRK 413 + D L + + I++ + H K Sbjct: 389 NVYKAQNRDDILNKILELITEEIGHLK 415 >gi|322378392|ref|ZP_08052846.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] gi|322380073|ref|ZP_08054329.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321147480|gb|EFX42124.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321149148|gb|EFX43594.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] Length = 236 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 75/233 (32%), Gaps = 43/233 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E I L++D S SM MN +T++ L + + +D + + Sbjct: 11 EERFIPIFLLLDTSSSMSTNMNGG--------------QTRIGCLNDCVQTMIDLLKEEA 56 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-----QYVTRDMDSLILKPTDSTPAMKQA 301 + + +I + G V+ + S + ++ M Sbjct: 57 KRENVSKLAVITF------------GAGGVKLQTPLSKIESIQFSPLGTGGNTPLGMALE 104 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 K +F P + +++ ++DGE N+ I +K + Sbjct: 105 LTRDYIQNKDTF-----------PGKFYTPYVVMVSDGEPNDDWQGPLHDFIHNKENRSS 153 Query: 362 IKI-VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + ++ I + + SP Y + SLI+ F+ I+ + + Sbjct: 154 KSVRYSVFIGNEGEEPQAVHDFSGSPNQVYYANDVQSLINCFKAITASVTQGR 206 >gi|148258759|ref|YP_001243344.1| hypothetical protein BBta_7591 [Bradyrhizobium sp. BTAi1] gi|146410932|gb|ABQ39438.1| hypothetical protein BBta_7591 [Bradyrhizobium sp. BTAi1] Length = 449 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 60/481 (12%), Positives = 129/481 (26%), Gaps = 110/481 (22%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M +LS F+ I + N ++ FA+ + + +G + + + ++SA +AA Sbjct: 1 MRMLSSLLARFRSDI---QGNVAVTFAIVCVPLITAVGCGVDYSRANQLRAKLQSAVDAA 57 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 + S+ G ++ A DDA+ ++ Sbjct: 58 SVGAVSRTSPAFIAAGAMT---ADGIITAGNDDARNIFNGNMN----------------G 98 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + S+T T+ ++ V + F+ +++ + + Sbjct: 99 TTGYTLNSVTPEVKKTGSVLTATVSFSASV------PMMFM-NIVGIKTMTL-----QGM 146 Query: 181 LRIEMGERPIFLIELVVDLSGSMH-----------------CAMNSDPEDVNSAPICQDK 223 + L++D S SM S+ + D Sbjct: 147 SKATASMPKYIDFYLLLDNSPSMGVAATPDDVTKMVNATSDAKYGSNRYCAFACHDYNDS 206 Query: 224 KR-----------TKMAALKNALLLFLDSIDLLSHVKEDVYM-----GLIGYTTRVEKNI 267 T++ L++A +D+ M G T + Sbjct: 207 NNFYNLAKSIGVTTRIDVLRSATQQLMDTATQTQTYPNQFRMAIYDFGAASKTIGLRALF 266 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 S + + D+ A K + + G Sbjct: 267 ALSANLSSAKSAAGNIDLMGVYGNNDAYTADK---DTPFTAVFPAVNNEISTPGDGTTGS 323 Query: 328 PFQKFIIFLTDGENNNFKS-------------NVNTIKICDKAKENFIKI---VTISINA 371 P K++ F++DG + + + +C K IKI T + Sbjct: 324 PL-KYLFFVSDGVADESNAACLKPKASGNRCQSPINPALCTTLKNRGIKIAVLYTTYLQL 382 Query: 372 SPNGQRL----------------------LKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N + ++ C SP +++ V + + + Sbjct: 383 PTNSWYMSWIDPFNKGPFGPSPNSEIAQNMQACA-SPGFYFEVSPTQGIADAMNALFKKA 441 Query: 410 V 410 V Sbjct: 442 V 442 >gi|317508725|ref|ZP_07966378.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] gi|316252973|gb|EFV12390.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] Length = 350 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 47/288 (16%), Positives = 91/288 (31%), Gaps = 58/288 (20%) Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 + + + L+ +S L ++ + + LV+D+S SM C Sbjct: 71 SKFLTHVPVAVFLVGLLLLTVALSGPTTLAKVP---KNRATVVLVIDISLSMMCD----- 122 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 D + T++ A + A I + ++ + +GL+ + + I PS Sbjct: 123 ----------DVRPTRVDAARQAA------IKFVDEMEPTLQLGLVTFAGTAQTLIAPSS 166 Query: 272 GTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 E V++ + + + T + + A Q + + K K P Sbjct: 167 DHEVVKRALDDAIRPDKLAARTATGEGIYTALQQIETLK------GILGGASKAPPAR-- 218 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIK---IVTISIN----------------A 371 I+ +DG+ AKE K I +IS Sbjct: 219 --IVLESDGKETVPDDLNAPRGAFTAAKEAKAKEVPIYSISFGTASPIPYVNIQGSRVPV 276 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN----ISQLMVHRKYS 415 + L K S + + D L V+ + I +V ++ S Sbjct: 277 PADDASLQKVAELSGGKFFTASSLDQLTDVYSSLNAEIGYDLVKQESS 324 >gi|332993941|gb|AEF03996.1| von Willebrand factor, type A [Alteromonas sp. SN2] Length = 344 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 73/231 (31%), Gaps = 55/231 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L VDLSGSM ++ ++ K+ L F+ + Sbjct: 90 MMLAVDLSGSM----------KIDDMELNGRQVNRLTMTKSVLYDFIQR-------RVGD 132 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + ++ + V ++ + L+ + T A+ A + ++ + Sbjct: 133 RLGLILFADTAYLQAPLTYDRDTVSTLLSESVIGLVGEQTAIGDAIGLAVKRFDEKEESN 192 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 +I LTDG+N K + A +K+ TI + A Sbjct: 193 N------------------VLILLTDGQNTAGFITPEQAK--ELAVNKKVKVYTIGVGAD 232 Query: 373 -----------------PNGQRLLKTCVSS-PEYHYNVVNADSLIHVFQNI 405 + +L SS ++ + + L ++ + Sbjct: 233 KMLIQSFFGSRQVNPSQELDEDMLSDLASSTGGQYFRARDVNELEAIYAKL 283 >gi|224024929|ref|ZP_03643295.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] gi|224018165|gb|EEF76163.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] Length = 332 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 50/314 (15%), Positives = 89/314 (28%), Gaps = 73/314 (23%) Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 I+ + T + N + + Y F+ ++ +++ Sbjct: 23 IMKRKKTEPTMQVSTTRMYMNAPKSWKL-----YLLHAPFLLRIVAFVMIVLVLARPQTT 77 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + E I L VD+S SM D K ++ A K F++ Sbjct: 78 DNWQNTEIEGIDIMLAVDVSTSMLAE---------------DLKPNRLEAAKQVAAEFIN 122 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-----RQYVTRDMDSLILKPTDST 295 +GL + + + LI+ T Sbjct: 123 ----GRPNDN---IGLTIFAGEAFTQCPLTVDHGVLLNLFQSIKCDIAQKGLIMDGTALG 175 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + A L K +S K II LTDG NN + T + Sbjct: 176 MGLANAVSRLKDSKAKS------------------KVIILLTDGVNNRGDISPLTAA--E 215 Query: 356 KAKENFIKIVTISI----NAS-----------------PNGQRLLKTCVSSPEYHYNVVN 394 AK+ I++ TI + A + Q L + ++ ++ + Sbjct: 216 IAKQFGIRVYTIGVGTNGTAPYPMQTYAGVQYVQVPVEIDEQTLTQIAGTTNGNYFRATS 275 Query: 395 ADSLIHVFQNISQL 408 L V+Q I +L Sbjct: 276 NSKLKEVYQEIDKL 289 >gi|291087243|ref|ZP_06571866.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291076088|gb|EFE13452.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 2012 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 42/237 (17%), Positives = 79/237 (33%), Gaps = 49/237 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM + S D + +AL LF + + S ++ Sbjct: 1057 VMFVIDKSGSMDQSFGSGNSDARR------------EVVNSALELFFNQL---SDGDYNI 1101 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 G ++ R TD++ A+ LTS + Sbjct: 1102 QFGGYKFSD------------SGERVNFNDQGWETEYWETDTSNALSH--LKLTSRETDG 1147 Query: 313 FFTNFFRQGVKIPSLPF-------QKFIIFLTDGE--NNNFKSNVNTIKICDKAKEN--- 360 I +L ++++IFLTDGE N++ + + C A +N Sbjct: 1148 STYPSQTLRSAISALENVELGENGKRYLIFLTDGEPGQNSYSFSEKEAENCYSAIKNLDS 1207 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYH-------YNVVNADSLIHVFQNISQLMV 410 I + A+ + +++ VS+ + +AD L F ++ + Sbjct: 1208 GTTFYAIQV-ANSDSHGFMESMVSNANSVDGVTAQKFVGNSADELNAAFSQMAAEIS 1263 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 74/225 (32%), Gaps = 30/225 (13%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D S SM + D N D+ ++ + +AL + + Sbjct: 1476 DLVFVIDKSLSMDYDI-----DGNEIKWWDDETESRKDIVNDALEEIIP-----DLCSQQ 1525 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + GY W E+ +Q ++ + T+ + A+ A +L Sbjct: 1526 YDIQIAGYQFSGSSTRVLDWSREE-QQVLSGLKIARTSSSTEPSQALADALDML------ 1578 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 +K++IF+TDGE + + I TI ++ Sbjct: 1579 -----KTGSPAHRNQSNVKKYLIFMTDGEPTEPEDW--SYNAVRNHAVPGASIYTIGVSS 1631 Query: 371 -ASPNGQRLLKTCVSSPEYH----YNVVNADSLIHVFQNISQLMV 410 AS N +++ S + + +A + F I ++ Sbjct: 1632 DASTNLMEGIRSTALSNGMYAPATFKGTSAQLIRDAFTQIKDEII 1676 Score = 37.2 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 64/190 (33%), Gaps = 56/190 (29%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL------ 236 I G++ I LV+DLS SM ++ D D +++ LK+ L Sbjct: 586 ITGGDQQGLDIVLVIDLSNSMDDGISEDSSD------------SRLKVLKDTLGYYRTSY 633 Query: 237 ------------LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-----GTEKVRQY 279 F+D + S ++ Y+T + + E W + Sbjct: 634 GSHGRPSTEGKEGFIDELFEQSPNS---RFSIVTYST--DASTELGWTEYGRNGSGQQTI 688 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 D T+ + YQ + K+R +N +IFL+DG Sbjct: 689 KKAIGDLQANGGTNYEAGL---YQAVEVLKERGNSSNIP-------------VVIFLSDG 732 Query: 340 ENNNFKSNVN 349 + + S+V+ Sbjct: 733 KPTYYYSDVD 742 >gi|224967060|ref|NP_038620.2| matrilin-4 precursor [Mus musculus] Length = 624 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 68/170 (40%), Gaps = 24/170 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 + + L +G+I Y+++V+ G R+ + R + +++ + T + Sbjct: 59 LVGLLRSLDVGLNATRVGVIQYSSQVQSVFPL--GAFSRREDMERAIRAVVPLAQGTMTG 116 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + S+ + G + + ++ +TDG + ++ Sbjct: 117 LAIQYAMNVAFSEAE----------GARPSEERVPRVLVIVTDG---RPQD--RVAEVAA 161 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 +A+ I+I + + + G L+T S P ++ + V + LI F Sbjct: 162 QARARGIEIYAVGVQRADVGS--LRTMASPPLDQHVFLVESF-DLIQEFG 208 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 64/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 415 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 472 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + L + + TDG + +++ +AKE Sbjct: 473 HMVEHSFSE----------AQGARPRDLNVPRVGLVFTDG---RSQDDISV--WAARAKE 517 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + L+ S P + + D F ++ L+ + K S+ Sbjct: 518 EGIVMYAVGVG--KAVEEELREIASEPSELHVSYSPD-----FSTMTHLLENLKGSI 567 >gi|157961563|ref|YP_001501597.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157846563|gb|ABV87062.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 328 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 43/229 (18%), Positives = 75/229 (32%), Gaps = 43/229 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ + + VDLSGSM DP+ ++ +K L FL D Sbjct: 89 EQVGREMMIAVDLSGSMEARDFVDPQGEILR---------RVDGVKALLQSFLLKRDSD- 138 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +GLI + + + + Q + + + T A+ A Sbjct: 139 ------RIGLIAFGENAYLQAPFTQDKQILSQLLQQMDVRMAGAGTAIGDAIGVAVNHFE 192 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + K ++ LTDG N+ S + A E + I Sbjct: 193 QSEVEN------------------KVLLLLTDG--NDTSSEFPPLDAAHYAGEQGVVIYP 232 Query: 367 ISINASPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I+I N G+ L + + + + SLI V++ + QL Sbjct: 233 IAIGDPKNVGEDSLDIATLERIADLTQGRVFEADDGQSLIEVYKVLEQL 281 >gi|14548116|sp|O89029|MATN4_MOUSE RecName: Full=Matrilin-4; Short=MAT-4; Flags: Precursor gi|3766288|emb|CAA06889.1| matrilin-4 precursor [Mus musculus] gi|22477196|gb|AAH36558.1| Matrilin 4 [Mus musculus] gi|148674433|gb|EDL06380.1| matrilin 4, isoform CRA_a [Mus musculus] gi|148674434|gb|EDL06381.1| matrilin 4, isoform CRA_a [Mus musculus] Length = 624 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 68/170 (40%), Gaps = 24/170 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 + + L +G+I Y+++V+ G R+ + R + +++ + T + Sbjct: 59 LVGLLRSLDVGLNATRVGVIQYSSQVQSVFPL--GAFSRREDMERAIRAVVPLAQGTMTG 116 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + S+ + G + + ++ +TDG + ++ Sbjct: 117 LAIQYAMNVAFSEAE----------GARPSEERVPRVLVIVTDG---RPQD--RVAEVAA 161 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 +A+ I+I + + + G L+T S P ++ + V + LI F Sbjct: 162 QARARGIEIYAVGVQRADVGS--LRTMASPPLDQHVFLVESF-DLIQEFG 208 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 64/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 415 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 472 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + L + + TDG + +++ +AKE Sbjct: 473 HMVEHSFSE----------AQGARPRDLNVPRVGLVFTDG---RSQDDISV--WAARAKE 517 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + L+ S P + + D F ++ L+ + K S+ Sbjct: 518 EGIVMYAVGVG--KAVEEELREIASEPSELHVSYSPD-----FSTMTHLLENLKGSI 567 >gi|327270786|ref|XP_003220169.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 975 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 44/234 (18%), Positives = 85/234 (36%), Gaps = 55/234 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D+SGSM ++A LK A LFL I ++ ++ Sbjct: 306 LVLDVSGSM-------------------TTYDRIARLKQAAELFLLQI-----IETGSWV 341 Query: 255 GLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + + I T VR+ + + + T ++Q +Q + Sbjct: 342 GIVTFNSYATRQIGLRQITSDSVRESLKNYLPTSAGGGTIICSGVRQGFQ------VKQK 395 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 +G + I+ LTDGE+N+ S C + + + I TI++ Sbjct: 396 KYQTSTKGCE---------IVLLTDGEDNSVSS-------CFAEVQSSGSIIHTIAL-GP 438 Query: 373 PNGQRLLKTCVSSPEYHYNVV---NADSLIHVFQNISQ---LMVHRKYSVILKG 420 + L + ++ +++ LI F IS + + + KG Sbjct: 439 NAAKELEMLADMTGGLKFSATDSLDSNGLIDAFSRISSESGDISQQSIQLESKG 492 >gi|262191198|ref|ZP_06049398.1| protein BatA [Vibrio cholerae CT 5369-93] gi|262032938|gb|EEY51476.1| protein BatA [Vibrio cholerae CT 5369-93] Length = 477 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 78/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 244 DLMLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSEFIAK-------REG 286 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + + LI T + A + Sbjct: 287 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFID---- 342 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S Q+ +I L+DG N + + AK+ I T+ + A Sbjct: 343 --------------SDAPQRVMILLSDGSNTAGVLDPLEAA--NIAKQYHTTIYTVGVGA 386 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L ++ +++ N L +++ I+QL Sbjct: 387 GEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQL 441 >gi|229522840|ref|ZP_04412254.1| protein BatA [Vibrio cholerae TM 11079-80] gi|229340057|gb|EEO05065.1| protein BatA [Vibrio cholerae TM 11079-80] Length = 318 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 78/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 85 DLMLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSEFIAK-------REG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + + LI T + A + Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFID---- 183 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S Q+ +I L+DG N + + AK+ I T+ + A Sbjct: 184 --------------SDAPQRVMILLSDGSNTAGVLDPLEAA--NIAKQYHTTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L ++ +++ N L +++ I+QL Sbjct: 228 GEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQL 282 >gi|153214389|ref|ZP_01949360.1| conserved hypothetical protein [Vibrio cholerae 1587] gi|124115338|gb|EAY34158.1| conserved hypothetical protein [Vibrio cholerae 1587] Length = 318 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 78/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 85 DLMLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSEFIAK-------REG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + + LI T + A + Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFID---- 183 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S Q+ +I L+DG N + + AK+ I T+ + A Sbjct: 184 --------------SDAPQRVMILLSDGSNTAGVLDPLEAA--NIAKQYHTTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L ++ +++ N L +++ I+QL Sbjct: 228 GEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQL 282 >gi|323495646|ref|ZP_08100717.1| membrane associated secretion system protein [Vibrio sinaloensis DSM 21326] gi|323319281|gb|EGA72221.1| membrane associated secretion system protein [Vibrio sinaloensis DSM 21326] Length = 419 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 56/437 (12%), Positives = 123/437 (28%), Gaps = 55/437 (12%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 K + + + +I+FA+ + L K +E A+ A+LA A+ N Sbjct: 2 KALRKQSGHAAILFAMIIPGLFGLFTLASDGARAIQTKARIEDASEIAVLAIAA---HND 58 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + R + D + + + + +I V Sbjct: 59 DNKNSQGSGSGSAVNRKIATDYLEAYLHDVDSVNNLKIHKYNCDQIPECVAGLARGEPRF 118 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 +++++ + + D + + ++ K + Sbjct: 119 FQYEVEATSRHVSW-----FPGDSSIPGF----GKTFDAKGAATARKYQSE------AVD 163 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT------------KMAALKNALLLFLD 240 I V D SGSM N K T + + Sbjct: 164 ILFVADYSGSMAGGWNGGSNRKYIDLRNIIKVVTDELQKFNDLNNTDNNTVGMTGFNYYT 223 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT--EKVRQYVTRDMDSLILKP---TDST 295 + L+ + EK +Y DS + TD+ Sbjct: 224 KTKPTNRSNSCFMTQLVYNNNYNINYTKTVNNIFNEKNNKYCVSHSDSSRFRDIDLTDNY 283 Query: 296 PAMKQAYQILTSDKKRSFFTNFFR--QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + + + + R Q +K + P ++ +I L+DG+++ ++ Sbjct: 284 SSFNTTVNGFYPNHGTASYQGIMRGAQMLKKGTNP-RRLLIVLSDGDDSGTSQKNIHKQL 342 Query: 354 -----CDKAKEN------------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 C K K+ ++ + + + N L+ C + D Sbjct: 343 VNAGMCTKIKQELSTGISSSGQSIKARLAVVGFDYNVNNNTALRDCAGAENVFKAQNTDD 402 Query: 397 SLIHVFQNISQLMVHRK 413 L + + I++ + H K Sbjct: 403 ILNKILELITEEIGHLK 419 >gi|325279872|ref|YP_004252414.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] gi|324311681|gb|ADY32234.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] Length = 330 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 44/273 (16%), Positives = 80/273 (29%), Gaps = 63/273 (23%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + F+ LL ++ + + I L +D+S SM Sbjct: 57 HVLFVLELLAIILLVTALARPQSSNSWQTYTSEGIDIALALDVSTSMLA----------- 105 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 +D ++ A K F+ + +GL+ + + + Sbjct: 106 ----RDFTPDRLEAAKEVATKFI-------LERPQDKIGLVVFAGESFTQCPLTTDQAVL 154 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + +I T + A L +S K +I L Sbjct: 155 VNLLREVKSGMIQDGTAIGLGLANAVNRLKDSPGKS------------------KVVILL 196 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS-----PNGQRL--------- 378 TDG NN T + AK I++ TI + A P G +L Sbjct: 197 TDGINNQGAIAPVTAA--ELAKAFGIRVYTIGVGTYGEAPYPVPTPFGVQLQNMPVEIDE 254 Query: 379 --LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 L+ + + ++ + D L ++ I QL Sbjct: 255 GVLQQIANVTGGKYFRATDNDKLQQIYSEIDQL 287 >gi|224372482|ref|YP_002606854.1| von Willebrand factor, type A [Nautilia profundicola AmH] gi|223588580|gb|ACM92316.1| von Willebrand factor, type A [Nautilia profundicola AmH] Length = 288 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 47/272 (17%), Positives = 90/272 (33%), Gaps = 48/272 (17%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 + IF N + R +E + + + S + + + ++ + I + Sbjct: 23 SSEDDRIIFPNAHIFQKSKNRFSLLEFFIIFFLSVALASPVKSKIITNTHKKG-YNIVID 81 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 +D SGSM K+ A K S+D K D +GL Sbjct: 82 LDTSGSMAEF-------------------NKIDAAKAV------SLDFAKKRKNDA-LGL 115 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 + + ++ + + R S+ T Y L N Sbjct: 116 VVFGNIAYIASPLTFDKKTFEDILKRIYVSIAGGKTAI-------YDALFLSSNLFKNAN 168 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 +K II LTDG +N + ++ + K K+ IK+ +I+I + Sbjct: 169 G------------EKIIILLTDGMDNMSITPLDVVI--KKLKKEHIKVYSIAIGGDADLS 214 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 L K + Y + + L ++ +I++L Sbjct: 215 VLKKISKETNGKFYIASSLEDLKKIYSDINKL 246 >gi|254513911|ref|ZP_05125972.1| von Willebrand factor type A domain protein [gamma proteobacterium NOR5-3] gi|219676154|gb|EED32519.1| von Willebrand factor type A domain protein [gamma proteobacterium NOR5-3] Length = 330 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 73/229 (31%), Gaps = 43/229 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ + L +DLSGSM D +++ +++A K L F Sbjct: 90 EKSARDLMLALDLSGSM---------DARDFRDQNGQQQNRLSAAKQVLKDFAAQ----- 135 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + + + ++ T A+ A L Sbjct: 136 --RDGDRLGLIVFGNAAYLQAPFTDDIATWLTLLEESEVAMAGPSTALGDAIGLAI-SLF 192 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + ++ LTDG N+ S V + A N + I T Sbjct: 193 QASETRN-----------------RVLVVLTDG--NDTGSRVPPVDAASIAAANDVTIYT 233 Query: 367 ISINASPN------GQRLLKTCVSSP-EYHYNVVNADSLIHVFQNISQL 408 +++ L SS + ++ +L ++ I++L Sbjct: 234 VAVGDPSTIGEEALDLETLDAIASSTRGASFLALDTLALKDAYEQINRL 282 >gi|255535987|ref|YP_003096358.1| aerotolerance operon BatA [Flavobacteriaceae bacterium 3519-10] gi|255342183|gb|ACU08296.1| BatA (Bacteroides aerotolerance operon) [Flavobacteriaceae bacterium 3519-10] Length = 334 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 46/276 (16%), Positives = 86/276 (31%), Gaps = 68/276 (24%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 L+ +++L + + + I + VD+S SM Sbjct: 59 LKISKYILLSALIIAMARPRTFTISENNDDTKGIDIMMSVDVSLSMLAR----------- 107 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 D + ++ ALKN F+D + +GL+ Y+ + + + Sbjct: 108 ----DLEPDRLTALKNIAKKFVDK-------RPGDRIGLVTYSGEAFTKVPVTSDHAVLL 156 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + T + A L K +S K II +T Sbjct: 157 EELENLNPLELQPGTAIGEGLSVAVSHLRHSKAKS------------------KIIILMT 198 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL------------------- 378 DG N + + AK N I++ +I I NG L Sbjct: 199 DGVNTIENAMPAQVGA-QLAKSNDIRVYSIGIG--TNGYALMPTQTDIFGDLVFTEVEVK 255 Query: 379 -----LKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 L+ ++ ++ + SL V++ I+QL Sbjct: 256 IDEPVLREIAQTTGGKYFRATSNQSLEEVYEEINQL 291 >gi|296228120|ref|XP_002759672.1| PREDICTED: collagen alpha-5(VI) chain [Callithrix jacchus] Length = 2614 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 51/356 (14%), Positives = 115/356 (32%), Gaps = 58/356 (16%) Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIV 122 + D + +S K L+ A F+ + S + ++++ Sbjct: 689 QEISDAIDRMSLIDKGTLMGKALNFVDQYFTHSKGARFGAKKFLILITDGVAQDDVRDPA 748 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 R + + + ++ + + S + ++ +HL Q +K+V + AL Sbjct: 749 RILRGKDVTIFSVGVYGADRSQLEEISGDGSLVFYVENFDHL--QALEKKLVFRVCALHD 806 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + ++ + V+D SGS+ + L + + Sbjct: 807 CKRIKQ--LDVVFVLDHSGSIAEQSQDHM-----------------------INLTMHLV 841 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV--TRDMDSLILKPTDSTPAMKQ 300 ++ V G + Y+ E T R + M T + A+K Sbjct: 842 KKADVGRDRVQFGALKYSDNPEILFYL--NTYSNRSAIIENLRMRRDTGGNTYTAKALKH 899 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A L ++ + + ++ +I +TDGE+++ +T K ++ Sbjct: 900 A-NALFTE-----------EHGSRINQNVKQMLIVITDGESDDRVELNDTAA---KLRDK 944 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I I + + Q+ L+ + V N D L ++ + + M V Sbjct: 945 GITIFAVGVG--KADQKELEGMAGNKNNTIYVDNFDKLKDIYAPVQESMCTESPEV 998 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 57/339 (16%), Positives = 106/339 (31%), Gaps = 58/339 (17%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 SN L +I +A FI IK + + I S+ S+ Sbjct: 500 SNDIDLRKAILNIQQLTGDTHTGEALDFILPRIKNGIKERMSQVPCYLIVLTDGKSQYSV 559 Query: 130 THMANN--------RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 A + + + R+ F ++ + + V + + Sbjct: 560 VEPAKRVRAEQITVHAIGIGEANKKELQEIAGKEERVSFGQNFDALKSIKNEV--VHGIC 617 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + E I +VD SGS+ ++ KM K + L Sbjct: 618 TEKGCEDMKADIMFLVDSSGSIG-----------------NENFGKM---KIFMKNLLTK 657 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMK 299 I + + +G++ ++ E+ + T +Q ++ +D SLI K T A+ Sbjct: 658 IQI---GADKTQIGVVQFSDDPEEEFQL--NTYFTQQEISDAIDRMSLIDKGTLMGKALN 712 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 Q T K F +KF+I +TDG + + I + Sbjct: 713 FVDQYFTHSKGARFGA--------------KKFLILITDGVAQDDVRDPARI-----LRG 753 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + I ++ + + Q L+ + V N D L Sbjct: 754 KDVTIFSVGVYGADRSQ--LEEISGDGSLVFYVENFDHL 790 >gi|294055316|ref|YP_003548974.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293614649|gb|ADE54804.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 730 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 35/184 (19%), Positives = 63/184 (34%), Gaps = 43/184 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ERP + ++D+SGSM + K+ LK AL+L +D Sbjct: 362 ERPASNLVFLLDVSGSMS-------------------QPNKLPLLKEALMLLTRRLDSRD 402 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V VY G G + + T + +T+ T++ ++ AYQ Sbjct: 403 RVAIVVYAGASG----LVLPSTTANNTATIEHALTQLQAG---GSTNAGAGIELAYQ--- 452 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIV 365 R F +I TDG+ N ++N + +I ++ + + Sbjct: 453 --VAREHFIEDGNNR-----------VILCTDGDFNVGQTNRGDLAQIVADQAKDGVSLT 499 Query: 366 TISI 369 + Sbjct: 500 VLGF 503 >gi|297482040|ref|XP_002692521.1| PREDICTED: matrilin 4 [Bos taurus] gi|296480952|gb|DAA23067.1| matrilin 4 [Bos taurus] Length = 584 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 65/171 (38%), Gaps = 24/171 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 + + L +G+I Y+++V+ R+ + R + +++ + T + Sbjct: 59 LVGLLRSLDVGPNATRVGVIQYSSQVQSVFPLR--AFSRREDMERAIRAVVPLAQGTMTG 116 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + S + G + P + + +TDG + ++ Sbjct: 117 LAIQYAMNVAFSVAE----------GARPPEAHVPRVAVIVTDG---RPQD--RVAEVAA 161 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQN 404 +A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 162 QARARGIEIYAVGVQRADVGS--LRAMASPPLNEHVFLVESF-DLIQEFGR 209 Score = 43.3 bits (100), Expect = 0.081, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 374 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 431 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + N++ +AKE Sbjct: 432 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDNISV--WAARAKE 476 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + L+ S P + + D F ++ L+ + K S+ Sbjct: 477 EGIVMYAVGVG--KAVEEELREIASEPAELHVSYSPD-----FSTMTHLLENLKGSI 526 >gi|300796915|ref|NP_001178240.1| matrilin-4 [Bos taurus] Length = 584 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 65/171 (38%), Gaps = 24/171 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 + + L +G+I Y+++V+ R+ + R + +++ + T + Sbjct: 59 LVGLLRSLDVGPNATRVGVIQYSSQVQSVFPLR--AFSRREDMERAIRAVVPLAQGTMTG 116 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + S + G + P + + +TDG + ++ Sbjct: 117 LAIQYAMNVAFSVAE----------GARPPEAHVPRVAVIVTDG---RPQD--RVAEVAA 161 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQN 404 +A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 162 QARARGIEIYAVGVQRADVGS--LRAMASPPLNEHVFLVESF-DLIQEFGR 209 Score = 43.3 bits (100), Expect = 0.081, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 374 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 431 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + N++ +AKE Sbjct: 432 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDNISV--WAARAKE 476 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + L+ S P + + D F ++ L+ + K S+ Sbjct: 477 EGIVMYAVGVG--KAVEEELREIASEPAELHVSYSPD-----FSTMTHLLENLKGSI 526 >gi|119383876|ref|YP_914932.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119373643|gb|ABL69236.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 282 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 63/178 (35%), Gaps = 37/178 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +DLSGSM + +++ A+K F+ + + Sbjct: 82 DIVLAIDLSGSM----------QKEDFQLDGQPISRLDAVKRTASRFVAA-------RRG 124 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + R ++ + V + V + + T + + A + L + + Sbjct: 125 DRIGLVIFGDRAYFAQPLTFDVDAVARAVDEAQIGISGRATAISDGLGLAMKRLAASEAP 184 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + ++ ++DG + NV + A + I+I TI++ Sbjct: 185 T------------------RVVVLMSDG--VDTSGNVQAVDAARLAAGHGIRIHTIAL 222 >gi|146338996|ref|YP_001204044.1| hypothetical protein BRADO1945 [Bradyrhizobium sp. ORS278] gi|146191802|emb|CAL75807.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 432 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 77/454 (16%), Positives = 134/454 (29%), Gaps = 90/454 (19%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 + LLSRFR ++ N ++IFA++++ L IG I K + ++ +AA Sbjct: 8 LSLLSRFR-------RNDSGNIAVIFAIALLPILAFIGSAIDYSMAVRAKAKLSASIDAA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +LA S+ AK A A + + + Sbjct: 61 LLAATGYTAMR---------GSSSDAKTA----ATNMFNGQM-------------SAHKL 94 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI-------EHLLNQRYNQKI 173 NS I +T + R + + T+ M + Y + + Sbjct: 95 TSNSLSIDITDSVSARTVTGSATVVVKTSFMYMFGYPTMTVSASSSASASFPTYMDFYVL 154 Query: 174 VSFIPALLRIEMGERPIFLIELVVD-LSGSMHCAMNSDPEDVNSAPICQDKKRT-----K 227 V P+ L D + + H S + DK + + Sbjct: 155 VDNSPSQGLGATTADMTTLQNATTDTCAFACHDTYTSSSKKTLQTNSYYDKAKKLGVTMR 214 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++A D+ V M + S G V + Sbjct: 215 IDVVRSATQSLTDTATSSQIVSNQYRMAVYS-----MGADCGSLGLTTVASLSSSMSSVK 269 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP------SLPFQKFIIFLTDGEN 341 + M Y +D F IP S QK++ F++DG Sbjct: 270 SSVG--ALDLMTIPYSGYNNDMCTDFDGTMSAMSGVIPTQGDGSSTNPQKWLFFVSDGVA 327 Query: 342 NN---------FKSNVN-----TIKICDKAKENFIKI---VTISINASPNGQ-------- 376 + +S T C K IKI T + + NG Sbjct: 328 DYAYPTTCSKTTQSGGRCVEPLTTTTCTALKARGIKIAVLYTTYLAITSNGYYNTWVKPW 387 Query: 377 -----RLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 ++K+C SP Y+Y V ++ S+ + Sbjct: 388 RDSIGTIMKSCA-SPGYYYEVDSSGSIGSALTAL 420 >gi|323493530|ref|ZP_08098652.1| hypothetical protein VIBR0546_14455 [Vibrio brasiliensis LMG 20546] gi|323312353|gb|EGA65495.1| hypothetical protein VIBR0546_14455 [Vibrio brasiliensis LMG 20546] Length = 322 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 70/232 (30%), Gaps = 55/232 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM + QD ++ A+K + F +E Sbjct: 86 DLMLVVDLSYSMS----------QKDMLSQDDYIDRLTAVKKVVSDFAQQ-------REG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + V + LI T + A + Sbjct: 129 DRLGLVLFADHAYLQTPLTLDRNTIAKQVESLVLRLIGDKTAIGEGIGLATKTFID---- 184 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 S Q+ +I L+DG N + + AK+ I TI + Sbjct: 185 --------------SDAPQRVMILLSDGSNTSGVLDPIEAA--KIAKKYNATIYTIGVGA 228 Query: 370 ---------------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 A + L+ + ++ NAD L ++ I Sbjct: 229 GEMMVKEFFMTRKVNTAKDLDENTLQEIADLTGGQYFRARNADELATIYDTI 280 >gi|315185579|gb|EFU19348.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 459 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 49/241 (20%), Positives = 82/241 (34%), Gaps = 39/241 (16%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 LR + LV+D SGSM A++ P + D R ++ K A+ FL Sbjct: 78 SLRRGVNREEGISFLLVLDASGSMWDALDGTPTE--------DPDRMRITHAKRAIREFL 129 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + E +GL + V + + P+ + Sbjct: 130 ------PLLSERDRVGLAVFNRTYRMIQPIVDDPALVLEKLDAIE----------RPSRE 173 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN---NNFKSNVNTIKICDK 356 QAY L + + + ++ ++ L+DGEN + KS D Sbjct: 174 QAYTELYRSMEEALTSFEEEGR--------RRVLVVLSDGENFPVDPEKSPATPGTAVDL 225 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 A I I + RL+ S + ++ NA L V+ I Q V ++Y+ Sbjct: 226 AHRYGITCYVIHFGTEKD--RLIGDLASETGGRVFDARNALELASVYTAI-QEQVLQEYA 282 Query: 416 V 416 V Sbjct: 283 V 283 >gi|227820127|ref|YP_002824098.1| transmembrane protein [Sinorhizobium fredii NGR234] gi|227339126|gb|ACP23345.1| putative transmembrane protein [Sinorhizobium fredii NGR234] Length = 451 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 53/393 (13%), Positives = 113/393 (28%), Gaps = 95/393 (24%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 ++ N ++ A+ ++ +L +G + + ++ M+S +AA++A ++ Sbjct: 19 MVSDRGGNVALTVAICIIPMILAVGAGLDYTRAYNVQSRMQSDLDAALVAAIKEI----- 73 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 D E I IK+ S S+ +T + Sbjct: 74 DEYDEDE-----------------IAEKIKDWFDAQSEK----------QSATYDLTEIT 106 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 ++ + + S + + + + IE + Sbjct: 107 VDKS-------GHTITASASGTVPTTLMTLADIKTV------PVGVISAIEGPATSYLEV 153 Query: 194 ELVVDLSGSM--------HCAMNSDPEDVNSAPICQDKKRTK------------------ 227 +V+D S SM + +D K K Sbjct: 154 YIVIDKSPSMLLAATSEDQAMLRADANITCEFACHDTKDPVKKNGTVIASTYYNYIKSLG 213 Query: 228 ----MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 +A+ LD +D + +GL + + +EP++ T R+ ++ D Sbjct: 214 VKLRTDVALDAVEEVLDMVDAADEDHARIKVGLYSLGETISEVLEPTYSTSTARKKLSDD 273 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 L + S + A + L + + K ++ LTDG +N Sbjct: 274 SSGLTSATSMSATYFQTALKALKKKVGTAGDGTSAASPL--------KLVLLLTDGVQSN 325 Query: 344 FKSNVNTI------------KICDKAKENFIKI 364 + CD K+N + Sbjct: 326 RDWVIKWSGKYWGRVTPLNPDWCDYLKDNDATM 358 >gi|209809179|ref|YP_002264717.1| membrane associated secretion system protein [Aliivibrio salmonicida LFI1238] gi|208010741|emb|CAQ81132.1| membrane associated secretion system protein [Aliivibrio salmonicida LFI1238] Length = 422 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 66/452 (14%), Positives = 132/452 (29%), Gaps = 82/452 (18%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 K +K + +I+FA+ + + + K +E A A LA ++ N Sbjct: 2 KLRRHQKGHAAILFAMMIPALFGIFTLASDGARAIQTKARIEDAAEVATLAVSAHNDPNQ 61 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 G S +N +I +I + S Y + I Sbjct: 62 DYGGGGSPSSANQQ------IVTDYINAYISDVDSINEIKVYKRNCEEIPECKAGLAVGE 115 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP-IF 191 + + V TS + ++ SF + + + Sbjct: 116 P--------RYFEHEVGVTTSQKSWFPGNDAIVGMGD-----SFSTSGHSLARKYQSEAV 162 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + D SGSM +KK + + +++ L + L H D Sbjct: 163 DVMFAADFSGSMGDRWTGG-----------NKKYEDLIDIIDSISKELQKFNDLEHNDND 211 Query: 252 VYMGLIGYTTRVE-KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 MG+ Y + S G Y+++ + + + + D Sbjct: 212 NTMGITAYNEYTYSQYSGSSGGWWGDDCYLSQAESDGFWGGVSISKTIDGLWNEKSKDHC 271 Query: 311 RSFFTNFFRQGVKIPS----------------------------------LPFQKFIIFL 336 + + + + + S ++ +I L Sbjct: 272 NNSYNSGRFNDIPLTSNFDVVNQDVSRFWPEGGTSSYQALIRGAQLLTYGTNSRRLLIVL 331 Query: 337 TDG---ENNNFKSNVNTIKICDKAKEN------------FIKIVTISINASPNGQRLLKT 381 +DG +NN S VN +C ++ ++ I + P+ + LK Sbjct: 332 SDGMDTDNNLTSSLVN-AGMCRDIQQGLESDKTLDNRPIRAQMAVIGFDYEPSENQALKD 390 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 CV + + + D L + + IS+ + H K Sbjct: 391 CVGAENVYKAENSDDILNTILELISEEIGHLK 422 >gi|291398577|ref|XP_002715569.1| PREDICTED: Epithelial chloride channel protein-like [Oryctolagus cuniculus] Length = 958 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 43/240 (17%), Positives = 82/240 (34%), Gaps = 67/240 (27%) Query: 178 PALLRIEMGERPIFL--------IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 ++ PIF + LV+D SGSM + + +M Sbjct: 341 SSMAGTNPPPHPIFSLLRAKQRVVCLVLDKSGSMDS----------------EDRLLRMN 384 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-----QYVTRDM 284 A L + I+ S V G++ + E + K+ Q +T ++ Sbjct: 385 QA--AALYLIQIIERESLV------GMVTF----ESTAKIQNNLTKITDDDTYQKITANL 432 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T +K +Q +T + + + I+ LTDGE+N Sbjct: 433 PQVAGGGTSICSGLKAGFQAITYSNQNTSGSE----------------IVLLTDGEDNG- 475 Query: 345 KSNVNTIKIC-DKAKENFIKIVTISINASPNGQRL-LKTCVSSPEYHYNVVNADSLIHVF 402 I C ++ K++ I TI++ + L + + ++ Y + + LI F Sbjct: 476 ------IHSCFEEVKQSGAIIHTIAL-GPSAAKELEILSSMTGGYRFYANKDINGLIDAF 528 >gi|195614282|gb|ACG28971.1| retrotransposon protein [Zea mays] Length = 650 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 66/174 (37%), Gaps = 35/174 (20%) Query: 180 LLRIEMGERPI----FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 L+R+E RP + V+D+SGSM D +AP + + +++ LK A Sbjct: 63 LVRVEAPARPEARIPIDVVAVLDVSGSM--------NDPAAAPTERTRTTSRLDLLKTAA 114 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEK--NIEPSWGTEKVRQYVTRDMDSLIL-KPT 292 + ++ + ++ ++ R + + + T R+ R +D L T Sbjct: 115 KFMVAKLEDGD------RLSIVAFSDRPVRELSSGLLYMTADGRRNAIRSLDQLEARGGT 168 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 PA ++A ++L + FI+ LTDG + S Sbjct: 169 ALVPAFEEAVKVLDGRQGDGGDRLG--------------FIVLLTDGAEDASGS 208 >gi|255039218|ref|YP_003089839.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254951974|gb|ACT96674.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 320 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 66/179 (36%), Gaps = 42/179 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I +VVDLS SM A D +++ +K L F++ + Sbjct: 79 DIFMVVDLSKSMDAA---------------DVTPSRLEKVKFELNRFIE-------NERA 116 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKK 310 +G+I ++ ++ ++ + ++ D L T+ A++ AY L + Sbjct: 117 NRIGIIIFSNDAYIHVPLTYDAAALELFIQSLQTDLLPTNGTNVCGAIEMAYNKLMNSAD 176 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 K ++ TDGEN++ +N + + + I + ++++ Sbjct: 177 ---------------PTSRAKMMVLFTDGENSSSCTN----ALFNNLRRFGIGVYSVAV 216 >gi|224052500|ref|XP_002194907.1| PREDICTED: similar to anthrax toxin receptor 1 [Taeniopygia guttata] Length = 537 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 66/206 (32%), Gaps = 43/206 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 F + V+D SGS+ T++ + +L +S Sbjct: 23 HGAFDLYFVLDKSGSV------------------KNHWTEIYSFVESLAE-----KFISP 59 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + M I +++R ++ + E +R+ + + L T K+A Sbjct: 60 M---LRMSFIVFSSRGTTIMKLTENREAIRRGLDTLKEELPGGDTFMHEGFKRA------ 110 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 N + II LTDGE + + + ++A+ + + Sbjct: 111 --------NEQIYHETYGGVRTASVIIALTDGELQDAQF-YYAEQEANRARSFGAIVYCV 161 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVV 393 + Q L T S ++ + V Sbjct: 162 GVKDFNETQ--LSTIADSIDHVFPVK 185 >gi|320160918|ref|YP_004174142.1| hypothetical protein ANT_15140 [Anaerolinea thermophila UNI-1] gi|319994771|dbj|BAJ63542.1| hypothetical protein ANT_15140 [Anaerolinea thermophila UNI-1] Length = 486 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 43/220 (19%), Positives = 77/220 (35%), Gaps = 46/220 (20%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + E P F I LV+D S SMH A +M +K++ L L Sbjct: 110 NVKEAPPFHICLVLDRSTSMHGA--------------------RMDMVKSSALNLLKQFR 149 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + ++ ++ R E I P+ + + R + T+ ++ + Sbjct: 150 KQDLIS------VVAFSDRAEVVIPPT-RVPDLAKDDHRISMLQVGGGTEIYQGLQLGIE 202 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L S F + +I LTDG + + I++ ++A ++ I+ Sbjct: 203 QLRSI-----------------DPRFMRQLILLTDGHT--YGDDEACIELAEEAAQDGIQ 243 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 I T+ I N + L K S V + L F+ Sbjct: 244 INTMGIGHEWNDELLDKIATISGANSIFVTSPKDLNKFFE 283 >gi|320106407|ref|YP_004181997.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924928|gb|ADV82003.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 305 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 31/231 (13%), Positives = 69/231 (29%), Gaps = 56/231 (24%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + + +D S S+ ++ A K + L D + Sbjct: 77 LSMVMAIDTSESVITQFQTER-----------------DAAKRFVKQMLREQDEMD---- 115 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 LI ++ V++ + + ++ + T A+ A Q LT K+ Sbjct: 116 -----LISFSDTVDEIVPFTNDAGRMNAGIGNLHKG---DATSLYDAIYLASQRLTEAKR 167 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 +K ++ +TDG N + + + A+ I I I Sbjct: 168 ---------------DATRRKILVIVTDGGN--TTKGMRYQQAVEAAERAGAAIYPI-IM 209 Query: 371 ASP---------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 L++ + ++ V++ L F ++S + + Sbjct: 210 VPIEADAGRNTGGEHALIQMAQDTGGKYFYVLDKHDLDKAFAHLSDDLRTQ 260 >gi|226943994|ref|YP_002799067.1| von Willebrand factor, type A (VWA) domain-containing protein [Azotobacter vinelandii DJ] gi|226718921|gb|ACO78092.1| von Willebrand factor, type A (VWA) domain protein [Azotobacter vinelandii DJ] Length = 335 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 80/234 (34%), Gaps = 54/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM + Q + ++ +K+ L F++ + Sbjct: 91 DLLLAVDVSGSME----------YADMHWQGESIGRLELVKHLLGQFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ + L + Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTWLEEAAIGIAGKDTAIGDAIGLGLKRLRQRPAQ 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + +I +TDG N + + A ++I TI I A Sbjct: 194 S------------------RVLILVTDGANTAGEIAPSVAARLAAA--EGVRIHTIGIGA 233 Query: 372 SPN------------GQRL----LKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 P G L L+ + ++ +++ L + + +++L Sbjct: 234 DPRQDGPPGLLGLTPGLDLDEPTLRAIAEETGGSYFRARSSEELRAIEETLARL 287 >gi|269965331|ref|ZP_06179451.1| hypothetical protein VMC_08810 [Vibrio alginolyticus 40B] gi|269829977|gb|EEZ84206.1| hypothetical protein VMC_08810 [Vibrio alginolyticus 40B] Length = 334 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 77/235 (32%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM + ++ A+K L F+ ++ Sbjct: 98 DLMLVVDLSGSM----------QQEDMELNGEYIDRLTAVKQVLSDFVAK-------RKG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + + V Q + + + L+ + T + Sbjct: 141 DRLGVVLFGDHAYLQTPLTADRKSVMQQINQTVIGLVGQRTAIGDGI------------- 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 + S Q+ +I L+DG N + ++ + AK+ I T+ + Sbjct: 188 -----GLGTKTFVDSDAPQRVMILLSDGSNTAGV--LEPLEAAEIAKKYNATIYTVGVGA 240 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 A + Q L K + ++ + + L ++ I+QL Sbjct: 241 GEMMVKEFFMTRKVNTAADLDEQTLTKVAEVTGGQYFRARDTEELEKIYDTINQL 295 >gi|310817054|ref|YP_003965018.1| hypothetical protein EIO_2641 [Ketogulonicigenium vulgare Y25] gi|308755789|gb|ADO43718.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25] Length = 733 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/185 (18%), Positives = 58/185 (31%), Gaps = 45/185 (24%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 E P + V+D SGSM A+ ++ KNA L +++L Sbjct: 260 PREAPEVTMVFVLDRSGSMQQAVGDSN---------------RLGVAKNA---TLSALEL 301 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 L+ + +G+I + T + S T + T P + AY+ Sbjct: 302 LNPQSQ---IGVIVFDTEETTVVPLS--TLDIPAAQIALDRVDTGGGTAIYPGLVAAYRE 356 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L S K II +TDG + I + + + Sbjct: 357 LQR------------------SESPAKHIIVMTDG----LSQPGDWEGILRQITADGTTV 394 Query: 365 VTISI 369 ++I Sbjct: 395 SAVAI 399 >gi|254820233|ref|ZP_05225234.1| hypothetical protein MintA_09911 [Mycobacterium intracellulare ATCC 13950] Length = 339 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 75/234 (32%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D ++AA K A F D + + Sbjct: 103 VMLVIDVSESMAS---------------NDVPPNRLAAAKEAGKQFADQLTPA------I 141 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + P+ V+ + + T + + A Q + + Sbjct: 142 NLGLVEFAANATLLVPPTTNRGAVKSGIDSLQPAPK---TATGEGIFTALQAIAT----- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNTIKICDKAKENFIKIVTISI 369 + P I+ +DG N + + AK ++I TIS Sbjct: 194 -----VGSVMGGGEGPPPARIVLESDGAENVPLDPNAPQGAFTAARAAKGQGVQISTISF 248 Query: 370 N--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + Q L K C + ++ + DSL +V+ + + + Sbjct: 249 GTPYGTVDYEGATIPVPVDDQTLQKICEITDGEAFHADSLDSLKNVYTTLQRQI 302 >gi|330945007|gb|EGH46785.1| von Willebrand factor, type A [Pseudomonas syringae pv. pisi str. 1704B] Length = 258 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 80/243 (32%), Gaps = 66/243 (27%) Query: 197 VDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 VD+SGSM SD +++ ++ L FL+ ++ Sbjct: 2 VDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------GRKGDR 41 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +GLI + T+ ++ VR ++ + K T A+ A + L S Sbjct: 42 VGLILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRLRMRPATS- 100 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 + ++ +TDG NN + + T A E +KI I I + P Sbjct: 101 -----------------RALVLVTDGANNAGQIDPITAAR--LAAEEGVKIYPIGIGSDP 141 Query: 374 NGQRL----------------LKTCVS-SPEYHYNVVNADSL------IHVFQNISQLMV 410 + L LK S S ++ + D L + + ++Q Sbjct: 142 DKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDALEPVAQQPT 201 Query: 411 HRK 413 + Sbjct: 202 QAR 204 >gi|194221587|ref|XP_001495285.2| PREDICTED: similar to collagen type VI alpha 6 [Equus caballus] Length = 2301 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 55/352 (15%), Positives = 114/352 (32%), Gaps = 64/352 (18%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY-----------SAVFYNTEI 118 SN LG E+I A F ++++ + I Sbjct: 528 SNKHDLGKAIENIRQMGGNTNTGAALNFTLGLLQKAKKERGNKVPCHLVVLTNGVSKDSI 587 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 N R + H+ + +N T + Y + + L + R + Sbjct: 588 VEPANRLREELIHVYAIGVREANQTQLREIAGEEKRMYYVHDFDALKDIRN-----QVVQ 642 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + E + I +VD SGS+ + +KM K + Sbjct: 643 EICAQEACKEMKADIMFLVDSSGSIG-----------------PENFSKM---KIFMKNL 682 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + V +G++ ++ ++ + + +Y+++ S D+ M Sbjct: 683 VSKSQI---GADRVQIGVVQFSHVNKEEFQLN-------RYMSQSEIS------DAIDRM 726 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + + +F + +F K +KF+I +TDGE + + + Sbjct: 727 AHIGETTLTGHALTFVSQYFSPA-KGARPNVRKFLILITDGEAQDIVKDPAV-----ALR 780 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + I I ++ + S Q L+ PE + V N + Q+I +V Sbjct: 781 QEGIIIYSVGVFGSNVTQ--LEEISGRPEMVFYVEN----FDILQHIEDDLV 826 >gi|159045656|ref|YP_001534450.1| von Willebrand factor type A domain-containing protein [Dinoroseobacter shibae DFL 12] gi|157913416|gb|ABV94849.1| von Willebrand factor type A domain protein [Dinoroseobacter shibae DFL 12] Length = 328 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/226 (14%), Positives = 69/226 (30%), Gaps = 43/226 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + L VD+SGSM + P+ + ++ A+K+ + F+ Sbjct: 91 TSAARDLVLAVDISGSMDDRDMTAPDGT---------RLQRLQAVKDVVGAFVAE----- 136 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E + LI + + + + V + + + + T A+ A + Sbjct: 137 --REGDRISLIVFGAKPFIQAPFTEDLDSVVELLNQVQTGMAGPNTAIGDAIGLAIRSFE 194 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + ++ +I L+DG + I A + I I T Sbjct: 195 DSEIE------------------ERLLILLSDGADTASTMTP--INAAQIAAQEGITIYT 234 Query: 367 ISINASPN-GQRLLKTCVSSP------EYHYNVVNADSLIHVFQNI 405 I + G+ L Y + + L ++ I Sbjct: 235 IGVGNPDGSGEERLDPATLEDIATRGGGAFYFADDVEGLSEIYAEI 280 >gi|149042955|gb|EDL96529.1| matrilin 4 (predicted), isoform CRA_a [Rattus norvegicus] Length = 637 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 67/170 (39%), Gaps = 24/170 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 + + L +G+I Y+++V+ G R+ + R + +++ + T + Sbjct: 72 LVGLLHSLDVGLNATRVGVIQYSSQVQSVFPL--GAFSNREDMERAIRAVVPLAQGTMTG 129 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + S+ + G + + ++ +TDG + ++ Sbjct: 130 LAIQYAMNVAFSEAE----------GARPSEERVPRVLVIVTDG---RPQD--RVAEVAA 174 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 +A+ I+I + + + G L+ S P ++ + V + LI F Sbjct: 175 QARARGIEIYAVGVQRADVGS--LRAMASPPLDQHVFLVESF-DLIQEFG 221 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 64/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 428 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 485 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + L + + TDG + +++ +AKE Sbjct: 486 HMVEHSFSEV----------QGARPRDLNVPRVGLVFTDG---RSQDDISV--WAARAKE 530 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + L+ S P + + D F ++ L+ + K S+ Sbjct: 531 EGIVMYAVGVG--KAVEEELREIASEPSELHVSYSPD-----FNTMTHLLENLKGSI 580 >gi|157962424|ref|YP_001502458.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157847424|gb|ABV87923.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 336 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 64/232 (27%), Gaps = 55/232 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM V L ++ Sbjct: 85 DLMLSVDLSGSMQIEDMVIDGKVVDRFT-----------LIQHVISDFIERRKGD----- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V QY+ L+ K T A+ + ++ Sbjct: 129 -RIGLILFADHAYLQSPLTQDRRSVAQYLKEAQIGLVGKQTAIGEAIALGVKR-FDKVEQ 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + +I LTDG NN A + I I TI + A Sbjct: 187 SN-----------------RVLILLTDGSNNAGAITPEQASQ--IAAQRGITIYTIGVGA 227 Query: 372 SPNGQRLL------------------KTCVSSPEYHYNVVNADSLIHVFQNI 405 +R L + + ++ N + L ++Q I Sbjct: 228 DVMERRTLFGKERVNPSMDLDESQLQEIAKVTGGQYFRARNTEELEQIYQVI 279 >gi|157818269|ref|NP_001100009.1| matrilin-4 [Rattus norvegicus] gi|149042956|gb|EDL96530.1| matrilin 4 (predicted), isoform CRA_b [Rattus norvegicus] Length = 624 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 67/170 (39%), Gaps = 24/170 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 + + L +G+I Y+++V+ G R+ + R + +++ + T + Sbjct: 59 LVGLLHSLDVGLNATRVGVIQYSSQVQSVFPL--GAFSNREDMERAIRAVVPLAQGTMTG 116 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + S+ + G + + ++ +TDG + ++ Sbjct: 117 LAIQYAMNVAFSEAE----------GARPSEERVPRVLVIVTDG---RPQD--RVAEVAA 161 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 +A+ I+I + + + G L+ S P ++ + V + LI F Sbjct: 162 QARARGIEIYAVGVQRADVGS--LRAMASPPLDQHVFLVESF-DLIQEFG 208 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 64/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 415 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 472 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + L + + TDG + +++ +AKE Sbjct: 473 HMVEHSFSEV----------QGARPRDLNVPRVGLVFTDG---RSQDDISV--WAARAKE 517 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + L+ S P + + D F ++ L+ + K S+ Sbjct: 518 EGIVMYAVGVG--KAVEEELREIASEPSELHVSYSPD-----FNTMTHLLENLKGSI 567 >gi|77465284|ref|YP_354787.1| von Willebrand factor domain-containing protein [Rhodobacter sphaeroides 2.4.1] gi|77389702|gb|ABA80886.1| Von Willebrand domain containing protein [Rhodobacter sphaeroides 2.4.1] Length = 328 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 63/177 (35%), Gaps = 37/177 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +D+SGSM + T++ A+K F++ ++ Sbjct: 90 IVLTLDMSGSMLIEDFDIDGVQS----------TRLEAVKRVARSFVEE-------RQGD 132 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + R ++ V + + + + T + A + +T S Sbjct: 133 RIGLVLFANRAYVAAPLTFDLAAVGRAIEEASIGITGRSTAIADGLGLALKSVTESSAAS 192 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + I+ L+DG++N + + + A + ++I TI++ Sbjct: 193 ------------------RVIVLLSDGQDNAHQIDARQVA--GLAARHGVRIHTIAL 229 >gi|255557538|ref|XP_002519799.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] gi|223541038|gb|EEF42595.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] Length = 514 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/224 (12%), Positives = 67/224 (29%), Gaps = 54/224 (24%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 +RP + V+D+SGSM K+ +K A+L + + Sbjct: 55 SSNDRPGLDLVAVLDVSGSMAGD--------------------KIEKVKTAMLFVIKKLS 94 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQAY 302 + + ++ ++ + TE ++ + + ++ T+ T ++ Sbjct: 95 PID------RLSVVTFSADANRLCPLRQITENSQKDLEKLINGLNADGATNITAGLQTGL 148 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++L+ I+ ++DGE N + Sbjct: 149 KVLSDRSLSGGRVVG---------------IMLMSDGEQNAGGDAAQVPV-------GNV 186 Query: 363 KIVTISI---NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + T + + + S +V + ++L F Sbjct: 187 PVYTFGFGINHEPRVLKAIAHN--SIGGTFSDVQDTNNLSKAFS 228 >gi|226315298|ref|YP_002775194.1| hypothetical protein BBR47_57130 [Brevibacillus brevis NBRC 100599] gi|226098248|dbj|BAH46690.1| hypothetical membrane protein [Brevibacillus brevis NBRC 100599] Length = 424 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 80/225 (35%), Gaps = 53/225 (23%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 I +V+D SGSM +SDP+ L A + +D Sbjct: 109 QASGANNIVMVLDTSGSMQ---SSDPD----------------NQLFKAAADMVQRMDS- 148 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAY 302 D+ + ++ + + + ++ V+ V + + T A++ Sbjct: 149 -----DMNIAVVTFHDQTNVLQPLTELSSQSVKDEVVKKLLQFPRTDGGTRIDLALQAGL 203 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 L +++ + ++ ++DG S+++ K+N + Sbjct: 204 DQLQANQMANST------------------VVLMSDG-----YSDLDVPAALAPYKQNQV 240 Query: 363 KIVTISIN-ASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 + T+ ++ +G LL+ + + ++NV +AD + +F I Sbjct: 241 IVHTVGMSQIDADGTALLQKIAAETGGSYFNVEHADQMTGIFGQI 285 >gi|126277540|ref|XP_001376725.1| PREDICTED: similar to integrin alpha 11 subunit [Monodelphis domestica] Length = 1530 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 61/182 (33%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V++ V T++ Sbjct: 527 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKEVVEAASHIEQRGGTET--- 582 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ + + Sbjct: 583 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIEDS 629 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 +++ + +++ +K S P+ + +NV + +L + + Sbjct: 630 EKDNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 689 Query: 408 LM 409 + Sbjct: 690 RI 691 >gi|312886236|ref|ZP_07745850.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] gi|311301261|gb|EFQ78316.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] Length = 335 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 60/190 (31%), Gaps = 43/190 (22%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 +ID + + +D +GL+ ++ + + + + +I T + Sbjct: 123 AIDFIKNRPDD-RIGLVIFSGESFTQCPLTIDHDVLINLYHDIKNGMIEDGTAIGMGLAT 181 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A L S K +I LTDG NN T + AK+ Sbjct: 182 AVNRL------------------RGSEAKSKVVILLTDGVNNAGSIPPITAA--EIAKQF 221 Query: 361 FIKIVTISI----------------------NASPNGQRLLKTCVSSPEYHYNVVNADSL 398 I++ T+ I + L K + ++ N D+L Sbjct: 222 GIRVYTVGIGTQGYAPYPVPSPYGGVVYQRMEVQIDEPTLTKIAAITGGKYFRATNNDAL 281 Query: 399 IHVFQNISQL 408 +++ I QL Sbjct: 282 TRIYKQIDQL 291 >gi|312196063|ref|YP_004016124.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311227399|gb|ADP80254.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 560 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 69/224 (30%), Gaps = 51/224 (22%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH------VK 249 V+D SGSM ++AAL+ AL + D LS + Sbjct: 370 VLDTSGSMEGP--------------------RLAALQQALTGLTGADDSLSGRFARFRAR 409 Query: 250 EDVYMGLIGYTTRVEKN-----IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 E V + I + +V +P+ G+ ++ T A+ AY Sbjct: 410 EQVTI--ITFNDKVTATRQFTVSDPTPGSADLKAISDYGAALRAGGNTAIYSALDAAYTT 467 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK--AKENFI 362 + K I+ +TDGENN + + + + Sbjct: 468 AAAGMKADPSALTS--------------IVLMTDGENNRGLDSAGFLARYNTRPPDVRGV 513 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD-SLIHVFQNI 405 + + + L + S+ ++ SL VF+ I Sbjct: 514 RTFAVDFGDADRA-ALTQIATSTGGAVFDATAPGVSLSDVFREI 556 >gi|293604651|ref|ZP_06687053.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] gi|292816982|gb|EFF76061.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] Length = 3744 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 57/170 (33%), Gaps = 23/170 (13%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 ++ +E G+ + I LVVD SGSM ++ ++++A +K+AL Sbjct: 3071 GTVVTVEPGKN--YNIALVVDTSGSMAYKLDG---------STNGSGQSRIALVKDALTN 3119 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + + +G + + V+ +T + T+ A Sbjct: 3120 LANQLVGHDGIVNVTLIGFATTAGTPVTLQNLT--SANVQTLLTAITNLSATGGTNYEAA 3177 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 A S K + ++ FLTDG+ + N Sbjct: 3178 FNSAVSWFNSQTAAG----------KSVAAGYENVTFFLTDGDPTYYLKN 3217 >gi|261415414|ref|YP_003249097.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261371870|gb|ACX74615.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302325633|gb|ADL24834.1| BatA protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 367 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 35/254 (13%), Positives = 70/254 (27%), Gaps = 60/254 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQD---------KKRTKMAALKNALLLFLDSI 242 I L +D+SGSM + + K +++ ++ + F+ Sbjct: 94 DIMLALDVSGSMGTLDMLTRTEQAKLGVMNAEKILKRGEYWKYSRLGYAQDVIAEFIGK- 152 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + +GL + R + + + + D A Sbjct: 153 ------RHSDRIGLSAFGARSFTQCPLTMDYGSLLEILKASDD--------------LAR 192 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 L +++ S + +I LTDG +N + AK + Sbjct: 193 DTLVNNRTAIGDGLMNALARLKMSDAKSRVVILLTDGRDNASVVPPVRAA--EVAKSLGV 250 Query: 363 KIVTISI-------------------------NASPNG--QRLLKTCVS-SPEYHYNVVN 394 K+ T+ + G + +LK S + Y N Sbjct: 251 KVYTVGVGKKSGKILAFQQNPWTGEISWGERDITPEEGIDEDVLKAIASKTGGRFYRAEN 310 Query: 395 ADSLIHVFQNISQL 408 L ++ I +L Sbjct: 311 KAELEKIYSEIDEL 324 >gi|326931809|ref|XP_003212016.1| PREDICTED: matrilin-4-like, partial [Meleagris gallopavo] Length = 465 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 66/173 (38%), Gaps = 24/173 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTD 293 +D I L +G+I Y+++V+ T R + R ++S++ + T Sbjct: 55 RFMIDIIGNLDVGPNATRVGVIQYSSQVQNIFSLK--TFFTRAEMERAINSIVPLAQGTM 112 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ A + + ++ G + P + I +TDG + V+ + Sbjct: 113 TGLAIQYAMNVAFTVQE----------GARPPHKKIPRIAIIVTDG---RPQDRVSEVA- 158 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQN 404 A+ I+I + I L+ S P E+ + V + LI F Sbjct: 159 -AHARNAGIEIYAVGIQ--RADMNSLRAMASPPLEEHVFLVESF-ELIQQFGK 207 >gi|118100589|ref|XP_425698.2| PREDICTED: similar to matrilin-4 [Gallus gallus] Length = 564 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 66/173 (38%), Gaps = 24/173 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTD 293 +D I L +G+I Y+++V+ T R + R ++S++ + T Sbjct: 55 RFMIDIIGNLDVGPNATRVGVIQYSSQVQNIFSLK--TFFTRAEMERAINSIVPLAQGTM 112 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ A + + ++ G + P + I +TDG + V+ + Sbjct: 113 TGLAIQYAMNVAFTVQE----------GARPPHKKIPRIAIIVTDG---RPQDRVSEVA- 158 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQN 404 A+ I+I + I L+ S P E+ + V + LI F Sbjct: 159 -AHARNAGIEIYAVGIQ--RADMNSLRAMASPPLEEHVFLVESF-ELIQQFGK 207 >gi|311030436|ref|ZP_07708526.1| hypothetical protein Bm3-1_07816 [Bacillus sp. m3-13] Length = 921 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 61/353 (17%), Positives = 110/353 (31%), Gaps = 69/353 (19%) Query: 53 MESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV 112 ++ I +++ L E ++ RAL DA + + I SL + Sbjct: 274 VDEIVENNISHSLTQVSGEPKVLMVDQEGEGDNVFRAL--DASGWKIDRIHPSLLPTTLS 331 Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 + I ++ + L + F MT + L + Sbjct: 332 GFLAYESIIFHNVSGHLLSATQMELIETAVKDFGVGFTMTGGNESYG-----LGGYFQTP 386 Query: 173 IVSFIPALLRIEMGER-PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 I +P + ++ + P + +V+D SGSM K Sbjct: 387 IEKILPVDMDVKGKKEIPSLGLIIVLDRSGSMMGE--------------------KFDLA 426 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-----RQYVTRDMDS 286 K A ++ + G I + T +W + + V + S Sbjct: 427 KEAAARSVELLKEEDT------FGFIAFDTE-------AWTVVETEPIKNKDEVIETIRS 473 Query: 287 LILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 L TD PA+ QAYQ L + +K II LTDG++N+ Sbjct: 474 TALGGGTDIFPALNQAYQQLNEMDLK------------------RKHIILLTDGQSNDGP 515 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 I ++ N + + T++I + L + Y V A ++ Sbjct: 516 YEE----IIEEGLTNNVTLSTVAIGGDADTSLLEELAEIGTGRFYEVYEASAV 564 >gi|296125842|ref|YP_003633094.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296017658|gb|ADG70895.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 328 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 44/240 (18%), Positives = 83/240 (34%), Gaps = 66/240 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I LVVD+S SM D T++ A K ++ F+ + Sbjct: 88 ISLVVDVSPSMMAE---------------DMMPTRLEASKKTMIDFIKK-------RNFD 125 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + L+ + R ++ + + + +++ T + A +L S K + Sbjct: 126 KISLVAFALRASVLSPSTFDYTLLEEEI-KNIKIDEEGSTSIGLGIATAVDMLRSVKGDN 184 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS---- 368 +K II LTDGENN+ + + + A IKI TI Sbjct: 185 -----------------EKIIILLTDGENNSGEIDPKLAS--EIASNFNIKIYTIGIGDA 225 Query: 369 --------INASPNGQRLLKT------------CVSSPEYHYNVVNADSLIHVFQNISQL 408 + G+R ++ ++ ++N NA +L +V+ I ++ Sbjct: 226 NGSHAWVTYDDPNYGKRRIRADFSLNEEALIDIASTTGGKYFNAQNASALDNVYNTIDRI 285 >gi|116624819|ref|YP_826975.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227981|gb|ABJ86690.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 837 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 67/196 (34%), Gaps = 52/196 (26%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L++D S SM K+ + A + ++++ + V Sbjct: 395 VVLIIDKSSSMEGR--------------------KIELARLAAIGVVENLRPIDSV---- 430 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQAYQILTSDKKR 311 G++ + + + + R + + + T PA+ +AYQ + Sbjct: 431 --GVLIFDNSFQWAVPIR--KAEDRATIKKLISGITPDGGTQIAPALTEAYQRIL----- 481 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 P K I+ LTDG +++ + +A+ N + I T+ + Sbjct: 482 -------------PQTAMYKHIVLLTDG----ISEEGDSMTLTKEAQANHVTISTVGLGQ 524 Query: 372 SPNGQRLLKTCVSSPE 387 N + L+ S+ + Sbjct: 525 DVN-RAFLEKVASNAD 539 >gi|264679151|ref|YP_003279058.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni CNB-2] gi|262209664|gb|ACY33762.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni CNB-2] Length = 1405 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 42/298 (14%), Positives = 84/298 (28%), Gaps = 32/298 (10%) Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + N +D + N I + S + + S + + D Sbjct: 585 DGPGNADASLTLDRLADLVNNPITLTTSKGTLTLTGYDATTGKVSYTYQTSGQQAHTGDD 644 Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 +N + + V + + +L + + G I L +D Sbjct: 645 TNVQDHFQITVEDKFGGKATGDLGVLITDTAPSLKPIAESSALSSHGTN----IMLTLDT 700 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 SGSM + + +++ LK+++ LD DV + ++ + Sbjct: 701 SGSMAWSSGVNNS--------NGWSLSRLDVLKSSVNGLLDKY----GEAGDVRVLILEF 748 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 + + + + +V T+ A+ +A N Sbjct: 749 NSSATQKGSGWMSLAEAKTFVNGLYAD---GGTNYQDALTKA----------MAAWNNSG 795 Query: 320 QGVKIPSLPFQKFIIFLTDGE--NNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 G + Q F TDGE +N S+ +N I I + G Sbjct: 796 TGKLEGN-NVQNISYFFTDGEPDSNRSVSSSQQTTWEKFLADNHINSYGIGLGTGATG 852 >gi|288925756|ref|ZP_06419687.1| BatA protein [Prevotella buccae D17] gi|315608294|ref|ZP_07883284.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] gi|288337411|gb|EFC75766.1| BatA protein [Prevotella buccae D17] gi|315250075|gb|EFU30074.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] Length = 332 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 69/244 (28%), Gaps = 68/244 (27%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D K ++ A K+ F+ Sbjct: 88 IDIMLAMDVSASMLAE---------------DLKPNRIEAAKDVAAEFI----SGRPNDN 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQIL 305 +GL + + + + LI T + A L Sbjct: 129 ---IGLTIFAGEAFTQCPMTTDHASLLTLLQDVRTDMATRGLINDGTAIGMGLANAVSRL 185 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 K +S + +I LTDG NN + T AK I++ Sbjct: 186 KDSKTKS------------------RVVILLTDGANNAGDISPLTAAQ--MAKSLGIRVY 225 Query: 366 TISIN---------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 TI++ A + + L + + Y+ N L ++ + Sbjct: 226 TIAVGTSKVAPYPIEVGGRVQYISRPADIDTKTLREIAAVTEGNFYSANNTAQLKQIYHD 285 Query: 405 ISQL 408 I QL Sbjct: 286 IDQL 289 >gi|298488105|ref|ZP_07006142.1| von Willebrand factor type A domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298157384|gb|EFH98467.1| von Willebrand factor type A domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 352 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 75/227 (33%), Gaps = 60/227 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ + L Sbjct: 131 RKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRLRLR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 191 PATS------------------RVLVLVTDGANNAGQIDPITAAR--LAAEEGVKIYPIG 230 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL 398 I + P+ L LK S S ++ + D L Sbjct: 231 IGSDPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQL 277 >gi|289624057|ref|ZP_06457011.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. NCPPB3681] gi|289650363|ref|ZP_06481706.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. 2250] gi|330866187|gb|EGH00896.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. 0893_23] Length = 352 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 75/227 (33%), Gaps = 60/227 (26%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ + L Sbjct: 131 RKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRLRLR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 191 PATS------------------RVLVLVTDGANNAGQIDPITAAR--LAAEEGVKIYPIG 230 Query: 369 INASPNGQRL----------------LKTCVS-SPEYHYNVVNADSL 398 I + P+ L LK S S ++ + D L Sbjct: 231 IGSDPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQL 277 >gi|261252915|ref|ZP_05945488.1| putative outer membrane adhesin like proteiin [Vibrio orientalis CIP 102891] gi|260936306|gb|EEX92295.1| putative outer membrane adhesin like proteiin [Vibrio orientalis CIP 102891] Length = 3332 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 67/182 (36%), Gaps = 23/182 (12%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + I LVVD SGSM + D T+M ++ AL ++S+ Sbjct: 2733 YNIALVVDASGSMGDYV--YNTDGTVMRNPDGSAMTRMDMMQEALTNLVESLVTHDGS-- 2788 Query: 251 DVYMGLIGYTTRVEKNIE---PSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILT 306 + + LIG+ ++ E + ++ V + +++ ++L + TD ++A Sbjct: 2789 -INIKLIGFDDNIDVTFEALDITNSSDVVAELLSKIENNLPVGGGTDYGVGFEEANNWYA 2847 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S S FLTDGE N+ N + + + K++ Sbjct: 2848 SSSISSNGYENMT--------------FFLTDGEPNSGTLNNGLTEYNELVSTHNAKVMA 2893 Query: 367 IS 368 + Sbjct: 2894 VG 2895 >gi|70730213|ref|YP_259952.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344512|gb|AAY92118.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 332 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 78/234 (33%), Gaps = 43/234 (18%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +RP+ + L +DLS SM ++ +++A+K + F+ Sbjct: 87 QQRPVRDLMLAIDLSQSM---------QTQDFNDANGQRIDRLSAVKEVVQGFIQR---- 133 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 ++D +GLI + + + + + + T A+ A ++L Sbjct: 134 ---RKDDRLGLIVFGSGAFAQAPLTLDHASLSLLLEDSGIGMAGPNTAIGDAIGLALKLL 190 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + +K +I LTDG N+ S + A + I Sbjct: 191 EQAHE------------------PEKVLILLTDG--NDTSSAITPQHAAAMAAARGVVIH 230 Query: 366 TISINASPNG-------QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 TI I L + ++ ++ + +L V+ + +L H+ Sbjct: 231 TIGIGDPSAEGEAKVDLSALEQIARTTGGRYFRAEDRSALDQVYATLDRLTPHQ 284 >gi|149410544|ref|XP_001506183.1| PREDICTED: similar to protocadherin 9, partial [Ornithorhynchus anatinus] Length = 588 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 44/229 (19%), Positives = 80/229 (34%), Gaps = 49/229 (21%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + V+D S S+ + +K ++ L + Sbjct: 2 ESSCDSKRLDLIFVIDSSRSV--------------------RPHDFEKVKEFIVTILQFL 41 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL--ILKPTDSTPAMKQ 300 D+ V +GLI Y + V+ T + V R + + + T + A++ Sbjct: 42 DVAPDV---TRVGLIQYGSTVKNEFSLK--TYGRKSEVERAVKVMKRLGTGTMTGLAIQY 96 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A I S+ + G + + I+ +TDG + + V KA+ + Sbjct: 97 AVNIAFSESE----------GARPLRENVPRIIMIVTDGRPQDPVAEVA-----AKARNS 141 Query: 361 FIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 I I I + LK+ S P ++ + V N +SL VFQN Sbjct: 142 GILIFAIGVG--QVDYNTLKSIGSKPHQDHVFLVANFSQIESLTSVFQN 188 >gi|218662717|ref|ZP_03518647.1| hypothetical protein RetlI_26604 [Rhizobium etli IE4771] Length = 295 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 41/284 (14%), Positives = 87/284 (30%), Gaps = 63/284 (22%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSM----------------HCAMNSDPEDVNSAPIC 220 I E +++D + SM CA D Sbjct: 8 ISGTATAEYQTAAFMDFYILLDNTPSMGVGATPDDVSKLEAKAGCAFACHQMDKTINNYT 67 Query: 221 QDKK---RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 K ++ ++ A D+ + MG+ + T+ E + + Sbjct: 68 IAKSLGVAMRIDVVRQATQALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTT--ISGLT 125 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF-------RQGVKIPSLPFQ 330 +T+ + TD+ M YQ +D+ +F + + G ++ + Sbjct: 126 SDLTKVKNY-----TDAVDLMTIPYQNYNNDQITNFDSAMTQMNTIIDQAGDGTSNISAE 180 Query: 331 KFIIFLTDGENNNFKSNVNTIKI-------------CDKAKENFIKI---VTISINASPN 374 K + F++DG +++K + T K C K+ +KI T + N Sbjct: 181 KILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPSN 240 Query: 375 G-------------QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 ++ C SP +++ V + + + + Sbjct: 241 SWYNTWIKPFQGEIPTKMQACA-SPGFYFEVSPTEGITDAMKAL 283 >gi|126464748|ref|YP_001045861.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] gi|126106559|gb|ABN79089.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] Length = 328 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 63/177 (35%), Gaps = 37/177 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +D+SGSM + T++ A+K F++ ++ Sbjct: 90 IVLTLDMSGSMLIEDFDIDGVQS----------TRLEAVKRVARSFVEE-------RQGD 132 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + R ++ V + + + + T + A + +T Sbjct: 133 RIGLVLFANRAYVAAPLTFDLAAVGRAIEEASIGITGRSTAIADGLGLALKRVT------ 186 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 S + I+ L+DG++N + + + A + ++I TI++ Sbjct: 187 ------------ESAAASRVIVLLSDGQDNAHQIDARQVA--GLAARHGVRIHTIAL 229 >gi|299533511|ref|ZP_07046889.1| isocitrate dehydrogenase [Comamonas testosteroni S44] gi|298718494|gb|EFI59473.1| isocitrate dehydrogenase [Comamonas testosteroni S44] Length = 688 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 36/172 (20%), Positives = 60/172 (34%), Gaps = 32/172 (18%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 V+ A + G+ L L++DLSGSM+ + +++++ +K Sbjct: 258 VAEPTASDYVLPGQDSNLL--LMLDLSGSMNW-------GQENNKNPAPGEKSRLQIMKE 308 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPT 292 A+ L LDS L VK V + + + + + +W + V T Sbjct: 309 AVSLMLDSYAALGDVK--VRI--VTFQNTAAQARQTTWIDIATAKSIVNAL---TATGGT 361 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + A++ A Q S K N FLTDGE Sbjct: 362 PYSKALETAMQAFNSAGKIDGGKNIAY---------------FLTDGEPTAS 398 >gi|73538303|ref|YP_298670.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72121640|gb|AAZ63826.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 358 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 88/261 (33%), Gaps = 64/261 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+DLSGSM QD + +++ A + A + L Sbjct: 97 TVILVIDLSGSM---------------RAQDVRPSRIRAAQQAARVLL------DAQPAG 135 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQIL----T 306 V +G++ + + + V ++ L T + A L T Sbjct: 136 VSVGVVA----MAGTAALAQAPSHSKDDVATAIEGLKPQGGTALGNGLLIALTTLLPQTT 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKF---------------IIFLTDGENNNFKSNVNTI 351 +D +R Q K + I+ +DGE+N+ V Sbjct: 192 NDAERLMNGGDVAQPGKPGKAAPGELDNGEPVRPGSYASGAIVLFSDGESNSGPGAVQAA 251 Query: 352 KICDKAKENFIKIVTISINASPN--------------GQRLLKTCV-SSPEYHYNVVNAD 396 ++ A +++ T+ + + +++LK ++ ++ + + Sbjct: 252 QL---AATYGVRVYTVGVGTTEGVVLSADGWSARVRLDEKVLKQVADTTGAEYFRLEDTA 308 Query: 397 SLIHVFQNI-SQLMVHRKYSV 416 +L V++ + ++L ++ V Sbjct: 309 ALKKVYRALNTRLAFDKRDQV 329 >gi|317055486|ref|YP_004103953.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315447755|gb|ADU21319.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 1311 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 40/231 (17%), Positives = 78/231 (33%), Gaps = 55/231 (23%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 G+ I LV+D SGSM DP+++ K+A F+D + Sbjct: 636 NSGKTVAMDIALVIDSSGSMTWN---DPKNLR----------------KDAAKEFVDKLS 676 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + +I + + + N + + + D T T + + + Sbjct: 677 SIDE------AAIIDFDSSSKINRNLTSNRTLLYSAID---DIDSSGGTSLTAGVSKGLE 727 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L+ +K +I LTDG+ KS +A + Sbjct: 728 ALSKSND-------------------KKIMILLTDGKGPYDKSLT------TQAINAGVT 762 Query: 364 IVTISINASPN-GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 I TI + + + Q LL + + + +Y+ + F N+S + ++ Sbjct: 763 IYTIGLGTNNDIDQPLLNSIATETGGKYYHAKKDIDIQGSFDNVSGDLGNK 813 >gi|254504856|ref|ZP_05117007.1| hypothetical protein SADFL11_4895 [Labrenzia alexandrii DFL-11] gi|222440927|gb|EEE47606.1| hypothetical protein SADFL11_4895 [Labrenzia alexandrii DFL-11] Length = 455 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 61/458 (13%), Positives = 139/458 (30%), Gaps = 64/458 (13%) Query: 16 ASEKANFSIIFA-LSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 A N +I+ A V L+ IG + V+ + ++S ++A LA AS + + Sbjct: 3 ADRSGNVAILTALAFVPLMLITIG-SLDVVRMTTAQAKLQSTLDSATLAAAS-LSNTADI 60 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV-------NSSRI 127 E I + + + +SL+ S T + + + + Sbjct: 61 EDTVDEYIQANLPDTAPWTTLKLTMGDVTDSLNAKSVEITATVDIEMTILKLAGIDKTSV 120 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ-------FIEHLLNQRYNQK--IVSFIP 178 + +A + ++ ++ FI+ +L + +++ +S IP Sbjct: 121 LASSVAQQAAQNIEVSVVLDISSSMGGSKITSLREAAKGFIDTMLKEDEDKEYTSLSIIP 180 Query: 179 ALLRIEMGE-----RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + +G+ V+D S + +N + + + + + Sbjct: 181 FGGTVNIGDFYDTYAVNSSTPGVIDSPSSANYYVNKNVPYGKFMFSTEREGC--IEYTDD 238 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + + V + ++ + D + T Sbjct: 239 DFDMAAIPANSRPQVPDFTKWVATN-PWCPSEDSAMVLNSNNTTDLKALIDDMDLSDGTG 297 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN---------- 343 ++L+ + +F + K + +TDG Sbjct: 298 MDIGALWGAKVLSGSMRGQLGGDFSDRPADFNDEDTLKVAVIMTDGAITAQFRPRDYTTT 357 Query: 344 ------------FKSNVNT------------IKICDKAKENFIKIVTISINASPNGQ--R 377 K N+NT ++C+ +N +++ TI + + Sbjct: 358 GKIKNKTQQTIVSKGNINTASTKADDAVAYFKRVCEYLNDNNVQVYTIGFQINSGSLPDQ 417 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 LLK C SS +Y V + + F I+ + + + S Sbjct: 418 LLKYCASSLSNYYFVEGLN-IEDAFNAIASAVNNLRVS 454 >gi|166366808|ref|YP_001659081.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] gi|166089181|dbj|BAG03889.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] Length = 456 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 46/270 (17%), Positives = 91/270 (33%), Gaps = 32/270 (11%) Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 T+I++I + + + + + D++L+ ++ N+ Y K + Sbjct: 48 ITDIEDIDDKVTLQIQVT---------GEESKPIMGLAESDFQLKVLDKKNNKTYQGKQL 98 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 F R I +++D SGSM+C S + K + K+ A NA Sbjct: 99 PFDWKSPRETTPPDAW--IVVLIDFSGSMNC---SQDLNTKCDAKAVAKGKRKLDAAINA 153 Query: 235 LLLFLDSIDLLSHVKEDVYM------------GLIGYTTRVEKNIEPSWGTE---KVRQY 279 L F+ + + + G Y +V ++ K+ + Sbjct: 154 LGTFIK-LASERKGNTYLSIVPFGVEGKNDKPGACDYYPKVTSETLDNFNLVQDVKLTNF 212 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + D T+ A+K+ + +DK+ F+ P II L+DG Sbjct: 213 LGSLADKTPCATTNFYQALKETVKFFKNDKEGRFYPKDKEGKPLKPQPRLS--IILLSDG 270 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +NN + + I + T+ Sbjct: 271 FDNNSNYQEVQKTLANLQNNKDIVVHTLGY 300 >gi|328951280|ref|YP_004368615.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451604|gb|AEB12505.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 320 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 84/240 (35%), Gaps = 57/240 (23%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 R + + L +D S SM +D + +++ A K A F+ Sbjct: 87 PAYRSVATVVLAIDTSRSM---------------RAEDLEPSRLEAAKAAAREFI---RA 128 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + V +GL+ +++ P+ E++ Q V + T + A ++ Sbjct: 129 MPPG---VEVGLVAFSSYATLLQPPTTDRERLEQAVDLLDLAHR---TAIGDGLVAALRV 182 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L ++ P ++ L+DG NN + + +A+ +++ Sbjct: 183 L---------------PLEDSDAPGGMSVVLLSDGRNNYGIDPLEAAR---QAEAQGVRV 224 Query: 365 VTISI---------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 T+ + A + + L + + +Y +AD L V+Q +++ + Sbjct: 225 YTVGVGLSENTYVFANGYYIRAGLDEETLQEIAALTGGAYYRASSADELRAVYQTLARAV 284 >gi|296200542|ref|XP_002747689.1| PREDICTED: matrilin-4 [Callithrix jacchus] Length = 770 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 62/169 (36%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 + + L+ +G+I Y+++V+ + V D + L T + Sbjct: 56 LVGLVRGLNVGPNATRVGVIQYSSQVQSVFPLR-AFSRREDMVRAIRDLVPLAQGTMTGL 114 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 115 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 159 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 160 ARARGIEIYAVGVQRADVGS--LRAMASPPLEEHVFLVESF-DLIQEFG 205 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 64/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 412 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 469 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 470 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 514 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P S F ++ L+ + K S+ Sbjct: 515 EGIVMYAVGVGKAVEAE--LREIASEP-----AEQHMSYAPDFGTMTHLLENLKGSI 564 >gi|256376278|ref|YP_003099938.1| hypothetical protein Amir_2147 [Actinosynnema mirum DSM 43827] gi|255920581|gb|ACU36092.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 321 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 49/270 (18%), Positives = 89/270 (32%), Gaps = 59/270 (21%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L + V+ ++ + LV+D+S SM D Sbjct: 63 FLMVAFMLLTVALAGPTAEQKVPRNRA-TVMLVIDVSLSM---------------KATDV 106 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + T++ A + A F + + +GLI + + P+ R V++ Sbjct: 107 QPTRLEAAQVAAKSF------AEGLTPGINLGLISFAGSATVLVAPT----TDRSAVSQG 156 Query: 284 MDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 +D L L T + A+ A + S K + P + ++ +TDG Sbjct: 157 IDGLKLAQSTATGDAIVAALSAIDSFGKVVGGAD--------GPPPAR--VVLMTDG--- 203 Query: 343 NFKSNVNTIKICD---KAKENFIKIVTISI--------------NASPNGQRLLKTCVSS 385 K V T K D AKE I I TIS + + + + S Sbjct: 204 --KETVGTRKATDAAGDAKEAGIPISTISFGTERGSVDINGKAQEVPVDDESMKEIAKIS 261 Query: 386 PEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + +A+ L V+ + + + + K Sbjct: 262 GGEFFKAASAEELRRVYDTLGEQIGYEKKQ 291 >gi|134093121|gb|ABO52981.1| matrilin 4 isoform 1 precursor [Callithrix jacchus] Length = 580 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 62/169 (36%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 + + L+ +G+I Y+++V+ + V D + L T + Sbjct: 56 LVGLVRGLNVGPNATRVGVIQYSSQVQSVFPLR-AFSRREDMVRAIRDLVPLAQGTMTGL 114 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 115 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 159 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 160 ARARGIEIYAVGVQRADVGS--LRAMASPPLEEHVFLVESF-DLIQEFG 205 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 48/130 (36%), Gaps = 19/130 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 371 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 428 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 429 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 473 Query: 360 NFIKIVTISI 369 I + + + Sbjct: 474 EGIVMYAVGV 483 >gi|242091866|ref|XP_002436423.1| hypothetical protein SORBIDRAFT_10g002210 [Sorghum bicolor] gi|241914646|gb|EER87790.1| hypothetical protein SORBIDRAFT_10g002210 [Sorghum bicolor] Length = 636 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 58/164 (35%), Gaps = 31/164 (18%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + R + V+D+SGSM D + P + +++ LK A + + Sbjct: 73 VRPEARVPIDVVAVLDVSGSM--------NDPAAVPPERRPTTSRLDLLKTAAKFMVAKL 124 Query: 243 DLLSHVKEDVYMGLIGYTTRVEK--NIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMK 299 + + ++ + R K + + + R+ + +D L T PA + Sbjct: 125 EDGD------RLSIVAFNDRPVKELSSGLLYMSADGRRKAMKSVDQLEARGGTALVPAFE 178 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 +A ++L FI+ LTDGE+ + Sbjct: 179 EAVKVLDGRVGDGRNRLG--------------FIVLLTDGEDTS 208 >gi|33321021|gb|AAQ06268.1| unknown [Sorghum bicolor] Length = 610 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 58/164 (35%), Gaps = 31/164 (18%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + R + V+D+SGSM D + P + +++ LK A + + Sbjct: 65 VRPEARVPIDVVAVLDVSGSM--------NDPAAVPPERRPTTSRLDLLKTAAKFMVAKL 116 Query: 243 DLLSHVKEDVYMGLIGYTTRVEK--NIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMK 299 + + ++ + R K + + + R+ + +D L T PA + Sbjct: 117 EDGD------RLSIVAFNDRPVKELSSGLLYMSADGRRKAMKSVDQLEARGGTALVPAFE 170 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 +A ++L FI+ LTDGE+ + Sbjct: 171 EAVKVLDGRVGDGRNRLG--------------FIVLLTDGEDTS 200 >gi|126731955|ref|ZP_01747758.1| Von Willebrand domain containing protein [Sagittula stellata E-37] gi|126707487|gb|EBA06550.1| Von Willebrand domain containing protein [Sagittula stellata E-37] Length = 318 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 59/178 (33%), Gaps = 37/178 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +DLSGSM S +++AA++ F+ + Sbjct: 88 DIVLALDLSGSMEREDFSLNGQTV----------SRLAAVQGVAADFVR-------GRTG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + R + V + + + K T + A + L + Sbjct: 131 DRVGLVVFGDRAYVAAPQTHDVASVARLIDGLQIGVSGKATAIADGLGLAIRRLRERDAK 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 S + I+ L+DG++ + A++ +++ TI++ Sbjct: 191 S------------------RVILLLSDGQDTTGMVDPVAAAQ--TARDLGMRVYTIAL 228 >gi|218672731|ref|ZP_03522400.1| hypothetical protein RetlG_14377 [Rhizobium etli GR56] Length = 323 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 23/147 (15%), Positives = 52/147 (35%), Gaps = 28/147 (19%) Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + + L+ + + V S K ++F+TDGE N Sbjct: 174 NLTSEGSTRLDAGVVAGWYTLSPKWQGVWGDETSPAEV---SDSVHKVMVFMTDGEMN-T 229 Query: 345 KSNVNTI----------------------KICDKAKENFIKIVTISINASPNGQRLLKTC 382 K + N C K++ I+I T+S +A + ++ C Sbjct: 230 KYDPNDKFDWICSQTQSSACNAFATAAMQTACTAMKKSGIEIYTLSYSADADVVN-IRNC 288 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLM 409 ++ + + + ++ V++ I+ + Sbjct: 289 ATNTAHFFTA-SPATIKTVYETIAAAI 314 >gi|153825062|ref|ZP_01977729.1| von Willebrand factor type A domain protein [Vibrio cholerae MZO-2] gi|149741387|gb|EDM55421.1| von Willebrand factor type A domain protein [Vibrio cholerae MZO-2] Length = 318 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 47/280 (16%), Positives = 85/280 (30%), Gaps = 60/280 (21%) Query: 151 MTSYDYRLQFIEHLL----NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 TS ++ LL + I + LVVDLS SM Sbjct: 41 STSGSRPATWLPRLLAVGVWFLLITAAARPVWYGDPISTSTSHR-DLMLVVDLSYSMS-- 97 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 ED+ S D ++ A+K L F+ +E +GLI + Sbjct: 98 ----QEDMQSGQQMVD----RLTAVKQVLSEFIAK-------REGDRIGLILFADHAYLQ 142 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + V + + + LI T + A + Sbjct: 143 TPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSNA---------------- 186 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------- 373 Q+ +I L+DG N + + AK+ I T+ + A Sbjct: 187 --PQRVMILLSDGSNTAGVLDPLEAA--NIAKQYHTTIYTLGVGAGEMVVKDFLFSRKVN 242 Query: 374 -----NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L ++ +++ N L +++ I+QL Sbjct: 243 TAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQL 282 >gi|84498180|ref|ZP_00996977.1| putative membrane protein [Janibacter sp. HTCC2649] gi|84381680|gb|EAP97563.1| putative membrane protein [Janibacter sp. HTCC2649] Length = 654 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 35/236 (14%), Positives = 77/236 (32%), Gaps = 50/236 (21%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P+ + + R L++D SGSM R+ MA ++ A+ Sbjct: 74 PSPVTSKPATRAQRTTVLLIDTSGSMG--------------------RSGMATVRTAVKD 113 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 FL S +DV +G++ + + + PT + A Sbjct: 114 FLASA------PKDVRIGVVSF----------------------GNTAGPEIAPTTARAA 145 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 ++ L +D + F+ + + S + I+ L+DG+N Sbjct: 146 VQAVVDDLRADGNTALFSGVTQAVRMLGSTG-DRSIVLLSDGKNTVGDRASGLAAAGKAL 204 Query: 358 KENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + +++ + N L + +A+ + FQ ++++ + Sbjct: 205 TASQVRVEVVRFTTGENDPEALAAFAKAGGGSVVQATDAEGVRTAFQTAAKVLESQ 260 >gi|239781743|pdb|2WIN|I Chain I, C3 Convertase (C3bbb) Stabilized By Scin gi|239781744|pdb|2WIN|J Chain J, C3 Convertase (C3bbb) Stabilized By Scin gi|239781745|pdb|2WIN|K Chain K, C3 Convertase (C3bbb) Stabilized By Scin gi|239781746|pdb|2WIN|L Chain L, C3 Convertase (C3bbb) Stabilized By Scin Length = 507 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 79/226 (34%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A K L+ ++ + Sbjct: 2 IVLDPSGSMNIYLVLDGSDSIGASNFTG--------AKKCLVNLIEKVASYGVKPRY--- 50 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 51 GLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 107 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN-- 360 ++ + II +TDG +N + + K ++N Sbjct: 108 -------SWPDDVPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPR 160 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 161 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 206 >gi|194384366|dbj|BAG64956.1| unnamed protein product [Homo sapiens] Length = 1266 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 79/226 (34%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A K L+ ++ + Sbjct: 763 IVLDPSGSMNIYLVLDGSDSIGASNFTG--------AKKCLVNLIEKVASYGVKPRY--- 811 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 812 GLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 868 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN-- 360 ++ + II +TDG +N + + K ++N Sbjct: 869 -------SWPDDVPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPR 921 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 922 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 967 >gi|168983786|emb|CAQ06837.1| complement factor B [Homo sapiens] gi|168984885|emb|CAQ08426.1| complement factor B [Homo sapiens] Length = 589 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 79/226 (34%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A K L+ ++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTG--------AKKCLVNLIEKVASYGVKPRY--- 309 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 310 GLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 366 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN-- 360 ++ + II +TDG +N + + K ++N Sbjct: 367 -------SWPDDVPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPR 419 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 420 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 465 >gi|134105218|pdb|2OK5|A Chain A, Human Complement Factor B Length = 752 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 79/226 (34%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A K L+ ++ + Sbjct: 246 IVLDPSGSMNIYLVLDGSDSIGASNFTG--------AKKCLVNLIEKVASYGVKPRY--- 294 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 295 GLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 351 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN-- 360 ++ + II +TDG +N + + K ++N Sbjct: 352 -------SWPDDVPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPR 404 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 405 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 450 >gi|57209925|emb|CAI41860.1| complement factor B [Homo sapiens] Length = 764 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 79/226 (34%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A K L+ ++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTG--------AKKCLVNLIEKVASYGVKPRY--- 309 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 310 GLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 366 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN-- 360 ++ + II +TDG +N + + K ++N Sbjct: 367 -------SWPDDVPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPR 419 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 420 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 465 >gi|13278732|gb|AAH04143.1| Complement factor B [Homo sapiens] gi|14124934|gb|AAH07990.1| Complement factor B [Homo sapiens] gi|62898361|dbj|BAD97120.1| complement factor B preproprotein variant [Homo sapiens] gi|119623955|gb|EAX03550.1| complement factor B [Homo sapiens] gi|123982996|gb|ABM83239.1| complement factor B [synthetic construct] gi|123997681|gb|ABM86442.1| complement factor B [synthetic construct] gi|307685187|dbj|BAJ20524.1| complement factor B [synthetic construct] Length = 764 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 79/226 (34%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A K L+ ++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTG--------AKKCLVNLIEKVASYGVKPRY--- 309 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 310 GLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 366 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN-- 360 ++ + II +TDG +N + + K ++N Sbjct: 367 -------SWPDDVPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPR 419 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 420 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 465 >gi|13560705|gb|AAK30167.1|AF349679_1 factor B [Homo sapiens] Length = 621 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 79/226 (34%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A K L+ ++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTG--------AKKCLVNLIEKVASYGVKPRY--- 309 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 310 GLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 366 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN-- 360 ++ + II +TDG +N + + K ++N Sbjct: 367 -------SWPDDVPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPR 419 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 420 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 465 >gi|291922|gb|AAA16820.1| complement factor B [Homo sapiens] gi|2347133|gb|AAB67977.1| complement factor B [Homo sapiens] Length = 764 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 79/226 (34%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A K L+ ++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTG--------AKKCLVNLIEKVASYGVKPRY--- 309 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 310 GLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 366 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN-- 360 ++ + II +TDG +N + + K ++N Sbjct: 367 -------SWPDDVPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPR 419 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 420 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 465 >gi|67782358|ref|NP_001701.2| complement factor B preproprotein [Homo sapiens] gi|584908|sp|P00751|CFAB_HUMAN RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; AltName: Full=Glycine-rich beta glycoprotein; Short=GBG; AltName: Full=PBF2; AltName: Full=Properdin factor B; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|4261689|gb|AAD13989.1|S67310_1 complement factor B [Homo sapiens] gi|297569|emb|CAA51389.1| complement factor B [Homo sapiens] gi|25070931|gb|AAN71991.1| B-factor, properdin [Homo sapiens] gi|55961819|emb|CAI17456.1| complement factor B [Homo sapiens] gi|123857994|emb|CAM25864.1| complement factor B [Homo sapiens] gi|168984418|emb|CAQ09274.1| complement factor B [Homo sapiens] gi|168985079|emb|CAQ07483.1| complement factor B [Homo sapiens] gi|168985957|emb|CAQ07113.1| complement factor B [Homo sapiens] Length = 764 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 79/226 (34%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A K L+ ++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTG--------AKKCLVNLIEKVASYGVKPRY--- 309 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 310 GLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 366 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN-- 360 ++ + II +TDG +N + + K ++N Sbjct: 367 -------SWPDDVPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPR 419 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 420 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 465 >gi|94313457|ref|YP_586666.1| hypothetical protein Rmet_4532 [Cupriavidus metallidurans CH34] gi|93357309|gb|ABF11397.1| conserved hypothetical protein, (Von Willebrand factor, type A); putative membrane protein [Cupriavidus metallidurans CH34] Length = 334 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 75/234 (32%), Gaps = 43/234 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +P+ + L +DLS SM P V ++ A++ + F+ Sbjct: 90 TQPVRDLLLALDLSQSMDTRDFKTPSGVLEP---------RVDAVRQVVADFVAR----- 135 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GLI + + VR+ + D P M A L Sbjct: 136 --RTGDRIGLIVFGDAPYPLAPFTLDHALVRELLA-----------DMVPGMAGASTSLG 182 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 +K +I LTDG N+ S + + D AK + + T Sbjct: 183 DAIGLGIKMFDQSHAQ-------EKVMILLTDG--NDTASRMPPAQAADIAKTRGVVVHT 233 Query: 367 ISINASPN------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + I LK S + ++ + SL ++ + ++ HR+ Sbjct: 234 VGIGDPATTGEQKVDLDALKHIASTTGGRYFFGADQTSLASIYATLDRVTPHRE 287 >gi|262172998|ref|ZP_06040675.1| protein BatA [Vibrio mimicus MB-451] gi|261890356|gb|EEY36343.1| protein BatA [Vibrio mimicus MB-451] Length = 318 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 78/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 85 DLMLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSEFITK-------REG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + + LI T + A + Sbjct: 128 DRVGLILFADHAYLQTPLTMDRQTVISQLNQAVLKLIGTQTAIGEGIGLATKTFID---- 183 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 S Q+ +I L+DG N + + AK+ I T+ + Sbjct: 184 --------------SDAPQRVMILLSDGSNTAGVLDPLEAA--NIAKQYQTTIYTVGVGA 227 Query: 370 ---------------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 A ++ L+T S + ++ N L ++ I+QL Sbjct: 228 GEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRARNQQDLQSIYDTINQL 282 >gi|260912478|ref|ZP_05919014.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] gi|260633397|gb|EEX51551.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] Length = 332 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 47/246 (19%), Positives = 74/246 (30%), Gaps = 72/246 (29%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D +M A K+ F+ D + Sbjct: 88 IDIMLAMDVSTSMLAE---------------DLTPNRMEAAKDVAAEFI--ADRPNDN-- 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVR-----QYVTRDMDS--LILKPTDSTPAMKQAYQ 303 +GL + + T+ Q V D+ + LI T + A Sbjct: 129 ---IGLTIFAGEAFTQCPMT--TDHTSLLNMLQTVRTDIAAKGLIQDGTAIGMGLANAVS 183 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K +S K +I LTDG NN + T + AK I+ Sbjct: 184 RLKDSKAKS------------------KVVILLTDGSNNMGDLSPMTSA--NIAKSLGIR 223 Query: 364 IVTISI---------NASPNG------------QRLLKTCVSSPEYHYNVVNADSLIHVF 402 + TI + G + L S+ Y N L ++ Sbjct: 224 VYTIGVGTNKVARYPMPVAGGVQYVNMPVEIDTKVLKDIAASTDGNFYRATNNQELKQIY 283 Query: 403 QNISQL 408 ++I +L Sbjct: 284 KDIDKL 289 >gi|226226933|ref|YP_002761039.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226090124|dbj|BAH38569.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 326 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 41/242 (16%), Positives = 74/242 (30%), Gaps = 67/242 (27%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L VD+S SM + + +M K D + ++ Sbjct: 86 IDIALTVDISSSMLAE--------------DFQPQNRMEVAK-------DKVKRFVMGRK 124 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQY--VTRDMDSLILKPTDSTPAMKQAYQILTSD 308 +GL+ ++ + + Y V +D+L + D Sbjct: 125 SDRVGLVAFSGEALTQVPLT------TDYPVVLAAIDNLQVGQ--------------LED 164 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ LTDGENN + T A I+I TI Sbjct: 165 GTAIGTAIATAANRLRNSPGRSRVMVLLTDGENNRGAIDPRTAAQ--AAGTFGIRIYTIG 222 Query: 369 I-----NASPNGQRL--LKT---------------CVSSPEYHYNVVNADSLIHVFQNIS 406 + A P G+ L L+ S+ ++ +A +L +++ I Sbjct: 223 VGTDGMAAVPVGRGLFGLRYENRPVKIDEALLTEIANSTGGRYFRAKDAAALQSIYEQID 282 Query: 407 QL 408 +L Sbjct: 283 RL 284 >gi|45384196|ref|NP_990403.1| matrilin-3 precursor [Gallus gallus] gi|14548115|sp|O42401|MATN3_CHICK RecName: Full=Matrilin-3; Flags: Precursor gi|2326444|emb|CAA03885.1| matrilin-3 [Gallus gallus] Length = 452 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 62/174 (35%), Gaps = 26/174 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDST 295 ID L + + ++ Y + V+ + +++ V+R + T + Sbjct: 77 LSKMIDTLDVGERTTRVAVMNYASTVKVEFPLRTYFDKASMKEAVSRIQP--LSAGTMTG 134 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + +++ + NF K +I +TDG + V + Sbjct: 135 LAIQAAMDEVFTEEMGTRPANF----------NIPKVVIIVTDG---RPQDQVE--NVAA 179 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 A+ I+I + + + L+ S P E+ + V + L F+ Sbjct: 180 NARTAGIEIYAVGVG--RADMQSLRIMASEPLDEHVFYVETYGVIEKLTSKFRE 231 >gi|331694298|ref|YP_004330537.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326948987|gb|AEA22684.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 362 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 65/182 (35%), Gaps = 33/182 (18%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + +D+SGSM D ++AA + A+ F++ D Sbjct: 88 VIVALDVSGSM---------------CSTDVVPNRLAAAQEAVRRFVEEQDSG------T 126 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKR 311 +GL+ ++ E + R +TR +D L T A+ ++ ++ Sbjct: 127 RIGLVVFS----GFAELAVAPTTDRDAITRALDGLTTGRGTTVGSAILKSVDAISEIDPD 182 Query: 312 SFFTNFFRQGV----KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 ++ V + P + ++ LTDG N + + K A E +++ I Sbjct: 183 VAPSDPAPGTVPPPPRAPGTYAPEIVVLLTDGANTTGVTPEDAAK---TAAERGVRVYPI 239 Query: 368 SI 369 Sbjct: 240 GF 241 >gi|87308177|ref|ZP_01090319.1| hypothetical protein DSM3645_21307 [Blastopirellula marina DSM 3645] gi|87289259|gb|EAQ81151.1| hypothetical protein DSM3645_21307 [Blastopirellula marina DSM 3645] Length = 1032 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/246 (13%), Positives = 82/246 (33%), Gaps = 50/246 (20%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + S + +R + P+ + LV+D SGSM Sbjct: 433 WANTKLEEASPVRFTIR-DAKVVPVGALMLVLDKSGSMQGE------------------- 472 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 KM + A L + ++ + G+IG+ ++ ++ + + +V + Sbjct: 473 -KMQMTQGAALAAIRAMGAAD------FAGVIGFDSQAQRIVPIR-KVDNPGMFVAQVRK 524 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ TP + ++ L + K +I L+DG+ Sbjct: 525 LSASGGTNMTPGVALGFRDLQNVDAGV------------------KHMIVLSDGQT---- 562 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 N +I K+ + + +++ + + + + + Y V N ++ +F Sbjct: 563 EPGNVAQIASDMKKMGMTVSAVAVGSDADQKLMATVARNGGGKFYAVNNPKAIPRIFMRE 622 Query: 406 SQLMVH 411 ++ + Sbjct: 623 ARRVAQ 628 >gi|256426121|ref|YP_003126774.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256041029|gb|ACU64573.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 462 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 41/224 (18%), Positives = 76/224 (33%), Gaps = 49/224 (21%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 R I LV+D SGSM K+ + A +D + Sbjct: 73 EASKPRVPLNISLVLDRSGSMSGD--------------------KIKYARQAAKFLIDQL 112 Query: 243 DLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + H+ ++ Y RVE + E ++ + + D T+ + M + Sbjct: 113 NSTDHLS------IVNYDDRVEVTSPSQSVKNKEALKAAIDKIHD---RGSTNLSGGMLE 163 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC--DKAK 358 Y + S +K + ++ LTDG N ++ +K +K K Sbjct: 164 GYTQVKSTRKEGYVNR----------------VLLLTDGLANQGITDPLELKRLAENKYK 207 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 E+ I + T + A N L + +Y + + D + +F Sbjct: 208 EDGIALSTFGVGADYNEDLLTMLAENGRANYYFIDSPDKIPQIF 251 >gi|167752251|ref|ZP_02424378.1| hypothetical protein ALIPUT_00494 [Alistipes putredinis DSM 17216] gi|167660492|gb|EDS04622.1| hypothetical protein ALIPUT_00494 [Alistipes putredinis DSM 17216] Length = 344 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 61/189 (32%), Gaps = 43/189 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E + LVVD+S SM D + ++ K A+ D + Sbjct: 86 ESRGIEMMLVVDVSNSMLAE---------------DFQPNRLERTKYAIDKLFDGLKQD- 129 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQIL 305 +GL+ + + + + + R S++ + TD A+ A Sbjct: 130 ------RVGLVVFAGDAVVQLPITSDYRMAKAFARRISPSMVSVQGTDIGQALSLA---- 179 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + P + I+ +TDGE + I+ ++A E I+I Sbjct: 180 ------------TMSFSEKGDNPAGRVIVLITDGEG----HDSGAIEAAERAAEQGIRIF 223 Query: 366 TISINASPN 374 TI I Sbjct: 224 TIGIGTPEG 232 >gi|73960091|ref|XP_537088.2| PREDICTED: similar to chloride channel calcium activated 4 [Canis familiaris] Length = 905 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 48/368 (13%), Positives = 121/368 (32%), Gaps = 69/368 (18%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + +KN++E+ + + G + + +K L + FI Sbjct: 170 VDQPFYISRKNTIEATRCSTHITGINVVFKECQGSSCITRQCRRDSKTGLYEAKCTFIPE 229 Query: 101 HIKESLSGYSAVFYNTEIQNIVN--SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 S + ++ + + ++ ++ T N + N+ +++ + + Sbjct: 230 K---SQTTRDSIMFMQSLDSVTEFCTAETHNTEAPNLQNKMCNSRSTWDIIMNSEDFQNA 286 Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + P ++ +R ++ LV+D SGSM Sbjct: 287 SPMTGTDPP--------PHPTFSLLKSKQR---VVCLVLDKSGSMSSE------------ 323 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK-VR 277 ++ + A LFL I +++ G++ + + + T+ Sbjct: 324 -------DRLFQMNQAAELFLIQI-----IEKGSLTGMVTFESSATIQNYLTEITDHNAY 371 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + ++ T ++ +Q + + + + I+ LT Sbjct: 372 EKILANLPQAAGGGTSICSGLRAGFQAIIHSNQNTSGSE----------------IVLLT 415 Query: 338 DGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNVVN 394 DGE++N +C ++ K++ I TI++ + L L ++ Y + Sbjct: 416 DGEDDNIS-------LCFEEVKKSGSVIHTIAL-GPSAAKELEILSN-MTGGHRFYANKD 466 Query: 395 ADSLIHVF 402 + LI F Sbjct: 467 INGLIDAF 474 >gi|327313515|ref|YP_004328952.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326945266|gb|AEA21151.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 318 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 43/267 (16%), Positives = 74/267 (27%), Gaps = 58/267 (21%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 L + + +++ + + I L +D+S SM Sbjct: 53 IHLPMVLRCMVYTLTVIVLARPQTYNAWDDKDTEGIDIMLTMDVSASM------------ 100 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS----- 270 + +D +M K F+ +GL + + Sbjct: 101 ---LTEDVYPNRMVVAKEVASEFI----SGRPNDN---IGLTIFAGEAFTQCPMTLDHAA 150 Query: 271 --WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 VR L+ T + A L K +S Sbjct: 151 LLNLLHGVRT--DLVTSGLMQDGTAIGMGLANAVSRLKDSKAKS---------------- 192 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG-------QRLLKT 381 K +I LTDG NN + T A++ I+I TI + L Sbjct: 193 --KIVILLTDGSNNAGSISPMTAA--AIARKFGIRIYTIGFGKETGEEIGAIDYKTLQDI 248 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQL 408 VS+ Y + L ++Q+I +L Sbjct: 249 AVSTNGEFYRAQSQAELSRIYQDIDKL 275 >gi|284166763|ref|YP_003405042.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284016418|gb|ADB62369.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 853 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 61/171 (35%), Gaps = 26/171 (15%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ID L + +G+ + + S E ++ V T+ ++ A Sbjct: 668 IDELDPSAD--RVGVYDFASSGRALHPLSDDLESAKESVV----GTAYGGTNMAAGLEAA 721 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + ++ +I L+DG+N+N ++ ++ D++ + Sbjct: 722 LNDYATR----------------GTDDRERIVILLSDGKNSNTANDERMDELADRSDDLD 765 Query: 362 IKIVTISINA----SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + T+ ++A S +L + +Y + D L+ +F+ I Sbjct: 766 YTLHTVGLDALEHDSIPEDKLEGWATETGGNYYQTADPDELLDLFEEIVDE 816 >gi|92117939|ref|YP_577668.1| hypothetical protein Nham_2418 [Nitrobacter hamburgensis X14] gi|91800833|gb|ABE63208.1| conserved hypothetical protein [Nitrobacter hamburgensis X14] Length = 483 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 56/482 (11%), Positives = 138/482 (28%), Gaps = 74/482 (15%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 + R R + + N + IFA++++ L +G + + ++SM++A ++A L Sbjct: 6 IRERIRSSAVRFGQDLRGNIAPIFAIALLPMLGFVGAAVDYTRANAARSSMQAAMDSAAL 65 Query: 63 A------------GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS 110 A ++ + + + ++ ++ + + S Sbjct: 66 MVAKDANAASPQMTADQVTAAAQKYFNALYHNTDAQGASVSAVYTPYNNGTPATVVLSGS 125 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE---HLLNQ 167 N + + ++ N+ N + M + + + + Sbjct: 126 G---NVQTDFMKVVGFPQISFKTNSTATWGNTKLRVAMALDVTGSMSSAGKLVQMKIAAK 182 Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELV---VDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + A + + P ++ V + + + S ++ C Sbjct: 183 KLIDTLKASATAEGDVYISIIPFNVMVNVGANNNTASWLEWEDGSYDNSSSNYGSCSGSG 242 Query: 225 RTK-------MAALKNALLLFLDSIDLLSHVKEDV-YMGLIGYTTRVEKNIE-------P 269 ++K +AA K + S + V G Y T ++ + Sbjct: 243 KSKPNTKSSCIAAGKTWTPKNISSWKGCVTDRGPVSKPGSGDYDTTKDEPVASTPYTLYL 302 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG-------- 321 + + + K +DS+ + + Sbjct: 303 ARNYSTCPSSILPMTSAYDSKESDSSTDDSTLKGKINNLVANGATNQAIAMQMAWMMLQP 362 Query: 322 -------VKIPSLPFQKFIIFLTDGENN-----------NFKSNVNTIKICDKAKENFI- 362 K + II L+DG N + + + +C+ K + I Sbjct: 363 TAPFPAPAKDEKYKYTDAIILLSDGLNTQDRWYGNGSDWSSQVDTRQALLCNNIKNDPIS 422 Query: 363 --------KIVTISINAS-PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 +I TI +N +LK C + + A + F I + + Sbjct: 423 KTDPTRRTRIYTIQVNTDGDPESTVLKNCATDG--FFPTSTASGIASAFAQIGASLSQLR 480 Query: 414 YS 415 + Sbjct: 481 IA 482 >gi|325698104|gb|EGD39985.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK160] Length = 464 Score = 50.3 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 54/161 (33%), Gaps = 28/161 (17%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I V D SGSM +N + + +++M+ LK+ + + + + +V ++ Sbjct: 199 ISFVFDKSGSMSWDLNGNNTNYWG-------PKSRMSILKDKATIMMRDLKDIGNVSVNL 251 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 I + + E GT + + T+ ++ L + + Sbjct: 252 VSFSILGSYVQKDFSELDKGTTTIEASINALQTG---GVTNPGDGLRYGMMSLQNHSAQL 308 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 K+++ LTDG N + + N Sbjct: 309 ------------------KYVVLLTDGIPNAYTVDTNDTSW 331 >gi|262164788|ref|ZP_06032526.1| protein BatA [Vibrio mimicus VM223] gi|262027168|gb|EEY45835.1| protein BatA [Vibrio mimicus VM223] Length = 318 Score = 50.3 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 78/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 85 DLMLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSEFITK-------REG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + + LI T + A + Sbjct: 128 DRVGLILFADHAYLQTPLTLDRQTVISQLNQAVLKLIGTQTAIGEGIGLATKTFID---- 183 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 S Q+ +I L+DG N + + AK+ I T+ + Sbjct: 184 --------------SDAPQRVMILLSDGSNTAGVLDPLEAA--NIAKQYQTTIYTVGVGA 227 Query: 370 ---------------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 A ++ L+T S + ++ N L ++ I+QL Sbjct: 228 GEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRARNQQDLQSIYDTINQL 282 >gi|307153048|ref|YP_003888432.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306983276|gb|ADN15157.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 413 Score = 50.3 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 70/227 (30%), Gaps = 47/227 (20%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + E G + LV+D SGSM + +K A + + Sbjct: 30 SAQGEPGRTLPLNLCLVLDHSGSMGGR--------------------PLETVKKAAIELV 69 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 ++ + +I + R K I P+ G E + + + T +K Sbjct: 70 KQLNPED------RVSVIAFDHR-AKVIVPNQGIEDLNTVIEQIKALRAAGGTAIDEGLK 122 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 K+ + I LTDGEN + N +K+ A + Sbjct: 123 ------LGIKESALGKQERVSQ-----------IFLLTDGENEHG-DNERCLKLAQVASD 164 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNI 405 I + T+ + Q +L+ S + N + + F + Sbjct: 165 YNITLNTLGF-GNHWNQDVLEKIADSAGGSLSYIENPEKALEEFSRL 210 >gi|118349484|ref|XP_001008023.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289790|gb|EAR87778.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 646 Score = 50.3 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 44/266 (16%), Positives = 76/266 (28%), Gaps = 50/266 (18%) Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 T+ + D + + + Q I+ ++ RP + V+D SG Sbjct: 159 KTLNKHFQFNNEQDCNIPIMVSVKTQDSTNDILEEQKEQVKQVEQSRPSIDLVCVIDNSG 218 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 SM K+ +K LL LD ++ + LI + + Sbjct: 219 SMQGE--------------------KIQNVKTTLLQLLDMLNSND------RLSLILFNS 252 Query: 262 RVEKNIEP----SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 T ++ + TD M A+ IL +KR FF Sbjct: 253 YPTLLCNLRKVDDENTPNIQSIINSI---TADGGTDINSGMLMAFNIL---QKRQFFNPV 306 Query: 318 FRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 I L+DG +N + I K I + + +G Sbjct: 307 SS-------------IFLLSDGQDNGADEKIKKYINSNQSLKNECFSIHSFGFGSDHDGP 353 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVF 402 + + C Y V + + F Sbjct: 354 LMNRICQLKDGNFYYVEKINQVDEFF 379 >gi|126352405|ref|NP_001075268.1| calcium-activated chloride channel regulator 1 precursor [Equus caballus] gi|122142874|sp|Q2TU62|CLCA1_HORSE RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Short=eCLCA1; Flags: Precursor gi|46578151|gb|AAT01505.1| putative calcium activated chloride channel-like protein 1 [Equus caballus] Length = 913 Score = 50.3 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 41/233 (17%), Positives = 81/233 (34%), Gaps = 55/233 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL V++ ++ Sbjct: 309 LVLDKSGSMAIG-------------------DRLKRLTQAGKLFLLQ-----TVEQGSWV 344 Query: 255 GLIGYTTRVEKNIEPSWGTEKV-RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + R +T+ + ++ T ++ A+ ++ K Sbjct: 345 GMVTFDSAAYVQSALRQIKGGTDRDALTKSLPTVASGGTSICSGLRSAFTVIRKKYKTDG 404 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISI--N 370 I+ LTDGE+N TI C ++ K++ I T+++ + Sbjct: 405 SE-----------------IVLLTDGEDN-------TISSCFNEVKQSGAIIHTVALGPS 440 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 A+ + L K Y + + LI F +S + ++ +G Sbjct: 441 AAAELEELSKMTGGLQTYASDQAQNNGLIDAFGALSSGNGAVSQHSIQLVSRG 493 >gi|325297739|ref|YP_004257656.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317292|gb|ADY35183.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 332 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 47/314 (14%), Positives = 90/314 (28%), Gaps = 77/314 (24%) Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + T + N + + Y F+ ++ +++ Sbjct: 25 RRKKTEPTMQVSTTRMYMNAPQSWKV-----YLLHAPFLLRIVTFIMIVLVLARPQTTDN 79 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + E I L VD+S SM D K ++ A K F++ Sbjct: 80 WQNTEIEGIDIMLAVDVSTSMLAE---------------DLKPNRLEAAKQVAAQFIN-- 122 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPS-------WGTEKVRQYVTRDMDSLILKPTDST 295 +GL + + ++ + + LI T Sbjct: 123 --GRPNDN---IGLTIFAGEAFTQCPLTIDHGVLLNLFGSIKGDIAQR--GLIEDGTAIG 175 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + + L K +S K II LTDG NN + T + Sbjct: 176 MGLANSISRLKDSKAKS------------------KVIILLTDGSNNRGDISPLTAA--E 215 Query: 356 KAKENFIKIVTIS-------------------INAS--PNGQRLLKTCVSSPEYHYNVVN 394 AK+ I++ TI +N + Q L++ ++ ++ + Sbjct: 216 IAKQFGIRVYTIGVGTNGTAPYPMPTYAGVQYVNVPVEIDEQTLIQIASTTNGNYFRATS 275 Query: 395 ADSLIHVFQNISQL 408 L V++ I +L Sbjct: 276 NSKLKEVYEEIDKL 289 >gi|241204947|ref|YP_002976043.1| hypothetical protein Rleg_2227 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240858837|gb|ACS56504.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 429 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 67/450 (14%), Positives = 133/450 (29%), Gaps = 100/450 (22%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I+ NF I+ AL V+ G + + + +A +AA + + Sbjct: 9 ISDRSGNFGIMTALLVVPLFGAAGMAVDFAHALSLRTQLYAAADAAAVGS---IAEKSGA 65 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + N DDA+ + I E+ ++ I +T A Sbjct: 66 VAAAMTMSGNGTISLGKDDARSIFMSQIS------------GELTDVQVDLGIDVTKTA- 112 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 N+L+S V S F+ +L + I E Sbjct: 113 NKLNSQ---------VSFSATVPTTFM-RVLGRDSIT-----ISGTATAEYQTASFMDFY 157 Query: 195 LVVDLSGSMH--------CAMNSDPEDVNSAPICQDKKRT-------------KMAALKN 233 +++D + SM M + D + + + ++ ++ Sbjct: 158 ILLDNTPSMGVGATATDVSTMEKNTSDTCAFACHETQNNNNYYNLAKKLGVSMRIDVVRQ 217 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL--ILKP 291 A + MG+ + T+ E ++ D L + Sbjct: 218 ATKELTVTAKSTRVSSNQFRMGVYTFGTKAEDAKL---------TTISDPTDDLDKVRSY 268 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS-------LPFQKFIIFLTDGENNNF 344 TD+ M +Q +D++ SF + + I + QK + F++DG ++ Sbjct: 269 TDAVDLMTIPFQGYNNDQQTSFDSALTQMKTIITTPGDGSTATTPQKILFFVSDGVGDSE 328 Query: 345 KSNVNTIKI-------------CDKAKENFIKI---VTISINASPNG------------- 375 K T K+ C K+ I+I T + N Sbjct: 329 KPKGCTKKLTGNRCQEPIDTSFCQPLKDKSIRIAVLYTTYLPLPKNSWYNTWIKPFQGEI 388 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 ++ C SP ++ V + + + + Sbjct: 389 PTKMQACA-SPGLYFEVTPTEGIADAMKAL 417 >gi|120437734|ref|YP_863420.1| von Willebrand factor(vWA) type A domain-containing protein [Gramella forsetii KT0803] gi|117579884|emb|CAL68353.1| membrane protein containing von Willebrand factor(vWA) type A domain [Gramella forsetii KT0803] Length = 335 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 42/258 (16%), Positives = 77/258 (29%), Gaps = 67/258 (25%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + + I + +D+S SM D + ++ A K Sbjct: 74 MARPRSVDVSTQTSSTQGIDIVMAIDVSASMLAR---------------DLQPNRLDATK 118 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKP 291 N F+ + +GL+ Y + V + + ++++ Sbjct: 119 NVAEEFIQ-------DRPGDRIGLVVYAGESFTKTPITSDKAIVLDALEDIEYNNVLENG 171 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A + S K II LTDG NN + +T Sbjct: 172 TAIGSGLATAVNRIKDSDAES------------------KVIILLTDGVNNAGFIDPSTA 213 Query: 352 KICDKAKENFIKIVTISI----NA-------SPNGQ------------RLLKT-CVSSPE 387 + A E IK+ TI + A NG+ LLK ++ Sbjct: 214 S--ELAVEFGIKVYTIGVGSNGMALSPVGVNPANGRLRFGNVQVEIDEDLLKEIAAATGG 271 Query: 388 YHYNVVNADSLIHVFQNI 405 ++ N + L ++ I Sbjct: 272 KYFRATNNEKLEEIYAEI 289 >gi|307591433|ref|YP_003900232.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986287|gb|ADN18166.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 491 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 70/222 (31%), Gaps = 43/222 (19%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L++D SGSM K++ +K A F+ DL+++ Sbjct: 54 IVLLIDCSGSMDGN--------------------KLSEVKTAATSFVQRQDLITN----- 88 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + ++G+ + V+ + ++ + D T A+ A L + Sbjct: 89 RIAVMGFGSGVQLGTPLTSDVNVLQTAIANLYDG---GGTMMDQALTAATDQLHNASAS- 144 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 + + I+ TDG + NT+ A+ I I +++ Sbjct: 145 -----LESAIPSGE---NQHILLFTDGVAADP---YNTLVAGQTAQNAQINI--VAVATG 191 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 L P + N + FQ + + ++ Sbjct: 192 DADTNFLSQLTGDPNLVFYA-NTGNFDAAFQAAEKAIYSKQL 232 >gi|329896848|ref|ZP_08271743.1| BatA [gamma proteobacterium IMCC3088] gi|328921553|gb|EGG28934.1| BatA [gamma proteobacterium IMCC3088] Length = 328 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 77/229 (33%), Gaps = 43/229 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ + L VD+SGSM K+ +++A K L F+ Sbjct: 90 EKTARDLMLAVDISGSME---------ATDFVDATGKQTDRLSAAKQVLKQFVA------ 134 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E +GLI + + + E + + ++ T A+ + Sbjct: 135 -GREGDRLGLIVFGSAAYLQAPFTDDRETWLALLDESIVNMAGPSTALGDAIGLS----- 188 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 +F K + +I LTDG N+ S V + AK + I T Sbjct: 189 -------IAHFRESKTK------NRVLIVLTDG--NDTGSKVPPLDAAQVAKAEGVTIYT 233 Query: 367 ISINASPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +++ G+ L ++ +N + +L +Q I +L Sbjct: 234 VAVGDPETVGEEALDLEVLDSIAQTTGGVSFNAADLKALQETYQRIDEL 282 >gi|332560892|ref|ZP_08415210.1| von Willebrand factor, type A [Rhodobacter sphaeroides WS8N] gi|332274690|gb|EGJ20006.1| von Willebrand factor, type A [Rhodobacter sphaeroides WS8N] Length = 341 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 63/177 (35%), Gaps = 37/177 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +D+SGSM + T++ A+K F++ ++ Sbjct: 103 IVLTLDMSGSMLIEDFDIDGVQS----------TRLEAVKRVARSFVEE-------RQGD 145 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + R ++ V + + + + T + A + +T S Sbjct: 146 RIGLVLFANRAYVAAPLTFDLAAVGRAIEEASIGITGRSTAIADGLGLALKRVTESGAAS 205 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + I+ L+DG++N + + + A + ++I TI++ Sbjct: 206 ------------------RVIVLLSDGQDNAHQIDARQVA--GLAARHGVRIHTIAL 242 >gi|299139026|ref|ZP_07032203.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298599180|gb|EFI55341.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 318 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 64/192 (33%), Gaps = 37/192 (19%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 A K + L D + L+ + V++ + + +K+ + R Sbjct: 111 EAAKKFMASLLRKQDQID---------LMDFADDVDELVSFTSDVQKIDSGLGRIHHG-- 159 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+ A Q L G S ++ ++ +TDGEN + Sbjct: 160 -DATALYDAVYLASQRL---------------GETPTSAGQRRVLVLITDGENTTHHGSY 203 Query: 349 NTIKICDKAKENFIKIVT---ISINASPN-----GQRLLKTCVSSPEYHYNVVNADSLIH 400 + ++A+ I + ++A L++ + +Y V + L Sbjct: 204 D--AALEQAQRAGAMIYALIIVPVSADAGRNTGGEHALIQLARDTGGKYYYVEDKHDLAP 261 Query: 401 VFQNISQLMVHR 412 FQ++S + + Sbjct: 262 AFQHVSDDLRTQ 273 >gi|256820507|ref|YP_003141786.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] gi|256582090|gb|ACU93225.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] Length = 333 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 72/245 (29%), Gaps = 65/245 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I + +D+S SM D K + ALK F+ D + Sbjct: 92 IDIVMAIDVSASMLSK---------------DLKPNRFEALKKVASQFVK--DRPND--- 131 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ Y + + ++ I T + A L K Sbjct: 132 --RIGLVIYAGESYTKTPVTTDKGIILNALSELTYGQIEDGTAIGMGLATAVNRLKESKA 189 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 +S + II LTDG NN + T + A E I++ TI I Sbjct: 190 KS------------------RVIILLTDGVNNTGFIDPQTAA--ELAAEYGIRVYTIGIG 229 Query: 371 ASPNGQR----------------------LLKTCVS-SPEYHYNVVNADSLIHVFQNISQ 407 ++ L+K + ++ + L ++ I++ Sbjct: 230 SNGTALSPYALNPDGSIMYRMLQVEIDEPLMKKIAEVTHGRYFRATDNQKLQQIYDEINK 289 Query: 408 LMVHR 412 + + Sbjct: 290 METTK 294 >gi|291547618|emb|CBL20726.1| fibro-slime domain [Ruminococcus sp. SR1/5] Length = 1928 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 60/379 (15%), Positives = 112/379 (29%), Gaps = 90/379 (23%) Query: 96 RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 I N E + S V T + +T A + SS ++ + Sbjct: 1036 NKILNETPEEIINSSMVTSKTAKVKDWDQRTYDITINATSTSTSSIIETKTSVADIML-- 1093 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL--IELVVDLSGSMHCAM------ 207 + + + + +F+ ++ + + +++ GS Sbjct: 1094 --VLDVSGSMGEDITSYSYTFVANNTSEARDDKKLLNRNVTYYIEVDGSYKEMWYYSSYN 1151 Query: 208 ---------NSDPEDVNSAPIC-----QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 +SD + C T++ ALKNA+ F+D S + Sbjct: 1152 KGWRVGPRGSSDDAAKDKYNNCKIYTRTSTTETRLDALKNAVNQFIDDTAKKSPNSK--- 1208 Query: 254 MGLIGYTTRVEKNIEPSW-GTE------------KVRQYVTRDMDSLILKPTDSTPAMKQ 300 +G+ +++ + N GT KV + D TD ++ Sbjct: 1209 IGITVFSSTDDYNRPYGNHGTSVSLGEVGTADSAKVTELKNFVKDLKANGGTDPAVGLED 1268 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE--------NNNFKSNVNT-- 350 A L + + K+++ TDG+ N+N + N T Sbjct: 1269 AKNKLDAMVDTN-----------------PKYVVLFTDGKPTGGGNKWNSNAQKNAETQA 1311 Query: 351 -------IKICDKAKENFIKIVTISINASPNG---QRLLKT----------CVSSPEYHY 390 D AK + TI + G + L SS + Sbjct: 1312 GELKTGLRNNVDNAKNP-YTVYTIGFALNDEGDRAKTFLSGGTYDGKKDPGIASSSDCAK 1370 Query: 391 NVVNADSLIHVFQNISQLM 409 +A SL +FQ+IS + Sbjct: 1371 TADDAASLTQIFQSISSTI 1389 >gi|212722920|ref|NP_001131192.1| hypothetical protein LOC100192500 [Zea mays] gi|194690832|gb|ACF79500.1| unknown [Zea mays] Length = 650 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 34/174 (19%), Positives = 66/174 (37%), Gaps = 35/174 (20%) Query: 180 LLRIEMGERPI----FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 L+R+E RP + V+D+SGSM + +AP + + +++ LK A Sbjct: 63 LVRVEAPARPEARIPIDVVAVLDVSGSM--------NNPAAAPTERTRTTSRLDLLKTAA 114 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEK--NIEPSWGTEKVRQYVTRDMDSLIL-KPT 292 + ++ + ++ ++ R + + + T R+ R +D L T Sbjct: 115 KFMVAKLEDGD------RLSIVAFSDRPVRELSSGLLYMTADGRRNAIRSLDQLEARGGT 168 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 PA ++A ++L + FI+ LTDG + S Sbjct: 169 ALVPAFEEAVKVLDGRQGDGGDRLG--------------FIVLLTDGAEDASGS 208 >gi|115963085|ref|XP_001182555.1| PREDICTED: similar to calcium activated chloride channel 1 precursor [Strongylocentrotus purpuratus] Length = 1245 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 37/281 (13%), Positives = 92/281 (32%), Gaps = 47/281 (16%) Query: 97 FIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 I + S S + + I N + + ++ N + +N + + Sbjct: 431 RITGADQASGSIMNGPLTYSNIVNFCDDDVNGIGNLHNREAPNKHNRLCNSRSAWEVMRE 490 Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 F ++ N + L++ G+ + LV+D SGSM Sbjct: 491 HEDFNDNSPPSNLNDAQIEPSFDLVQASTGDECR--VVLVLDTSGSMG------------ 536 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEK 275 + K+ + A ++ V + + +G++ +T + T+ Sbjct: 537 ----TSNRIDKVNSAATA---------FVNLVDDGISIGIVTFTGSPTTRHALTQINTQA 583 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 R + T ++Q ++L + S I+ Sbjct: 584 DRDSLRDIFQLTASGGTCIGCGLEQGLEVLMAHPSGSADGG---------------IIVL 628 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 +TDG+++ ++++ I ++ +++ T++I G+ Sbjct: 629 MTDGQDSGIQNHI----IRQTLQDMGVRVNTVAIGEDAYGE 665 >gi|115678877|ref|XP_794839.2| PREDICTED: similar to calcium activated chloride channel 1 precursor [Strongylocentrotus purpuratus] Length = 1031 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 37/281 (13%), Positives = 92/281 (32%), Gaps = 47/281 (16%) Query: 97 FIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 I + S S + + I N + + ++ N + +N + + Sbjct: 217 RITGADQASGSIMNGPLTYSNIVNFCDDDVNGIGNLHNREAPNKHNRLCNSRSAWEVMRE 276 Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 F ++ N + L++ G+ + LV+D SGSM Sbjct: 277 HEDFNDNSPPSNLNDAQIEPSFDLVQASTGDECR--VVLVLDTSGSMG------------ 322 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEK 275 + K+ + A ++ V + + +G++ +T + T+ Sbjct: 323 ----TSNRIDKVNSAATA---------FVNLVDDGISIGIVTFTGSPTTRHALTQINTQA 369 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 R + T ++Q ++L + S I+ Sbjct: 370 DRDSLRDIFQLTASGGTCIGCGLEQGLEVLMAHPSGSADGG---------------IIVL 414 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 +TDG+++ ++++ I ++ +++ T++I G+ Sbjct: 415 MTDGQDSGIQNHI----IRQTLQDMGVRVNTVAIGEDAYGE 451 >gi|163800205|ref|ZP_02194106.1| hypothetical protein 1103602000595_AND4_05979 [Vibrio sp. AND4] gi|159175648|gb|EDP60442.1| hypothetical protein AND4_05979 [Vibrio sp. AND4] Length = 334 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 79/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L+VDLSGSM +D + ++ A+K L F++ ++ Sbjct: 98 DMMLLVDLSGSMQKEDMNDNGEY----------IDRLTAVKRVLSDFVEK-------RQG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + V Q + + + L+ + T + Sbjct: 141 DRLGVVLFGDHAYLQTPLTADRRTVMQQINQAVIGLVGERTAIGDGI------------- 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + S Q+ +I L+DG N + ++ + A++ I T+ + A Sbjct: 188 -----GLGTKTFVDSDAPQRVMILLSDGSNTAGV--LEPLEATEIAQKYNATIYTVGVGA 240 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + Q L K + ++ +A L ++ I+QL Sbjct: 241 GEMMVKDFFMTRKVNTASDLDEQTLTKIAEMTGGKYFRARDAKELEAIYDTINQL 295 >gi|332664649|ref|YP_004447437.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332333463|gb|AEE50564.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 328 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 41/252 (16%), Positives = 76/252 (30%), Gaps = 65/252 (25%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 +L+ E + I L +DLS SM QD + ++ K + F+ Sbjct: 76 VLKEEKIKANGIDIMLSMDLSSSMLA---------------QDFEPNRLEVSKKMAIEFV 120 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + +GL+ + + + + ++ + + T + Sbjct: 121 K-------GRPHDRIGLVVFAGEAFTQCPLTTDHKILETFLEQLECGNLEDGTAIGMGLA 173 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A L +S K II LTDG NN T + AKE Sbjct: 174 GAVNRLKKSPAKS------------------KVIILLTDGVNNVGYFKPLTAG--ELAKE 213 Query: 360 NFIKIVTISI------NASPN----------------GQRLLKTCVS-SPEYHYNVVNAD 396 IK+ +I + + + LL+ + ++ N Sbjct: 214 LGIKVYSIGVGTIGEALTPVSRLSDGSFFLDYAQVEIDEELLREIARMTGGQYFRAKNNQ 273 Query: 397 SLIHVFQNISQL 408 L ++ I +L Sbjct: 274 DLRQIYNTIDRL 285 >gi|332817900|ref|XP_526306.3| PREDICTED: collagen alpha-5(VI) chain isoform 2 [Pan troglodytes] Length = 2526 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 50/356 (14%), Positives = 116/356 (32%), Gaps = 58/356 (16%) Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIV 122 + D + +S + L A F+ + S + ++++ Sbjct: 689 QEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGARLGAKKFLILITDGVAQDDVRDPA 748 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 R + + + ++N + + +S + ++ +HL + +K++ + AL Sbjct: 749 RILRGKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFDHL--KALERKLIFHVCAL-- 804 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + + V+D SGS+ QD + L + + Sbjct: 805 HDCKRITLLDVVFVLDHSGSI-------------KKQYQDHM----------INLTIHLV 841 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--LILKPTDSTPAMKQ 300 ++ V G + Y+ + T R + ++ T + A+K Sbjct: 842 KKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALKH 899 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A L ++ + ++ +I +TDGE+++ +T + + Sbjct: 900 A-NALFTE-----------EHGSRIKQNVKQMLIVITDGESHDHDQLNDTAL---ELRNK 944 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I I + + Q+ L+ + V N D L VF + + M V Sbjct: 945 GITIFAVGVG--KANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTEAPEV 998 >gi|241767791|ref|ZP_04765389.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] gi|241361168|gb|EER57806.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] Length = 277 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 40/257 (15%), Positives = 83/257 (32%), Gaps = 62/257 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+SGSM D + ++ A +NA F+ + Sbjct: 17 TIMLAMDVSGSM---------------RATDVQPDRITAAQNAAKAFIAEL------PRH 55 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V +G++ + + P+ E + + + + + T + + A L D Sbjct: 56 VRVGIVAFAGSAQLAQLPTQSHEDLAKAIDSFQ---LQRGTATGNGIMLALATLFPDAGI 112 Query: 312 SFFTNFFRQGVK-------IPSLPFQKF------------IIFLTDGENNNFKSNVNTIK 352 RQ + P + F II LTDG+ + + Sbjct: 113 DIAALGGRQAMHPRPLDEVTRQDPAKPFTPVAPGSYTSAAIIMLTDGQRTTGVDPLEAAQ 172 Query: 353 ICDKAKENFIKIVTISINASPN--------------GQRLLKTCVS-SPEYHYNVVNADS 397 A + +++ T+ + + LK + +++ A Sbjct: 173 W---AADRGVRVYTVGVGTVQGETIGFEGWSMRVRLDEETLKAVAGRTHAEYFHAATAAD 229 Query: 398 LIHVFQNI-SQLMVHRK 413 L V++ + S+L V ++ Sbjct: 230 LKKVYETLSSRLAVEKR 246 >gi|183583553|ref|NP_694996.5| collagen alpha-5(VI) chain [Homo sapiens] Length = 2526 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 50/356 (14%), Positives = 116/356 (32%), Gaps = 58/356 (16%) Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIV 122 + D + +S + L A F+ + S + ++++ Sbjct: 689 QEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGARLGAKKFLILITDGVAQDDVRDPA 748 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 R + + + ++N + + +S + ++ +HL + +K++ + AL Sbjct: 749 RILRGKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFDHL--KALERKLIFRVCAL-- 804 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + + V+D SGS+ QD + L + + Sbjct: 805 HDCKRITLLDVVFVLDHSGSI-------------KKQYQDHM----------INLTIHLV 841 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--LILKPTDSTPAMKQ 300 ++ V G + Y+ + T R + ++ T + A+K Sbjct: 842 KKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALKH 899 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A L ++ + ++ +I +TDGE+++ +T + + Sbjct: 900 A-NALFTE-----------EHGSRIKQNVKQMLIVITDGESHDHDQLNDTAL---ELRNK 944 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I I + + Q+ L+ + V N D L VF + + M V Sbjct: 945 GITIFAVGVG--KANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTEAPEV 998 >gi|149632101|ref|XP_001514410.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 2392 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 59/382 (15%), Positives = 127/382 (33%), Gaps = 71/382 (18%) Query: 44 LDWHYKKNSMESANNA-AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHI 102 +D + KN + A L G + + L ++ F+ R + Sbjct: 488 MDRYSNKNDLVKAVENIRQLGGNTDTGAALDKMLPLFQRARQQRAR------------KV 535 Query: 103 KESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 + L + + ++ R ++ + +N+T + S Y + + Sbjct: 536 PQHLVVLTDGLSHDSVREPAGRLRGDNINVYAIGVKEANHTQLEEIAGSDSRVYYVHNFD 595 Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 L + + + ++ E + I +VD SGS+ D + + Sbjct: 596 SLKDIKN-----RVVRSICSEEACKEMSADIMFLVDSSGSIG----GDNFEKMKTFMKNV 646 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 RTK+ A V +GL+ ++ ++ + + ++ Sbjct: 647 VNRTKIGA-------------------NQVQVGLVQFSDINKEGFQL--NQYDTKTKISD 685 Query: 283 DMD--SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 +D SLI + T A+ F + + K +KF++ LTDG+ Sbjct: 686 AIDGLSLIGRGTLIGGAL--------------TFVSDYFSVSKGARPNVKKFLVLLTDGK 731 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL-- 398 + V + +++ + I ++ + S Q L+ + + V N D L Sbjct: 732 ---SQDAVKEAAV--ALRQDGVIIYSVGVFGSEYSQ--LEEISGRSDMVFYVENFDILKP 784 Query: 399 ---IHVFQNISQLMVHRKYSVI 417 + VF S V ++ V+ Sbjct: 785 VEDVLVFGICSPYEVCKRIEVL 806 >gi|119026487|ref|YP_910332.1| truncated clumping factor [Bifidobacterium adolescentis ATCC 15703] gi|118766071|dbj|BAF40250.1| truncated clumping factor [Bifidobacterium adolescentis ATCC 15703] Length = 431 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 25/135 (18%), Positives = 46/135 (34%), Gaps = 13/135 (9%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 ++ + R + LV+D SGSM+ M+ S C ++ A Sbjct: 306 TLNLNVVGKDTRESHETTEKIEVVLVLDTSGSMNYCMDG------SQRRCNKSNPKRLTA 359 Query: 231 LKNALLLFLDSI----DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 LK A F+D+ D + V + + + + T ++ V+R + Sbjct: 360 LKEAATSFIDATETTNDTIQDENSKVRIAIAQFGQTSGVVSSLTSDTAALKSSVSRLSAN 419 Query: 287 LILKPTDSTPAMKQA 301 T + M A Sbjct: 420 ---GATPADKGMAAA 431 >gi|157818579|ref|NP_001100919.1| calcium-activated chloride channel regulator 1 [Rattus norvegicus] gi|149026146|gb|EDL82389.1| chloride channel calcium activated 3 (predicted) [Rattus norvegicus] Length = 910 Score = 49.9 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 77/217 (35%), Gaps = 54/217 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ + A LFL V++ ++ Sbjct: 310 LVLDKSGSMQSD-------------------NRLNRMNQASRLFLLQ-----TVEQGSWV 345 Query: 255 GLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + + R + + + ++ T ++ A+ + K+ + Sbjct: 346 GMVTFDSTAYVQSELTQLNSGADRDLLIKRLPTVASGGTSICSGLQAAFTSI----KKKY 401 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C D K + I T+++ Sbjct: 402 PTDGAE-------------IVLLTDGEDN-------TISSCFDLVKNSGAIIHTVAL-GP 440 Query: 373 PNGQRL--LKTCVSSPEYHYNVV-NADSLIHVFQNIS 406 + L L + + + + L+ F +S Sbjct: 441 SAAKELEQLSKMTGGLQTYSSDQIQNNGLVDAFAALS 477 >gi|68536401|ref|YP_251106.1| hypothetical protein jk1316 [Corynebacterium jeikeium K411] gi|260577533|ref|ZP_05845473.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] gi|68264000|emb|CAI37488.1| hypothetical protein jk1316 [Corynebacterium jeikeium K411] gi|258604337|gb|EEW17574.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] Length = 663 Score = 49.9 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 42/250 (16%), Positives = 75/250 (30%), Gaps = 64/250 (25%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + +V+D S SM D T+M A K A Sbjct: 52 GSTSSNDATGGDSKVAVVLDASDSMAEKDTGDGG-------------TRMDAAKKAAN-- 96 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTR------------VEKNIEPSWGTEKVRQYVTRDMDS 286 D + + + +I Y + + S G K + Sbjct: 97 ----DTIDTLADSAQTAVIAYGSEESNAPDNRDKGCQDITTLASLGNNKPEDLEDKINGL 152 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T A+K+A + L S KR+ II ++DG Sbjct: 153 EPKGYTPIGNAIKKAAEELGSSGKRN--------------------IILVSDG-----ID 187 Query: 347 NVNTIKICDKAKE---NFI--KIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLI 399 +CD A++ + I I T+ Q+ L+ C+S S + + + ++L Sbjct: 188 TCAPPPVCDVAEDIAGDGIDLAIHTVGFKVDDKAQKELE-CISEVSGGTYTSADDTEALT 246 Query: 400 HVFQNISQLM 409 + +Q + Sbjct: 247 EALTDAAQRV 256 >gi|260430630|ref|ZP_05784603.1| von Willebrand factor, type A [Citreicella sp. SE45] gi|260418659|gb|EEX11916.1| von Willebrand factor, type A [Citreicella sp. SE45] Length = 318 Score = 49.9 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 78/217 (35%), Gaps = 50/217 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +DLSGSM + + +++AA+++ F+ + Sbjct: 88 DIVLALDLSGSME----------REDFVLDGRTVSRLAAVQDVAAQFVR-------GRTG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + R P+ V Q + + K T + A + L Sbjct: 131 DRVGLVVFGDRAYVAAAPTHDVGAVAQVIGTLQIGVSGKATAIADGLGLAIRRL------ 184 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + II L+DG++ + A+E +++ TI++ Sbjct: 185 ------------RGREAESRVIILLSDGQDTTGAVDPVAAAQ--AAQELGMRVYTIAL-- 228 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G L SSP+ + V++++L + I+++ Sbjct: 229 ---GPADL---ASSPDA-RDAVDSETL----RRIAEV 254 >gi|168698099|ref|ZP_02730376.1| von Willebrand factor type A domain protein [Gemmata obscuriglobus UQM 2246] Length = 311 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 32/229 (13%), Positives = 72/229 (31%), Gaps = 51/229 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +R + IE+ VD+SGSM+ + T+ A+ F Sbjct: 79 KRKLTNIEICVDVSGSMNNPFG---------------RATRYDGAMEAVTAFTSY----- 118 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKV---RQYVTRDMDSLILKPTDSTPAMKQAYQ 303 ++ GL + V + + ++ T A++ Sbjct: 119 --RQGDAFGLTFFGNEVLHWCPLTTDVSAINCATPFMRPGQLPPWFGGTLIAKALRACKA 176 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + I+ +TDG++ +F +N ++ ++ K I Sbjct: 177 ELIKRPEG------------------DRMIVLITDGDSQDF-ANGADAEVAEELKAEGIT 217 Query: 364 IVTISINASPNGQR-LLKT------CVSSPEYHYNVVNADSLIHVFQNI 405 + + I Q +++ + + + ++L VF+ I Sbjct: 218 VFAVVIGNDRQFQNPIIRNGSVQTVTARTGGESFEAGDPNALATVFKRI 266 >gi|119505575|ref|ZP_01627647.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2080] gi|119458684|gb|EAW39787.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2080] Length = 316 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 71/225 (31%), Gaps = 43/225 (19%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 ++ + + VDLSGSM K ++ A+K L Sbjct: 76 QQKAGRDLMIAVDLSGSME---------TEDFSQADGKPADRLTAVKTVLRQL------- 119 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 ++ + +GLI + + + + + T A+ A L Sbjct: 120 ANERAGDRLGLIVFGSSAYLQSPFTEDHRTWLLLLNETRIRMAGPSTALGDAVGLAI-KL 178 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 D + + ++ LTDG N+ S V + A I+I Sbjct: 179 FKDAETEH-----------------RVLLLLTDG--NDTGSLVPPVDAARVAATEDIRIY 219 Query: 366 TISINASPN-GQR------LLKTCVSSPEYHYNVVNADSLIHVFQ 403 I++ G+ L + + + ++++ LI VF+ Sbjct: 220 PIAVGDPTAVGEEAIDLDTLARMAEVTGGQAFEALSSEDLIAVFK 264 >gi|154492261|ref|ZP_02031887.1| hypothetical protein PARMER_01895 [Parabacteroides merdae ATCC 43184] gi|154087486|gb|EDN86531.1| hypothetical protein PARMER_01895 [Parabacteroides merdae ATCC 43184] Length = 328 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 44/281 (15%), Positives = 83/281 (29%), Gaps = 64/281 (22%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + FI + I++ + + I L +D+S SM Sbjct: 55 HIPFILRMAAVAVLIVILARPQSTNSWQNSSTEGIDIVLAMDISTSMMA----------- 103 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 QD K ++ A K+ F++ +GL+ + + + Sbjct: 104 ----QDLKPNRLEASKDVASAFIN----GRPNDN---IGLVVFAAESFTQCPLTTDHTVL 152 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +I T + A + + +S K II L Sbjct: 153 LNLFKDVQPGIIQDGTAIGLGLANAVSRIKDSQAKS------------------KVIILL 194 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS-----------------PNG 375 TDG NN + T + AK +++ TI + A + Sbjct: 195 TDGVNNQGEIAPVTAA--EIAKTFGVRVYTIGVGTQGKAPYPFQTAFGVQYMDVDVEIDE 252 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 L + ++ ++ + SL ++ I + M K SV Sbjct: 253 PTLKQIAATTGGQYFRATDNASLKEIYSEIDK-MEKTKISV 292 >gi|149732066|ref|XP_001492602.1| PREDICTED: similar to complement factor B [Equus caballus] Length = 768 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 47/269 (17%), Positives = 91/269 (33%), Gaps = 47/269 (17%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 YD + E L + + + A GE+ ++V+D SGSM+ M D D Sbjct: 224 YDTPAEVAEAFL--SSLTETIEGVEAEEGHSPGEQQK--RKIVLDPSGSMNIYMVLDGSD 279 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT--------RVEK 265 A A D I+ ++ GL+ Y T E+ Sbjct: 280 SIGAHNFT-----------RAKNCLRDFIEKVASYGVKPRYGLVTYATVPKVLIRVSQER 328 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + + W TEK+ + D + T++ A++ Y ++ ++ Sbjct: 329 SSDADWVTEKLNE--ISYEDHKLKTGTNTRKALQAVYSMM----------SWEGNAPPEG 376 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIK----ICDKAKEN--------FIKIVTISINASP 373 + I+ +TDG +N V I + D ++ I + + + Sbjct: 377 WNRTRHVILLMTDGLHNMGGDPVPVIHEIRDLLDIGRDRKNPREDYLDIYVFGVGPLVNQ 436 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 L + ++ + V + ++L VF Sbjct: 437 ENINALASKKDGEQHVFKVKDMENLEDVF 465 >gi|146337718|ref|YP_001202766.1| hypothetical protein BRADO0587 [Bradyrhizobium sp. ORS278] gi|146190524|emb|CAL74523.1| conserved hypothetical protein; putative vWFA domain [Bradyrhizobium sp. ORS278] Length = 442 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 51/471 (10%), Positives = 123/471 (26%), Gaps = 100/471 (21%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + + + N ++ FA+ + + +G + + + +++A +AA + Sbjct: 1 MRSLSLLLSRFRRNAGGNVAVTFAIVCVPVITAVGCGVDYSRTNQMRAKLQAAVDAASVG 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 S+ G ++ A DDA++ ++ Sbjct: 61 AVSRTSPAFIAAGAMT---TDGVIAAGNDDARKIFNGNMS----------------GTTG 101 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + S+T T+ ++ V T ++ + S Sbjct: 102 YTLDSLTPEVKKTGSVLTATVSFSATVPT-------LFMSIVGYKTMSLQGSST-----A 149 Query: 184 EMGERPIFLIELVVDLSGSMHCAMN----------SDPEDVNSAPICQDKKR-------- 225 + L++D S SM A + + + D Sbjct: 150 KASMPKYIDFYLLLDNSPSMGVAATPADVTKMVSATSDKCAFACHDYNDANNYYNLAKTL 209 Query: 226 ---TKMAALKNALLLFLDSIDLLSHVKEDVYM-----GLIGYTTRVEKNIEPSWGTEKVR 277 T++ L++A +D+ M G T + S + Sbjct: 210 GVTTRIDVLRSATQQLMDTAQQTQTYSNQFRMAIYDFGASSKTIGLRALFALS---SSLT 266 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + ++ + + + + G P K++ F++ Sbjct: 267 SAKSAAGNIDLMGVYGNNDSFTADKDTPYTTALPAINNEIATPGDGTSGSPL-KYLFFVS 325 Query: 338 DGENNNFKS-------------NVNTIKICDKAKENFIKI---VTISINASPNGQRL--- 378 DG + + + +C K IKI T + N + Sbjct: 326 DGVADESNAACLKPKASGNRCQSPINPALCTALKNRGIKIAVLYTTYLQLPTNSWYMSWI 385 Query: 379 -------------------LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 ++ C S +++ V + + + V Sbjct: 386 DPFNKGPFGPSPNSEIAQNMQACASD-GFYFEVSPTQGIADAMNALFKKAV 435 >gi|218261917|ref|ZP_03476585.1| hypothetical protein PRABACTJOHN_02256 [Parabacteroides johnsonii DSM 18315] gi|218223693|gb|EEC96343.1| hypothetical protein PRABACTJOHN_02256 [Parabacteroides johnsonii DSM 18315] Length = 328 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 44/281 (15%), Positives = 83/281 (29%), Gaps = 64/281 (22%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + FI + I++ + + I L +D+S SM Sbjct: 55 HVPFILRMAAVAVLIVILARPQSTNSWQNSSTEGIDIVLAMDISTSMMA----------- 103 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 QD K ++ A K+ F++ +GL+ + + + Sbjct: 104 ----QDLKPNRLEASKDVASAFIN----GRPNDN---IGLVVFAAESFTQCPLTTDHTVL 152 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +I T + A + + +S K II L Sbjct: 153 LNLFKDVQPGIIQDGTAIGLGLANAVSRIKDSQAKS------------------KVIILL 194 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NAS-----------------PNG 375 TDG NN + T + AK +++ TI + A + Sbjct: 195 TDGVNNQGEIAPVTAA--EIAKTFGVRVYTIGVGTQGKAPYPFQTAFGVQYMDVDVEIDE 252 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 L + ++ ++ + SL ++ I + M K SV Sbjct: 253 PTLKQIAATTGGQYFRATDNASLKEIYSEIDK-MEKTKISV 292 >gi|294141682|ref|YP_003557660.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578720|dbj|BAG66046.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328151|dbj|BAJ02882.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 334 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 34/235 (14%), Positives = 74/235 (31%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + + + +++ + F++ ++ Sbjct: 84 DLMIAVDLSGSM----------QIEDMVLNGQAVDRFTMIQDVVSDFIER-------RKG 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ + K Sbjct: 127 DKLGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALGVKRFDMVDKS 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + ++ LTDG NN+ + A + +KI I + A Sbjct: 187 N------------------RILVLLTDGSNNSGSISPEQAA--AIAAKRGVKIYAIGVGA 226 Query: 372 SPNGQ------------------RLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + +L + ++ ++ L ++Q I +L Sbjct: 227 DVMERRSIFGTERVNPSMDLDEAQLSSLAKITGGLYFRARSSQDLQQIYQEIDKL 281 >gi|189082691|sp|A8TX70|CO6A5_HUMAN RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; AltName: Full=von Willebrand factor A domain-containing protein 4; Flags: Precursor gi|158828630|gb|ABW81241.1| collagen XXIX alpha 1 [Homo sapiens] Length = 2615 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 50/356 (14%), Positives = 116/356 (32%), Gaps = 58/356 (16%) Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIV 122 + D + +S + L A F+ + S + ++++ Sbjct: 689 QEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGARLGAKKFLILITDGVAQDDVRDPA 748 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 R + + + ++N + + +S + ++ +HL + +K++ + AL Sbjct: 749 RILRGKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFDHL--KALERKLIFRVCAL-- 804 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + + V+D SGS+ QD + L + + Sbjct: 805 HDCKRITLLDVVFVLDHSGSI-------------KKQYQDHM----------INLTIHLV 841 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--LILKPTDSTPAMKQ 300 ++ V G + Y+ + T R + ++ T + A+K Sbjct: 842 KKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALKH 899 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A L ++ + ++ +I +TDGE+++ +T + + Sbjct: 900 A-NALFTE-----------EHGSRIKQNVKQMLIVITDGESHDHDQLNDTAL---ELRNK 944 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I I + + Q+ L+ + V N D L VF + + M V Sbjct: 945 GITIFAVGVG--KANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTEAPEV 998 >gi|118443684|ref|YP_877685.1| hypothetical protein NT01CX_1604 [Clostridium novyi NT] gi|118134140|gb|ABK61184.1| hypothetical protein NT01CX_1604 [Clostridium novyi NT] Length = 1252 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 72/231 (31%), Gaps = 50/231 (21%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 M + + + + LK+A + + + K + +GL+ + TR + Sbjct: 153 TMGTTNYYIQGNKCYRQSSYNEKNRLKHAQESAIKFVKKFENDKN-ISIGLVSFDTRAIE 211 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 E + +V+ + + T+ +K A +IL K Sbjct: 212 QKELTSSLSEVKSSINNLKVAY-NGATNIEAGLKSAQKIL-----------------KKG 253 Query: 326 SLPFQKFIIFLTDG----------------------ENNNFKSNVN---------TIKIC 354 + K++I ++DG +N N +I Sbjct: 254 NEDADKYVILMSDGFPTAFDYAGEKFEENFNEHEVQDNTFINFGYNDYRGYAMKHSINQA 313 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 D K+ I I + N ++L K ++ + N D+L + I Sbjct: 314 DSLKKVGINSFIIGFSDGANSEKLNKIAKAAGGEYEEARNTDALNGAYNKI 364 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 64/163 (39%), Gaps = 25/163 (15%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 SG+ N ++ R+++ ++K F+D E+ + ++ Y Sbjct: 698 SGNFTINGKKYYVKDNKVYEFNEQDRSRIDSVKKVANDFVDKFK----DDENTEIAIVRY 753 Query: 260 TTRVEKNIEPSWGT---EKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFT 315 +++ + ++ S K + + + ++SL T+ ++++Y IL K S Sbjct: 754 SSKADVVLDNSNKVFLSSKDNETIKKRINSLKADVATNIGDGIRKSYSILDKCDKDS--- 810 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 +K++I +TDG + NTIK + K Sbjct: 811 --------------EKYMILMTDGVPTAYTCYANTIKTLNNCK 839 >gi|332817898|ref|XP_003310056.1| PREDICTED: collagen alpha-5(VI) chain isoform 1 [Pan troglodytes] Length = 2615 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 50/356 (14%), Positives = 116/356 (32%), Gaps = 58/356 (16%) Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIV 122 + D + +S + L A F+ + S + ++++ Sbjct: 689 QEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGARLGAKKFLILITDGVAQDDVRDPA 748 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 R + + + ++N + + +S + ++ +HL + +K++ + AL Sbjct: 749 RILRGKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFDHL--KALERKLIFHVCAL-- 804 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + + V+D SGS+ QD + L + + Sbjct: 805 HDCKRITLLDVVFVLDHSGSI-------------KKQYQDHM----------INLTIHLV 841 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--LILKPTDSTPAMKQ 300 ++ V G + Y+ + T R + ++ T + A+K Sbjct: 842 KKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALKH 899 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A L ++ + ++ +I +TDGE+++ +T + + Sbjct: 900 A-NALFTE-----------EHGSRIKQNVKQMLIVITDGESHDHDQLNDTAL---ELRNK 944 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I I + + Q+ L+ + V N D L VF + + M V Sbjct: 945 GITIFAVGVG--KANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTEAPEV 998 >gi|308050057|ref|YP_003913623.1| hypothetical protein Fbal_2347 [Ferrimonas balearica DSM 9799] gi|307632247|gb|ADN76549.1| conserved hypothetical protein [Ferrimonas balearica DSM 9799] Length = 457 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 55/186 (29%), Gaps = 15/186 (8%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 KG ++ I+ +++ + L + + K +++A +AA L+GA + Sbjct: 9 KGRRRQRGAVIIMITIAMFAILAMGALALDGGHLLLNKARLQNAVDAAALSGAVAIQKEY 68 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 L R E + F + + + + + + +T Sbjct: 69 DYLRARQEGL------------VTFTSALGAQDFAELNDRVSLSVLNFASDEVSPQITVE 116 Query: 133 ANNRLD--SSNNTIFYNMDVMTSYDYRL-QFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + R D T +T D L F ++ V+ Sbjct: 117 FSERPDPFVPVLTPGAQYVRVTVSDVPLNNFFAQVMGVDKRVSAVAVAGPSTSTPQCSTD 176 Query: 190 IFLIEL 195 + + + Sbjct: 177 LLPMMV 182 >gi|288928458|ref|ZP_06422305.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] gi|288331292|gb|EFC69876.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] Length = 332 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 46/246 (18%), Positives = 74/246 (30%), Gaps = 72/246 (29%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D +M A K+ F+ D + Sbjct: 88 IDIMLAMDVSTSMLAE---------------DLTPNRMEAAKDVAAEFI--ADRPNDN-- 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVR-----QYVTRDMDS--LILKPTDSTPAMKQAYQ 303 +GL + + T+ Q V D+ + LI T + A Sbjct: 129 ---IGLTIFAGEAFTQCPMT--TDHTSLLNMLQTVRTDIAAKGLIQDGTAIGMGLANAVS 183 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K +S K +I LTDG NN + T + AK I+ Sbjct: 184 RLKDSKAKS------------------KVVILLTDGSNNMGDLSPMTSA--NIAKSLGIR 223 Query: 364 IVTISI---------NASPNG------------QRLLKTCVSSPEYHYNVVNADSLIHVF 402 + TI + G + L ++ Y N L ++ Sbjct: 224 VYTIGVGTNKVARYPMPVAGGVQYVNMPVEIDTKVLKDIAATTDGNFYRATNNQELKQIY 283 Query: 403 QNISQL 408 ++I +L Sbjct: 284 KDIDKL 289 >gi|260437096|ref|ZP_05790912.1| putative von Willebrand factor type A domain protein [Butyrivibrio crossotus DSM 2876] gi|292810406|gb|EFF69611.1| putative von Willebrand factor type A domain protein [Butyrivibrio crossotus DSM 2876] Length = 623 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 45/237 (18%), Positives = 83/237 (35%), Gaps = 52/237 (21%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSM---------HCAMNSDPEDVNSAPICQDKKRTKMA 229 ++ +F +++D SGSM P C K +K Sbjct: 85 KEIKYPNSNNIVFDTVILIDCSGSMRTNDPDFEYSVKNTLYPGSSYQITTCYRKLASK-- 142 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 + + D G++ +T+ E + + + + + Sbjct: 143 -------------NYVKAQGNDDRTGIVLFTSEANTVCELTNSEYVLMNAIDKIYSN--- 186 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T+ A+K++ +ILT+ + S +K I+ ++DGE+ S Sbjct: 187 GGTNFNNAIKESIRILTNTRNDS-----------------EKRILLVSDGESELSSS--- 226 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 + D A EN IKI T+ N LLK + ++ V AD LI+++ I Sbjct: 227 ---VIDLAIENNIKINTV-YIGGQNNNELLKNVAERTGGKYFKAVTADELINIYSEI 279 >gi|258651542|ref|YP_003200698.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258554767|gb|ACV77709.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 681 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 70/224 (31%), Gaps = 54/224 (24%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 +V+D SGSM D ++ A K A+ L ++ + V VY Sbjct: 60 MIVLDASGSM---------------NQDDAPGLRIDAAKAAVTDLLGTLPAPTQVGLMVY 104 Query: 254 MGLIGYTTRVEKN------IEPS-WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 G +T E+ + GT ++ T A++ A Q L Sbjct: 105 -GTSTGSTDAERAAGCQDIKTLAPVGTLNAATLTSQVAGITASGYTPIGNALRAAAQALP 163 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK---ENFI- 362 ++ RS I+ ++DGE+ CD A+ E + Sbjct: 164 NEGPRS--------------------IVLVSDGEDTCAPP-----APCDVARELHEQGVD 198 Query: 363 -KIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQN 404 + T+ + L ++ + + NA L Q Sbjct: 199 LTVHTVGFKVDATARDQLSCVAQATGGTYSDAGNATGLTDALQA 242 >gi|297567412|ref|YP_003686384.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296851861|gb|ADH64876.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 319 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 82/240 (34%), Gaps = 54/240 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P I L +D+S SM D ++ A + AL +F+ + Sbjct: 83 DPRTTIVLALDVSRSM---------------RATDVLPSRFEAAREALKVFIREL----- 122 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + +GL+ ++ + + P+ +++ V T + + Q L Sbjct: 123 -PQGARIGLVTFSRAATEVVAPTTNRQRLLDSVELIGLEF---GTAIGEGILTSLQALPP 178 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 ++R + II LTDG + + + +I A E ++I TI Sbjct: 179 LEQRKDAKDPSELAT----------IILLTDGRSISGIDPLEAARI---AAEQKVRIHTI 225 Query: 368 SINASPNG----------------QRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMV 410 + G + +LK + + ++ V +A L +Q +SQ V Sbjct: 226 GVGRVTEGPVPGLESVYQWAAYFDEDVLKQIAAITGGKYFFVNSAGKLRETYQQLSQSFV 285 >gi|126341670|ref|XP_001379945.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2439 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/141 (19%), Positives = 48/141 (34%), Gaps = 22/141 (15%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMKQAYQIL 305 ++ V +GL + + K + V + SL T + A+ Sbjct: 50 GRDQVRVGLAQFNDNIYKAFLL--NQFPRKSDVLEQILSLPYRTGGTRTGSALNFLRTEF 107 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 ++ S + Q +I +TDGE+N+ + + K K + I Sbjct: 108 FTESAGSRAKDNVPQ-----------IVILVTDGESNDEVAEAAS-----KLKGQGVSIY 151 Query: 366 TISINASPNGQRLLKTCVSSP 386 + IN + LKT S P Sbjct: 152 VVGINVQDVQE--LKTIASKP 170 >gi|71061058|dbj|BAE16255.1| calcium activated chloride channel [Rattus norvegicus] Length = 902 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 77/215 (35%), Gaps = 51/215 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM + + +M A L+L I V+++ + Sbjct: 311 LVLDKSGSMD----------------TEDRLIRMN---QAAELYLTQI-----VEKESMV 346 Query: 255 GLIGYT-TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 GL+ + T +N + +T ++ + T ++ +Q +TS + + Sbjct: 347 GLVTFDSTAQIQNYLIKITNTGDYKKITGNLPQQAVGGTSICRGLEAGFQAITSSDQSTS 406 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 + I+ LTDGE++ S C + K + I TI++ Sbjct: 407 GSE----------------IVLLTDGEDDLISS-------CFEVVKHSGAVIHTIAL-GP 442 Query: 373 PNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNIS 406 + L + Y + +SL+ F IS Sbjct: 443 KAARELETLSDMTGGLRFYANKDVNSLMDAFSGIS 477 >gi|332654605|ref|ZP_08420348.1| von Willebrand factor, type A [Ruminococcaceae bacterium D16] gi|332516569|gb|EGJ46175.1| von Willebrand factor, type A [Ruminococcaceae bacterium D16] Length = 472 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 40/229 (17%), Positives = 70/229 (30%), Gaps = 53/229 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV+D SGSM T +A +K F+D ID + +D Sbjct: 54 DIVLVLDRSGSM--------------------TGTPLADMKLGAKTFIDLIDEATDSSQD 93 Query: 252 ------VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 MG++ ++ + + + ++ V T+ A +A Q Sbjct: 94 GQIGSGSRMGVVSFSNTAVADTQLITSVDALKAAVDNLSAG---GSTNHADAFAKAIQ-- 148 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + P+ K ++ TDG A+ I I Sbjct: 149 ----------------LFDPASANAKVMVMFTDGNTTIGAPPAPVAA---AARAQGIIIY 189 Query: 366 TISINASPN-GQRLLKTCVSSPEYHYNVV--NADSLIHVFQNISQLMVH 411 I + S L + P+ + V NA L +F ++ + Sbjct: 190 CIGLIGSDGLDITALNDWATDPDASHVAVTPNAADLEELFAELAANISK 238 >gi|291544120|emb|CBL17229.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Ruminococcus sp. 18P13] Length = 1117 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 68/227 (29%), Gaps = 54/227 (23%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + I L VD+SGSM+ ++ K AL F+ Sbjct: 570 SGDAGTSQLEPYDIGLCVDVSGSMYGD--------------------RLEKAKTALNTFI 609 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 D++ + ++ ++ E +R + D T++ + Sbjct: 610 DAMLPQDNAC------MVSFSDNAYLVAGYGASKEVMRSRTNQLRDLY---GTNTDVGLS 660 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + IL K II + DG+ N + V D AK Sbjct: 661 KTISILADQ----------------GRSDASKMIIMICDGDVNYIQGTV------DAAKA 698 Query: 360 NFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 I + TI N LL+ + +Y + ++ + I Sbjct: 699 AGIAVYTI--NVVSGDNDLLQKIADETGGEYYYAATTEEVVSQVEAI 743 >gi|89513613|gb|ABD74633.1| capillary morphogenesis protein 2A [Danio rerio] gi|122891370|emb|CAM13145.1| novel protein similar to vertebrate anthrax toxin receptor family protein [Danio rerio] Length = 478 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 32/241 (13%), Positives = 74/241 (30%), Gaps = 49/241 (20%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + + + V+D SGS+ +++ L Sbjct: 22 SSFKAETPSCHGAYDLYFVLDRSGSVSTDW------------------SEIYDFVKNLTE 63 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 V ++ + I +++R E + + ++ + + + T Sbjct: 64 RF--------VSPNLRVSFIVFSSRAEIVLPLTGDRSEINKGLKTLSEVNPAGETYMHEG 115 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 +K A + + + K+S I+ LTDG+ + + TI D A Sbjct: 116 IKLATEQMKKEPKKSSS-----------------IIVALTDGKLETYIHQL-TIDEADSA 157 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 ++ ++ + + Q L S E + V + I ++ + + I Sbjct: 158 RKYGARVYCVGVKDFDEEQ--LADVADSKEQVFPVKGGFQ---ALKGIVNSILKQSCTEI 212 Query: 418 L 418 L Sbjct: 213 L 213 >gi|84386788|ref|ZP_00989813.1| hypothetical protein V12B01_19181 [Vibrio splendidus 12B01] gi|84378316|gb|EAP95174.1| hypothetical protein V12B01_19181 [Vibrio splendidus 12B01] Length = 404 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 40/295 (13%), Positives = 86/295 (29%), Gaps = 51/295 (17%) Query: 156 YRLQFIE-HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV--DLSGSMHCAMNS--- 209 Y L + + + Y K+ + + ++ ELV+ D+SGSM + S Sbjct: 103 YSLSPLLPNFQYESYATKVTATGGGYKSVVESKQSSIPTELVLVLDVSGSMGSNIQSLKS 162 Query: 210 -----------------------------DPEDVNSAPICQDKKRTKMAALKN------- 233 D P K+ + + Sbjct: 163 ILSNALNTIQSQSNNANDLDSVSISIVPFDSGVAAQRPPWLSKEAAGIYCIDGLNYRNGN 222 Query: 234 -ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + L +D++ L + + G+ + ++ T + T Sbjct: 223 FSAALTVDNLATLHSQQPVKFAKPNGWLSDCNQSSPMLPLTSVFSRVRNSINSLTANGGT 282 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTI 351 S + + L ++++ N +P + ++ TDG + + + Sbjct: 283 RSFHGLLWGVRQLIPSWQQAWGINVST----VPETRRK--LVLFTDGADEGDTFDQLVNA 336 Query: 352 KICDKAKEN-FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 C A I++ I S + + C +P ++ N L F +I Sbjct: 337 GFCTTAINQYGIEMNFIGYGVSSSRIAQFERCAGNPSRVFSATNTTQLNEYFSDI 391 >gi|327274976|ref|XP_003222250.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2088 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 79/221 (35%), Gaps = 42/221 (19%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 I V+D SGS+ + +K + + Sbjct: 798 RLDIVFVIDGSGSID--------------------PKEYDIMKE---FMISLVKKSDVSH 834 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + V G + Y+ E + T K D I + T + A++ + + L S++ Sbjct: 835 DRVQFGAVKYSAEPETFFYLNRYTTKSAIIRAIQNDKSIGETTYTAKALRHS-EGLFSEE 893 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-NVNTIKICDKAKENFIKIVTIS 368 + R+ +P + +I +TDG++++ + + K+ + N I I I Sbjct: 894 ------HGSRKHRGVPQV-----LIVITDGDSHDAAELDEVSKKL----RANGIIIYAIG 938 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I LL S + ++ V + L H++ IS+ + Sbjct: 939 IE-RARPDELLTM-AGSEDKYFYVNTFEGLKHLYPRISEKI 977 >gi|294673502|ref|YP_003574118.1| BatA protein [Prevotella ruminicola 23] gi|294472594|gb|ADE81983.1| putative BatA protein [Prevotella ruminicola 23] Length = 332 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 49/281 (17%), Positives = 82/281 (29%), Gaps = 72/281 (25%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 L + LL +++ + I L +D+S SM Sbjct: 53 MPLSMLLRLLVFVMIVMVLARPQTRNSWDSKTVEGIDIMLAMDVSTSMLAE--------- 103 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 D + ++ A K F+ + +GL + + T+ Sbjct: 104 ------DLRPNRIEAAKQVASEFI----IGRPNDN---IGLAIFAGESFTQCPMT--TDH 148 Query: 276 VR-----QYVTRDMDS--LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 Q V D+ + LI T + A L K +S Sbjct: 149 ASLLNLLQNVRTDIAARGLIEDGTAIGMGLANAVSRLKDSKAKS---------------- 192 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI---------NASPNGQRL- 378 K +I LTDG NN + +T + AK I++ TI + G + Sbjct: 193 --KVVILLTDGSNNRGDISPSTAA--EIAKSLGIRVYTIGVGTNKVAPYPMPVAGGVQYV 248 Query: 379 ----------LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 L S + Y N + L +++ I QL Sbjct: 249 NVPVEIDTKTLSEIASITEGDFYRATNTNELRKIYKEIDQL 289 >gi|332285111|ref|YP_004417022.1| hypothetical protein PT7_1858 [Pusillimonas sp. T7-7] gi|330429064|gb|AEC20398.1| hypothetical protein PT7_1858 [Pusillimonas sp. T7-7] Length = 342 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 40/231 (17%), Positives = 81/231 (35%), Gaps = 43/231 (18%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 P+ I L +D+S SM QD++ ++ +K + F+D Sbjct: 86 THTEPVRDILLALDISQSMDSQ---------DFRDAQDRQVSRWTVVKAVVADFIDK--- 133 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + D +GLI + T + + +R + + T A+ ++ Sbjct: 134 ----RTDDRLGLIVFGTGAFPQAPLTRDHKSLRLLLDHTAVGMAGPNTAIGDAIGMGIRM 189 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L S ++R K +I LTDG N+ S V ++ + A ++ + + Sbjct: 190 LDSAQER------------------DKVLILLTDG--NDTGSAVPPLRAANLAAQHHVTV 229 Query: 365 VTISINASPNGQR-------LLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 TI I + L +S + + +L V+ + ++ Sbjct: 230 HTIGIGSPTASGDDQVDFDTLRGISSASGGQFFQAQDGAALHDVYATLDRI 280 >gi|145587695|ref|NP_001038174.2| anthrax toxin receptor 2a [Danio rerio] gi|141796884|gb|AAI39637.1| Anthrax toxin receptor 2a [Danio rerio] Length = 478 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 32/241 (13%), Positives = 74/241 (30%), Gaps = 49/241 (20%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + + + V+D SGS+ +++ L Sbjct: 22 SSFKAETPSCHGAYDLYFVLDRSGSVSTDW------------------SEIYDFVKNLTE 63 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 V ++ + I +++R E + + ++ + + + T Sbjct: 64 RF--------VSPNLRVSFIVFSSRAEIVLPLTGDRSEINKGLKTLSEVNPAGETYMHEG 115 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 +K A + + + K+S I+ LTDG+ + + TI D A Sbjct: 116 IKLATEQMKKEPKKSSS-----------------IIVALTDGKLETYIHQL-TIDEADSA 157 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 ++ ++ + + Q L S E + V + I ++ + + I Sbjct: 158 RKYGARVYCVGVKDFDEEQ--LADVADSKEQVFPVKGGFQ---ALKGIVNSILKQSCTEI 212 Query: 418 L 418 L Sbjct: 213 L 213 >gi|293391324|ref|ZP_06635658.1| Flp pilus assembly protein TadG [Aggregatibacter actinomycetemcomitans D7S-1] gi|290951858|gb|EFE01977.1| Flp pilus assembly protein TadG [Aggregatibacter actinomycetemcomitans D7S-1] Length = 525 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 41/285 (14%), Positives = 87/285 (30%), Gaps = 43/285 (15%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 +L ++ K+ +E ++II AL LLL+ F + K + A + A Sbjct: 6 NLTAKLFNSIKQFFQNEHGVYAIITALLAFQLLLLVAFTVDGTGILLDKARLAQATDQAA 65 Query: 62 L---------------AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL 106 L + S+ ++S+ E S + + + ++ +K L Sbjct: 66 LLLIAEDNKYRKNKDHSDVSR--QHVSQQDINREGNSKVQAQWKKRN-QELVQGLVKLYL 122 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 I+ ++ + + N T ++ + + +F L Sbjct: 123 RSDDKNGQKNSSPAIIKDPFLAECLEEKTQPKNKNGTAK-SIACVVQGSVQRKFW--LPW 179 Query: 167 QRYNQKIVSFIPALLRIEMGERPI---------FLIELVVDLSGSMHCAMNSDPEDVNSA 217 + + I G+ + +V DLSGSM V+ Sbjct: 180 GQTLVSSSRLHDGRVGINSGKTYAVKDKQITIPIDLMMVTDLSGSM----------VSPI 229 Query: 218 PICQDKKRTKMAALKNALLLFLDSI---DLLSHVKEDVYMGLIGY 259 ++ AL++ + + D MG + + Sbjct: 230 DKRIPSSSIRIDALRDVVKDIEGILLPKDSRDDTSPYNRMGFVAF 274 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 35/230 (15%), Positives = 68/230 (29%), Gaps = 37/230 (16%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 C + + I + K+ L ++D ++ + + I Y Sbjct: 286 DCVLPYYAQQSKKEEISNLYRNNKLDQASKLLDQYMDIERTINQIDQFNGSN-ISYDFIN 344 Query: 264 EKNIEP--SWGTEKVRQYV--------TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 S G E R + + T T M ++T K Sbjct: 345 TTKKCLGKSEGKETTRAWFDKKNLGVSNALNEIDPDGGTAVTSGMFIGTNLMTDTNK--- 401 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV----NTIKICDKAK----------- 358 + ++ ++ L+DGE+N + +C+K K Sbjct: 402 ---DPEAAPSKLNTNTRRILLVLSDGEDNRPTEGTLVKLMSAGLCNKIKRKIDSLQDTKY 458 Query: 359 ---ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 E + V + N + + K CV + +Y V + L+ F+ I Sbjct: 459 PKVEARVAFVALGYNPPQDQVNVWKQCV--GKQYYTVFSKQGLLDAFRQI 506 >gi|74136383|ref|NP_001028084.1| calcium-activated chloride channel regulator 1 precursor [Macaca mulatta] gi|75043731|sp|Q6PT52|CLCA1_MACMU RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Flags: Precursor gi|46371863|gb|AAS90562.1| calcium-activated chloride channel family member 1 [Macaca mulatta] Length = 913 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 48/240 (20%), Positives = 86/240 (35%), Gaps = 69/240 (28%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL I L ++ Sbjct: 309 LVLDKSGSMA-------------------TGNRLNRLNQAGQLFLLQIIELRS-----WV 344 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R +T+ + + T ++ A+ ++ K+ + Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLTKRLPTAASGGTSICSGLRLAFTVI----KKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 401 PTDGSE-------------IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 + L L+T S V N + LI F +S ++ R + KG Sbjct: 440 SAARELEELSKMTGGLQTYASD-----QVQN-NGLIDAFGALSSGNGVVSERSIQLESKG 493 >gi|32452632|gb|AAP43994.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 525 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 41/285 (14%), Positives = 87/285 (30%), Gaps = 43/285 (15%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 +L ++ K+ +E ++II AL LLL+ F + K + A + A Sbjct: 6 NLTAKLFNSIKQFFQNEHGVYAIITALLAFPLLLLVAFTVDGTGILLDKARLAQATDQAA 65 Query: 62 L---------------AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL 106 L + S+ ++S+ E S + + + ++ +K L Sbjct: 66 LLLIAEDNKYRKNKDHSDVSR--QHVSQQDINREGNSKVQAQWKKRN-QELVQGLVKLYL 122 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 I+ ++ + + N T ++ + + +F L Sbjct: 123 RSDDKNGQKNSSPAIIKDPFLAECLEEKTQPKNKNGTAK-SIACVVQGSVQRKFW--LPW 179 Query: 167 QRYNQKIVSFIPALLRIEMGERPI---------FLIELVVDLSGSMHCAMNSDPEDVNSA 217 + + I G+ + +V DLSGSM V+ Sbjct: 180 GQTLVSSSRLHDGRVGINSGKTYAVKDKQITIPIDLMMVTDLSGSM----------VSPI 229 Query: 218 PICQDKKRTKMAALKNALLLFLDSI---DLLSHVKEDVYMGLIGY 259 ++ AL++ + + D MG + + Sbjct: 230 DKRIPSSSIRIDALRDVVKDIEGILLPKDSRDDTSPYNRMGFVAF 274 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 35/230 (15%), Positives = 68/230 (29%), Gaps = 37/230 (16%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 C + + I + K+ L ++D ++ + + I Y Sbjct: 286 DCVLPYYAQQSKKEEISNLYRNNKLDQASKLLDQYMDIERTINQIDQFNGSN-ISYDFIN 344 Query: 264 EKNIEP--SWGTEKVRQYV--------TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 S G E R + + T T M ++T K Sbjct: 345 TTKKCLGKSEGKETTRAWFDKKNLGVSNALNEIDPDGGTAVTSGMFIGTNLMTDTNK--- 401 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV----NTIKICDKAK----------- 358 + ++ ++ L+DGE+N + +C+K K Sbjct: 402 ---DPEAAPSKLNTNTRRILLVLSDGEDNRPTEGTLVKLMSAGLCNKIKRKIDSLQDTKY 458 Query: 359 ---ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 E + V + N + + K CV + +Y V + L+ F+ I Sbjct: 459 PKVEARVAFVALGYNPPQDQVNVWKQCV--GKQYYTVFSKQGLLDAFRQI 506 >gi|294139879|ref|YP_003555857.1| hypothetical protein SVI_1108 [Shewanella violacea DSS12] gi|293326348|dbj|BAJ01079.1| hypothetical protein [Shewanella violacea DSS12] Length = 405 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 52/354 (14%), Positives = 100/354 (28%), Gaps = 52/354 (14%) Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ 167 GYS + I++ R L S + + +Y + + L Sbjct: 57 GYSQRADQDLLVGILDYYRPGFVVHDGEALVSIDGKN--RCSIEATYRFNPTMMALLPES 114 Query: 168 RYNQKIVSFIPALLR--IEMGERPIFLIELVVDLSGSMHCAM---------------NSD 210 +S + + LV+D+S SM + D Sbjct: 115 ARTHVSLSSDTGATSHLVINSTPLPMDLALVLDISSSMSAQLPQLKLIINGALEEIRQQD 174 Query: 211 PEDVNSAPICQDKKRTKMAAL------KNALL-LFLDSIDLLSHVKEDVYM----GLIGY 259 P +V T + L K+A +D + H + Sbjct: 175 PNEVGGVRFSLVPFETGVGVLNAPWMPKSAAKVTCVDGLSYGQHSVDYARTVDDLAEPAA 234 Query: 260 TTRVEKNIEPSW-----------GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ W + V + +D+L+ T S+ YQ L Sbjct: 235 NLNIKSVFASQWLDACSMDATILPLTQDLNLVKQRVDALVTSGTTSS------YQGLIWG 288 Query: 309 KKRSF-FTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAKENF-IKIV 365 + Q + S + ++ TDG + ++ +C ++ I++ Sbjct: 289 VRTLLPQWQEEWQIPPVESPALIQRLVLFTDGADQGFHLDDLIEQGLCRVIQDKHHIEMS 348 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 I S + + C Y+ N L F+ L K S++L+ Sbjct: 349 FIGFGVSDRRLQQFRECAGDKGKVYDAQNTQELEAFFRE--ALQTDTKASLVLR 400 >gi|71983551|ref|NP_509176.2| hypothetical protein C16E9.1 [Caenorhabditis elegans] gi|34366001|gb|AAC47957.2| Hypothetical protein C16E9.1 [Caenorhabditis elegans] Length = 565 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 37/242 (15%), Positives = 75/242 (30%), Gaps = 48/242 (19%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 N + F+ + + I+ I ++ D SGS+ +K Sbjct: 365 NLEAKKHTEDFVKTPEKRGPVKDCIYDIGIIFDSSGSL------------------EKNF 406 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 K A L ++ + +G++ + G KVR + Sbjct: 407 QKQLAFAKQL------VEQMPISDNATRVGIVQFA-----------GKTKVRVLANFSQN 449 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 LK Y T + + K P+ + ++ TDG + Sbjct: 450 KSQLKTIIDRSPF---YSGTTFTNQALKKMAALYEESKRPNAKLK--LMLFTDGYSAEDT 504 Query: 346 SNVNTIKICDKAKENFIKIVTISINAS-PNGQRL--LKTCVSSPEYHYNVVNADSLIHVF 402 S + K + + T+ I+ G + L+ +S E++Y+ + L+ F Sbjct: 505 SEGE-----EALKSQGVVVYTVGISTDKSAGLNMKELRGMATSSEHYYDSSDFADLLKHF 559 Query: 403 QN 404 + Sbjct: 560 PS 561 >gi|159040640|ref|YP_001539892.1| von Willebrand factor type A [Caldivirga maquilingensis IC-167] gi|157919475|gb|ABW00902.1| von Willebrand factor type A [Caldivirga maquilingensis IC-167] Length = 474 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 44/285 (15%), Positives = 92/285 (32%), Gaps = 50/285 (17%) Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 I N+ R+S +N Y + +Y L + ++ + Sbjct: 228 IGNALRVSRFTEHSNYPTYITGVREYRIG-DPAYRIDLDKTSMNMVRKTFLNKPMSTRDI 286 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + E + + I L +D SGSM + KM K A++ ++ Sbjct: 287 VVREYADVKLMDIVLCLDTSGSMKEFSGAYM---------------KMDIAKEAIVKYIR 331 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL----ILKPTDSTP 296 + + + ++ + R + WG V++Y+ + T+ Sbjct: 332 YLSRTND-----RLSMVLFNFRAD----ILWGPHSVKKYINEMEEMSRYIYPGGGTNIAN 382 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+++A IL+ K II +TDG N S IK K Sbjct: 383 ALEKARIILSKSN------------------YPNKHIICITDGRTVNASS---CIKEAVK 421 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + + T+++ + + L++ + + +L Sbjct: 422 LRRMGVTLSTVAVGDNSDFDLLMRLSKIGNGLFIKINDISNLDKA 466 >gi|296393889|ref|YP_003658773.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] gi|296181036|gb|ADG97942.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] Length = 343 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 44/249 (17%), Positives = 80/249 (32%), Gaps = 55/249 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + LVVD+S SM +C D + T++ A + A I + Sbjct: 97 KNRATVVLVVDISLSM---------------VCDDVRPTRVDAARQAA------IKFVDE 135 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILT 306 ++ + +GL+ + + I PS E V+ + + + T + + A Q + Sbjct: 136 MEPTLQLGLVTFAGTAQTLIAPSSDHEIVKHALDEAIRPDKLAARTATGEGIYTALQQI- 194 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK--- 363 ++ K PS I+ +DG+ AKE K Sbjct: 195 -----ETLSSILGGKSKAPSAR----IVLESDGKETVPDDLNAPRGAFTAAKEAKAKEVP 245 Query: 364 IVTISIN----------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN--- 404 I +IS + L K S + + D L V+ + Sbjct: 246 IYSISFGTTRPIPYVNIQGSRVPVPADDASLQKVAELSGGKFFTAGSLDQLSDVYSSLNA 305 Query: 405 -ISQLMVHR 412 I +V + Sbjct: 306 EIGYDLVKQ 314 >gi|224048789|ref|XP_002188138.1| PREDICTED: matrilin 3 [Taeniopygia guttata] Length = 284 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 64/174 (36%), Gaps = 26/174 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDST 295 + ID L + + ++ Y + V+ + +++ V+R + T + Sbjct: 56 LSEMIDTLDVGERTTRVAVMNYASTVKVEFPLRTYFDKASMKEAVSRIEP--LSAGTMTG 113 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + + ++ G + + + +I +TDG + V + Sbjct: 114 LAIQTAMEEVFTE----------EMGTRPAAFNIPRVVIVVTDG---RPQDQVQDVA--A 158 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 A+ I+I T+ + + L+ S P E+ + V + L F+ Sbjct: 159 SARTAGIEIYTVGVG--RADMQALRIMASEPLDEHVFYVETYGVIEKLTSRFRE 210 >gi|119599630|gb|EAW79224.1| hypothetical protein FLJ35880 [Homo sapiens] Length = 2531 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 50/356 (14%), Positives = 116/356 (32%), Gaps = 58/356 (16%) Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIV 122 + D + +S + L A F+ + S + ++++ Sbjct: 634 QEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGARLGAKKFLILITDGVAQDDVRDPA 693 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 R + + + ++N + + +S + ++ +HL + +K++ + AL Sbjct: 694 RILRGKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFDHL--KALERKLIFRVCAL-- 749 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + + V+D SGS+ QD + L + + Sbjct: 750 HDCKRITLLDVVFVLDHSGSI-------------KKQYQDHM----------INLTIHLV 786 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--LILKPTDSTPAMKQ 300 ++ V G + Y+ + T R + ++ T + A+K Sbjct: 787 KKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALKH 844 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A L ++ + ++ +I +TDGE+++ +T + + Sbjct: 845 A-NALFTE-----------EHGSRIKQNVKQMLIVITDGESHDHDQLNDTAL---ELRNK 889 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I I + + Q+ L+ + V N D L VF + + M V Sbjct: 890 GITIFAVGVG--KANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTEAPEV 943 >gi|73669697|ref|YP_305712.1| BatA [Methanosarcina barkeri str. Fusaro] gi|72396859|gb|AAZ71132.1| BatA [Methanosarcina barkeri str. Fusaro] Length = 317 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 42/251 (16%), Positives = 89/251 (35%), Gaps = 60/251 (23%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + LV+D+SGSM QD +++ A K++ + ++S+ Sbjct: 84 QTKEGVNVVLVMDVSGSMQA---------------QDYTPSRLEAAKSSAEILINSLKSK 128 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 Y G++ + + S EKV + + + T + + Sbjct: 129 D------YAGIVTFESGATTAAYLSPYKEKVIEKL--RNVAPKEGSTAIGDGLSLGIDMA 180 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 +S + +K II L+DG NN + + AK N I++ Sbjct: 181 SSIPNK------------------KKVIILLSDGVNNAGYISPDEAIQ--YAKANNIQVY 220 Query: 366 TISINASPN----------------GQRLLKTCVSS-PEYHYNVVNADSLIHVFQNISQL 408 TI + ++ N + L+ + ++ ++ +L +++NIS+ Sbjct: 221 TIGMGSNGNVLLGYDWFGNPQYAELDEATLQAIANDTGGKYFKSIDDKTLDEIYKNISEN 280 Query: 409 MVHRKYSVILK 419 + K +K Sbjct: 281 IKREKEETNIK 291 >gi|15602708|ref|NP_245780.1| hypothetical protein PM0843 [Pasteurella multocida subsp. multocida str. Pm70] gi|12721152|gb|AAK02927.1| TadG [Pasteurella multocida subsp. multocida str. Pm70] Length = 588 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 44/284 (15%), Positives = 95/284 (33%), Gaps = 36/284 (12%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 ++L+ +K +E ++++ AL L+LIGF + K + + A Sbjct: 4 LYLIRSCCDSIRKFYENELGVYTVMTALLAFPLLVLIGFTVDGTGVVLDKARLAQGMDQA 63 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 LA +V+ + + + + + D +F N + + Sbjct: 64 ALA----LVAENNDYRENKKHGDVNRQVVSPQDKAKFGGNEFMAKQEKRNQELIQGIAKL 119 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMT--SYDYRLQFIEHLLNQRYNQKIVSFIP 178 + S + + A +D + +D+ T Y R + + + + Sbjct: 120 YLRSENANASSDAPITIDKPFHYSCEELDLPTGNEYARRKPIVCEVQGGVNRKFWLPVSE 179 Query: 179 ALLRIEMGERPIFLIE-------------------LVVDLSGSMHCAMNSDPEDVNSAPI 219 +L+ + ++ +E LV D SGSM NS +D N + Sbjct: 180 SLVSADKLKQDRIRMESDTSYAIKEKGIVIPVELMLVSDFSGSM----NSHLQDKNGRSL 235 Query: 220 CQDKKRTKMAALKNALLLFLDSI---DLLSHVKEDVYMGLIGYT 260 +TK+ L+ + + D+ V +G ++ Sbjct: 236 ----GKTKITILREVVSEISKILLPEDVSEGVSPFNRIGFTTFS 275 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 28/154 (18%), Positives = 52/154 (33%), Gaps = 26/154 (16%) Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 +W +K + T ++ + I+ + K + K+ Sbjct: 426 AWFDQKNKDITRELNIVRPSGWTSASSGLLVGANIMMDENKSPD-----AKPSKLG-TNI 479 Query: 330 QKFIIFLTDGENNNFKSNVNTI----KICDKAKEN--------------FIKIVTISINA 371 Q+ I+ L+DGE+N + T +CDK +E I V + Sbjct: 480 QRVILVLSDGEDNWPTYSTLTTLLNNGMCDKIREQLGKLQDPNLRELPGRIAFVAFGYSP 539 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 N K CV + +Y + + L+ F+ I Sbjct: 540 PANQVAAWKKCV--GDQYYTAYSKEELLESFKQI 571 >gi|222478562|ref|YP_002564799.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] gi|222451464|gb|ACM55729.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] Length = 491 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 69/222 (31%), Gaps = 61/222 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 ++LV+D SGSM +++A K+ ++I + + Sbjct: 228 VQLVIDTSGSMGG--------------------SRIANTKSGAKQLAETILDANPDNQV- 266 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 G+ + + + + V + S T++ + L + + Sbjct: 267 --GVTRFNNGASTPQQLTDDLDDVEAAIDGLSAS---GGTNAQAGVDAGQAELENCPHDN 321 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 + ++ DG+ N S AK +I I + Sbjct: 322 ------------------RVMVVFGDGDINTDGS---------AAKVAGTEIFAIGV--- 351 Query: 373 PNGQRL--LKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 G L+ S P E+ + ++ ++ +F +++ + Sbjct: 352 -GGASFSDLEDLASDPADEHVFFAIDDGAIEQIFGQVAETIT 392 >gi|47523388|ref|NP_999313.1| calcium-activated chloride channel regulator 1 precursor [Sus scrofa] gi|75051712|sp|Q9TUB5|CLCA1_PIG RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; AltName: Full=pCLCA1; Flags: Precursor gi|6002646|gb|AAF00077.1|AF095584_1 epithelial chloride channel protein [Sus scrofa] Length = 917 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 44/223 (19%), Positives = 78/223 (34%), Gaps = 66/223 (29%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL V++ ++ Sbjct: 309 LVLDKSGSMTVG-------------------GRLKRLNQAGKLFLLQ-----TVEQGAWV 344 Query: 255 GLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R + R + + T ++ A+ ++ K+ + Sbjct: 345 GMVAFDSAAYVKSELVQINSAAERDALARSLPTAASGGTSICSGLRSAFTVI----KKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C + K+N I T+++ Sbjct: 401 PTDGSE-------------IVLLTDGEDN-------TISACFPEVKQNGAIIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + L L+T S N + LI F +S Sbjct: 440 SAAKELEELSQMTGGLQTYASD-----QAEN-NGLIDAFGALS 476 >gi|159896782|ref|YP_001543029.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159889821|gb|ABX02901.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 950 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 69/243 (28%), Gaps = 41/243 (16%) Query: 184 EMGERPIFLIELVVDLSGSM-HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 +RP + ++D SGSM C N + K+ K A+ + Sbjct: 401 NRQQRPDIALVFIIDKSGSMDACHCNGGDMAAREGGGTR-----KIDIAKEAVAQAAAVL 455 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 +G++ + IE V T+ M AY Sbjct: 456 GKDD------KLGVVTFDDSAHWTIELD-KVPSQDDVVAALAPVPPSGQTNVVSGMNAAY 508 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + L + I LTDG + + I + ++ I Sbjct: 509 EQLRQSDAKIKHA------------------ILLTDG----WGHATDIGSIAENMNKDGI 546 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-----QNISQLMVHRKYSVI 417 + ++ + + L + +Y + + +F Q + +V +++ Sbjct: 547 TLSVVA-AGNGSDNALQRYAELGGGRYYPARVMEEVPQIFLQETIQAVGTYIVEEQFTPA 605 Query: 418 LKG 420 G Sbjct: 606 YAG 608 >gi|61557272|ref|NP_001013220.1| chloride channel calcium activated 2 [Rattus norvegicus] gi|38175219|dbj|BAD01114.1| Ca(2+)-activated chloride channel [Rattus norvegicus] Length = 903 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 77/215 (35%), Gaps = 51/215 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM + + +M A L+L I V+++ + Sbjct: 311 LVLDKSGSMD----------------TEDRLIRMN---QAAELYLTQI-----VEKESMV 346 Query: 255 GLIGYT-TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 GL+ + T +N + +T ++ + T ++ +Q +TS + + Sbjct: 347 GLVTFDSTAQIQNYLIKITNTGDYKKITGNLPQQAVGGTSICRGLEAGFQAITSSDQSTS 406 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 + I+ LTDGE++ S C + K + I TI++ Sbjct: 407 GSE----------------IVLLTDGEDDLISS-------CFEVVKHSGAVIHTIAL-GP 442 Query: 373 PNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNIS 406 + L + Y + +SL+ F IS Sbjct: 443 KAARELETLSDMTGGLRFYANKDVNSLMDAFSGIS 477 >gi|126664966|ref|ZP_01735949.1| hypothetical protein MELB17_17899 [Marinobacter sp. ELB17] gi|126630336|gb|EBA00951.1| hypothetical protein MELB17_17899 [Marinobacter sp. ELB17] Length = 341 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/233 (16%), Positives = 77/233 (33%), Gaps = 53/233 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVD+S SM + Q ++ ++ A+K L F+D ++ Sbjct: 89 DLMLVVDISPSMD----------EPDMVRQGRRINRLQAVKQVLAEFIDQ-------RQG 131 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ V + + T A+ A + L Sbjct: 132 DRLGLILFGSQAYVQAPLTFDRTTVNILLQEAGLGMAGNATAIGDAVGLAVKRLRERPLE 191 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ I LTDG N + + + A+ + +++ TI I A Sbjct: 192 ------------------QRVAIVLTDGANTAGEITPDKAS--ELAQASAVRLYTIGIGA 231 Query: 372 -------------SPNGQR---LLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 L + + ++ N L ++ +I+QL Sbjct: 232 GADSAITGLLQRNPSRDLDEALLTRMAQQTGGQYFRARNLAELGGIYTSINQL 284 >gi|2623767|gb|AAB86531.1| Lu-ECAM-1 [Bos taurus] Length = 820 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 78/238 (32%), Gaps = 61/238 (25%) Query: 177 IPALLRIEMGERPIFL--------IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + P F + LV+D SGSM ++ Sbjct: 286 TSPMTEMNPPTHPTFSLLKSKQRVVCLVLDKSGSMSAE-------------------DRL 326 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSL 287 + A L+L + +++ +G++ + + E + T+ V Q +T + + Sbjct: 327 FQMNQAAELYLIQV-----IEKGSLVGMVTFDSVAEIQNHLTRITDDNVYQKITAKLPQV 381 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +K +Q + + + + II LTDGE+N S Sbjct: 382 ANGGTSICRGLKAGFQAIIHSDQSTSGSE----------------IILLTDGEDNEINS- 424 Query: 348 VNTIKIC-DKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNVVNADSLIHVF 402 C + K + I TI++ + L L ++ + + L + F Sbjct: 425 ------CFEDVKRSGAIIHTIAL-GPSAAKELETLSN-MTGGYRFFANKDITGLTNAF 474 >gi|161788949|dbj|BAF95091.1| double von Willebrand factor A domains [Mus musculus] Length = 2309 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 23/144 (15%), Positives = 53/144 (36%), Gaps = 22/144 (15%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMKQAY 302 L + V +GL+ Y+ + ++ V + +L + T + A++ Sbjct: 265 LDVRSDQVQVGLVQYSDNIYPAFPLK--QSSLKSAVLDRIRNLPYSMGGTSTGSALEF-- 320 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + T K + ++ +TDGE+++ +V D+ K + + Sbjct: 321 ------IRANSLTEMSGSRAKDG---VPQIVVLVTDGESSDEVQDVA-----DQLKRDGV 366 Query: 363 KIVTISINASPNGQRLLKTCVSSP 386 + + IN + L+ S P Sbjct: 367 FVFVVGINIQDVQE--LQKIASEP 388 >gi|148689169|gb|EDL21116.1| mCG140660 [Mus musculus] Length = 2242 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 23/144 (15%), Positives = 53/144 (36%), Gaps = 22/144 (15%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMKQAY 302 L + V +GL+ Y+ + ++ V + +L + T + A++ Sbjct: 265 LDVRSDQVQVGLVQYSDNIYPAFPLK--QSSLKSAVLDRIRNLPYSMGGTSTGSALEF-- 320 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + T K + ++ +TDGE+++ +V D+ K + + Sbjct: 321 ------IRANSLTEMSGSRAKDG---VPQIVVLVTDGESSDEVQDVA-----DQLKRDGV 366 Query: 363 KIVTISINASPNGQRLLKTCVSSP 386 + + IN + L+ S P Sbjct: 367 FVFVVGINIQDVQE--LQKIASEP 388 >gi|332221819|ref|XP_003260062.1| PREDICTED: calcium-activated chloride channel regulator 1 [Nomascus leucogenys] Length = 914 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 46/240 (19%), Positives = 84/240 (35%), Gaps = 69/240 (28%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL + + ++ Sbjct: 309 LVLDKSGSMA-------------------TGNRLNRLNQAGQLFL--LQTVELGS---WV 344 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R + + + + T ++ A+ ++ K+ + Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKRLPAAASGGTSICRGLRSAFTVI----KKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 401 PTDGSE-------------IVLLTDGEDN-------TISECFNEVKQSGAIIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 Q L L+T S V N + LI F +S + R + KG Sbjct: 440 SAAQELEELSKMTGGLQTYASD-----QVQN-NGLIDAFGALSSGNGAVSQRSIQLESKG 493 >gi|325860278|ref|ZP_08173400.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482157|gb|EGC85168.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 318 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 42/267 (15%), Positives = 74/267 (27%), Gaps = 58/267 (21%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 L + + +++ + + I L +D+S SM Sbjct: 53 IHLPMVLRCMVYTLTVIVLARPQTYNAWDDKDTEGIDIMLTMDVSASM------------ 100 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS----- 270 + +D +M K F+ +GL + + Sbjct: 101 ---LTEDVYPNRMVVAKEVASEFI----SGRPNDN---IGLTIFAGEAFTQCPMTLDHAA 150 Query: 271 --WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 VR L+ T + A L K +S Sbjct: 151 LLNLLHGVRT--DLVTSGLMQDGTAIGMGLANAVSRLKDSKAKS---------------- 192 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG-------QRLLKT 381 K +I LTDG NN + T A++ I++ TI + L Sbjct: 193 --KIVILLTDGSNNAGSISPMTAA--AIARKFGIRVYTIGFGKETGEEIGAIDYKTLQDI 248 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQL 408 VS+ Y + L ++Q+I +L Sbjct: 249 AVSTNGEFYRAQSQAELSRIYQDIDKL 275 >gi|319952789|ref|YP_004164056.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421449|gb|ADV48558.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 332 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 41/256 (16%), Positives = 73/256 (28%), Gaps = 65/256 (25%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + + I + +D+S SM D K ++A+LK Sbjct: 76 PQTEDISTKTKTTKGIDIVMAIDVSSSMLAR---------------DLKPNRLASLKKVA 120 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 F+ + + +GL+ Y + V + + T Sbjct: 121 ADFIKK-------RPNDRIGLVVYAGESYTKTPITSDKGIVLNALKEITYGSLEDGTAIG 173 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + + L S K II LTDG NN+ T + Sbjct: 174 MGLATSVNRL------------------KESKALSKVIILLTDGINNSGFIEPQTAA--E 213 Query: 356 KAKENFIKIVTISI---------------------NASPN-GQRLLKTCV-SSPEYHYNV 392 A E IK TI + + LL+ ++ ++ Sbjct: 214 LAVEYDIKTYTIGLGTNGNALSPIAINSDGSFRYGMKPVEIDEGLLEQIAKTTGGAYFRA 273 Query: 393 VNADSLIHVFQNISQL 408 N +SL ++ I++L Sbjct: 274 TNNESLASIYDEINKL 289 >gi|297664534|ref|XP_002810694.1| PREDICTED: calcium-activated chloride channel regulator 1-like isoform 2 [Pongo abelii] Length = 914 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 84/240 (35%), Gaps = 69/240 (28%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL + + ++ Sbjct: 309 LVLDKSGSMA-------------------TGNRLNRLNQAGQLFL--LQTVELGS---WV 344 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R + + + + T ++ A+ ++ ++ + Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI----RKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 401 PTDGSE-------------IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 Q L L+T S V N + LI F +S + R + KG Sbjct: 440 SAAQELEELSKMTGGLRTYASD-----QVQN-NGLIDAFGALSSGNGAVSQRSIQLESKG 493 >gi|168700938|ref|ZP_02733215.1| BatA [Gemmata obscuriglobus UQM 2246] Length = 317 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/223 (14%), Positives = 63/223 (28%), Gaps = 45/223 (20%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +R + I+ VD+SGSM + A A+ FLD Sbjct: 92 QKRSLTNIQFAVDVSGSMLAPFGDGN---------------RYDASMKAIDTFLD-FRKG 135 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR---QYVTRDMDSLILKPTDSTPAMKQAY 302 GL + + + +R ++ + T A+ Sbjct: 136 DA------FGLTFFGDAFVHWVPLTTDVTAIRCSPPFMRPETVPPPFGGTAIAKALNGCK 189 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 L + K I+ +TDG ++ N +I + Sbjct: 190 TELRRRDEG------------------DKMIVLITDG--FSYDLTGNDEEIARTLSAEGV 229 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + I + ++ C + + + D+L VF+ I Sbjct: 230 AVFCIIVGGFEPQAEIVNICRLTGGEAFRADDPDALPAVFKKI 272 >gi|307595413|ref|YP_003901730.1| von Willebrand factor type A [Vulcanisaeta distributa DSM 14429] gi|307550614|gb|ADN50679.1| von Willebrand factor type A [Vulcanisaeta distributa DSM 14429] Length = 495 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 48/305 (15%), Positives = 103/305 (33%), Gaps = 55/305 (18%) Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV------MTSYDYRLQF 160 ++N+V ++ M + R + Y + V +D LQ Sbjct: 227 RSRDGERMFEALRNVVRNAMSGMGQVKVVRFTDIDKYPTYVVSVREYKIGDNYFDVDLQK 286 Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 L+++ + ++ E I L +D+SGSM N P Sbjct: 287 TAMNLSRKSMMHKLFTNKDIVVKEYANVKTIDIVLCLDVSGSMRELSNGMP--------- 337 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 K+ K+A+ ++ + + + ++ + R + WG +VR+Y Sbjct: 338 ------KIEIAKDAVAQYIQFLSKTND-----RLAMVLFNFRAD----VLWGLHQVRRYW 382 Query: 281 TRDMDSL----ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + L T+ A++++ ++LT + S K +I + Sbjct: 383 QQMNYMLKYVYAGGGTNLANALERSREVLTRSRSNS------------------KHVICV 424 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 TDG ++ IK + + + I TI+I + + + L++ + + Sbjct: 425 TDG---RTVNSSMCIKEAVRLRRSGTTISTIAIGENSDDELLMRLSKIGGGLFIKISSIH 481 Query: 397 SLIHV 401 L Sbjct: 482 DLGKA 486 >gi|311254858|ref|XP_001927013.2| PREDICTED: calcium-activated chloride channel regulator 4 [Sus scrofa] Length = 910 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 83/234 (35%), Gaps = 56/234 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM + ++ + A FL I V+ ++ Sbjct: 291 LVLDKSGSMSSS-------------------NRLNRMNQAAKYFLMQI-----VENGSWV 326 Query: 255 GLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + + T R + + + T +++A++++ + Sbjct: 327 GMVHFDSTASIRSDLIQITGSNERDKLLGSLPTTASGGTSICSGIRRAFEVVRKLYSHTD 386 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 + I+ LTDGE+N T C D+ K++ I I++ Sbjct: 387 GSE----------------IVLLTDGEDN-------TAGACVDEVKQSGAIIHFIAL-GP 422 Query: 373 PNGQRLLKTCVSSPE-YHYNVVNADS--LIHVFQNIS---QLMVHRKYSVILKG 420 + +++ ++ + Y A++ LI F ++ + + + KG Sbjct: 423 SADKAVIEMSTATGGVHFYATDEAENNGLIDAFGALASGNTDISQQSLQLESKG 476 >gi|297664532|ref|XP_002810693.1| PREDICTED: calcium-activated chloride channel regulator 1-like isoform 1 [Pongo abelii] Length = 914 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 84/240 (35%), Gaps = 69/240 (28%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL + + ++ Sbjct: 309 LVLDKSGSMA-------------------TGNRLNRLNQAGQLFL--LQTVELGS---WV 344 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R + + + + T ++ A+ ++ ++ + Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI----RKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 401 PTDGSE-------------IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 Q L L+T S V N + LI F +S + R + KG Sbjct: 440 SAAQELEELSKMTGGLRTYASD-----QVQN-NGLIDAFGALSSGNGAVSQRSIQLESKG 493 >gi|114707049|ref|ZP_01439948.1| Von Willebrand domain containing protein [Fulvimarina pelagi HTCC2506] gi|114537599|gb|EAU40724.1| Von Willebrand domain containing protein [Fulvimarina pelagi HTCC2506] Length = 317 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/228 (17%), Positives = 71/228 (31%), Gaps = 48/228 (21%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L +DLSGSM + T++ A+K A+ + Sbjct: 89 DLILAIDLSGSME----------REDFDLDGRTVTRLDAVK-AVAKEFVTSRAGD----- 132 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ V + V + + T + A + L + Sbjct: 133 -RVGLILFAEFAYTAAPLTFDVAAVSRIVDEATIGISGRSTAIAGGLGLALKRLKRSDAQ 191 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 S + +I L+DG ++ NV A++ + + TI++ Sbjct: 192 S------------------RVVILLSDG--SDTSGNVLPRDSARLAEQLGVTVHTIALGP 231 Query: 370 ----NASPN----GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 A L+ S + V N D L V I +L Sbjct: 232 EDMETAPQTRDAVDTATLRDIAELSGGRTFRVRNTDDLRAVTAEIDRL 279 >gi|2623765|gb|AAB86530.1| Lu-ECAM-1 [Bos taurus] Length = 794 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 78/238 (32%), Gaps = 61/238 (25%) Query: 177 IPALLRIEMGERPIFL--------IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + P F + LV+D SGSM ++ Sbjct: 286 TSPMTEMNPPTHPTFSLLKSKQRVVCLVLDKSGSMSAE-------------------DRL 326 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSL 287 + A L+L + +++ +G++ + + E + T+ V Q +T + + Sbjct: 327 FQMNQAAELYLIQV-----IEKGSLVGMVTFDSVAEIQNHLTRITDDNVYQKITAKLPQV 381 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +K +Q + + + + II LTDGE+N S Sbjct: 382 ANGGTSICRGLKAGFQAIIHSDQSTSGSE----------------IILLTDGEDNEINS- 424 Query: 348 VNTIKIC-DKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNVVNADSLIHVF 402 C + K + I TI++ + L L ++ + + L + F Sbjct: 425 ------CFEDVKRSGAIIHTIAL-GPSAAKELETLSN-MTGGYRFFANKDITGLTNAF 474 >gi|317455060|pdb|2XWB|F Chain F, Crystal Structure Of Complement C3b In Complex With Factors B And D gi|317455061|pdb|2XWB|H Chain H, Crystal Structure Of Complement C3b In Complex With Factors B And D Length = 732 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/226 (14%), Positives = 78/226 (34%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D A K L+ ++ + Sbjct: 227 IVLDPSGSMNIYLVLDGSGSIGASDFTG--------AKKCLVNLIEKVASYGVKPRY--- 275 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 276 GLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 332 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN-- 360 ++ + II +TDG +N + + K ++N Sbjct: 333 -------SWPDDVPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPR 385 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 386 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 431 >gi|251837060|pdb|3HRZ|D Chain D, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|251837064|pdb|3HS0|D Chain D, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|251837068|pdb|3HS0|I Chain I, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|317455073|pdb|2XWJ|I Chain I, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455074|pdb|2XWJ|J Chain J, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455075|pdb|2XWJ|K Chain K, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455076|pdb|2XWJ|L Chain L, Crystal Structure Of Complement C3b In Complex With Factor B Length = 741 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/226 (14%), Positives = 78/226 (34%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D A K L+ ++ + Sbjct: 236 IVLDPSGSMNIYLVLDGSGSIGASDFTG--------AKKCLVNLIEKVASYGVKPRY--- 284 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 285 GLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 341 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN-- 360 ++ + II +TDG +N + + K ++N Sbjct: 342 -------SWPDDVPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPR 394 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 395 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 440 >gi|293361347|ref|XP_576462.3| PREDICTED: collagen type VI alpha 5 [Rattus norvegicus] Length = 1730 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 54/344 (15%), Positives = 117/344 (34%), Gaps = 58/344 (16%) Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIVNSSRISM 129 + +S + L A F+ + S G + E+ + R Sbjct: 722 DGMSPINRNTLTGSALTFVNEYFDISKGGRPQVRKFLILLTDGEAQDEVGGPAMALRSKS 781 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + + +N T + S + ++ +HL + K++ + AL + E Sbjct: 782 VTIFSVGVYGANRTQLEEISGEGSLVFHVENFDHL--KTIESKLIFRVCALHDCKRIE-- 837 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I V+D SGS+ P Q+ + L + ++ + Sbjct: 838 LLDIVFVLDHSGSIG-------------PREQESM----------INLTIHLVEKADVGR 874 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTS 307 + V G + Y+ E T R +T + T + A++ A IL Sbjct: 875 DRVQFGALMYSNNPEILFYL--NTYSSRSAITEHLKRPRDTRGDTYTAKALQHA-NILF- 930 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + ++ +I +TDGE+++ +T + + ++ I I + Sbjct: 931 ----------MEEHGSRLKQNVRQLMIVITDGESHDRDKLNDTAR---ELRDKGITIFAV 977 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + Q L+T E +V N D L ++ + +++ + Sbjct: 978 GVG--RANQDELETMAGKKENTIHVDNFDKLRDIYLPLQEILCN 1019 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 81/244 (33%), Gaps = 46/244 (18%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 D R+ F ++ + +Y + + + + E I +VD SGS+ Sbjct: 619 DERVSFGQNFDSLKYIKN--EIVHRICSDKGCEDMKADIMFLVDSSGSIGL--------- 667 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 T +K + + I++ + +G++ ++ ++ + Sbjct: 668 -----------TNFEKMKTFMKNLVGKIEI---GADRSQVGVVQFSDYNKEEFQL--NKY 711 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 R+ V +D + + + LT F N + K +KF+I Sbjct: 712 STREEVYAAIDGM--------SPINR--NTLTGSALT--FVNEYFDISKGGRPQVRKFLI 759 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 LTDGE + + + I ++ + + Q L+ ++V N Sbjct: 760 LLTDGEAQDEVGGPAM-----ALRSKSVTIFSVGVYGANRTQ--LEEISGEGSLVFHVEN 812 Query: 395 ADSL 398 D L Sbjct: 813 FDHL 816 >gi|293349450|ref|XP_001073278.2| PREDICTED: collagen type VI alpha 5-like [Rattus norvegicus] Length = 2640 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 54/344 (15%), Positives = 117/344 (34%), Gaps = 58/344 (16%) Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIVNSSRISM 129 + +S + L A F+ + S G + E+ + R Sbjct: 722 DGMSPINRNTLTGSALTFVNEYFDISKGGRPQVRKFLILLTDGEAQDEVGGPAMALRSKS 781 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + + +N T + S + ++ +HL + K++ + AL + E Sbjct: 782 VTIFSVGVYGANRTQLEEISGEGSLVFHVENFDHL--KTIESKLIFRVCALHDCKRIE-- 837 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I V+D SGS+ P Q+ + L + ++ + Sbjct: 838 LLDIVFVLDHSGSIG-------------PREQESM----------INLTIHLVEKADVGR 874 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTS 307 + V G + Y+ E T R +T + T + A++ A IL Sbjct: 875 DRVQFGALMYSNNPEILFYL--NTYSSRSAITEHLKRPRDTRGDTYTAKALQHA-NILF- 930 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + ++ +I +TDGE+++ +T + + ++ I I + Sbjct: 931 ----------MEEHGSRLKQNVRQLMIVITDGESHDRDKLNDTAR---ELRDKGITIFAV 977 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + Q L+T E +V N D L ++ + +++ + Sbjct: 978 GVG--RANQDELETMAGKKENTIHVDNFDKLRDIYLPLQEILCN 1019 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 81/244 (33%), Gaps = 46/244 (18%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 D R+ F ++ + +Y + + + + E I +VD SGS+ Sbjct: 619 DERVSFGQNFDSLKYIKN--EIVHRICSDKGCEDMKADIMFLVDSSGSIGL--------- 667 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 T +K + + I++ + +G++ ++ ++ + Sbjct: 668 -----------TNFEKMKTFMKNLVGKIEI---GADRSQVGVVQFSDYNKEEFQL--NKY 711 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 R+ V +D + + + LT F N + K +KF+I Sbjct: 712 STREEVYAAIDGM--------SPINR--NTLTGSALT--FVNEYFDISKGGRPQVRKFLI 759 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 LTDGE + + + I ++ + + Q L+ ++V N Sbjct: 760 LLTDGEAQDEVGGPAM-----ALRSKSVTIFSVGVYGANRTQ--LEEISGEGSLVFHVEN 812 Query: 395 ADSL 398 D L Sbjct: 813 FDHL 816 >gi|159901411|ref|YP_001547658.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159894450|gb|ABX07530.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 337 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 71/194 (36%), Gaps = 41/194 (21%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 E +R + +++D S SM QD + +++ A K +L LD + Sbjct: 81 ETLKRSGLQVLILLDGSRSMAA---------------QDVRPSRIDASKRMVLALLDRL- 124 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQAY 302 E +G++ + + + R V L L TD Sbjct: 125 ------EGNQVGMLMFGSSSYVQFPLTSDLAAARSLVEPINPRGLSLGGTD--------V 170 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + ++ RSF I + + +I +TDG +++ +S+ + +A + + Sbjct: 171 EEVITEGLRSF---------PIGQIE-GRTMILITDGGDSDEQSDGEAVAAAREAAKMGL 220 Query: 363 KIVTISINASPNGQ 376 I TI + GQ Sbjct: 221 TIHTIGMATEAGGQ 234 >gi|72162840|ref|YP_290497.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71916572|gb|AAZ56474.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 609 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 81/239 (33%), Gaps = 31/239 (12%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + A+L R + LV+D SGSM ++ T++ K Sbjct: 392 SANVLNAMLENWAELRKPANVLLVIDTSGSMQESVPG-------------TGSTRLELAK 438 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A + LD +GL ++T +E N + W R+ V + T Sbjct: 439 EAAITSLDEFSDSD------RVGLWMFSTDLEDNGQ-DW-----RELVPLGPLGASVNGT 486 Query: 293 DSTPAMKQAYQILTSDKKRS-FFTNFFRQGVKIPSLPFQKF--IIFLTDGEN---NNFKS 346 + + L + T + ++FLTDG+N N Sbjct: 487 PRREELAERISNLPPGGGTGLYDTALAAHTLVAEHSRPDAINAVVFLTDGKNEDLNGISL 546 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + ++I TIS + + + + ++ Y+ + S+ VF+ + Sbjct: 547 EKLLDSITPEPGQQGVRIFTISYGEDADLKTMTQIAEATNAAAYDASDPQSIDEVFEAV 605 >gi|326926921|ref|XP_003209645.1| PREDICTED: integrin alpha-11-like [Meleagris gallopavo] Length = 1195 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 61/182 (33%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 188 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVAAASHIEQRGGTET--- 243 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + AY I + + K ++ +I +TDGE + + K+ + + Sbjct: 244 -RTAYGIEFARSEA---------FQKGGRKGAKRVMIVITDGE---SHDSPDLEKVIEDS 290 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 +++ + +++ +K S P+ + +NV + +L + + + Sbjct: 291 EKDNVTRYAVAVLGYYNRRGINPEAFLNEIKFIASDPDDKHFFNVTDEAALKDIVDALGE 350 Query: 408 LM 409 + Sbjct: 351 RI 352 >gi|118096010|ref|XP_413930.2| PREDICTED: similar to integrin alpha 11 subunit [Gallus gallus] Length = 1191 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 61/182 (33%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 188 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVAAASHIEQRGGTET--- 243 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + AY I + + K ++ +I +TDGE + + K+ + + Sbjct: 244 -RTAYGIEFARSEA---------FQKGGRKGAKRVMIVITDGE---SHDSPDLEKVIEDS 290 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 +++ + +++ +K S P+ + +NV + +L + + + Sbjct: 291 EKDNVTRYAVAVLGYYNRRGINPEAFLNEIKFIASDPDDKHFFNVTDEAALKDIVDALGE 350 Query: 408 LM 409 + Sbjct: 351 RI 352 >gi|313159754|gb|EFR59111.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 340 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 70/225 (31%), Gaps = 43/225 (19%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 +++ L+FI + ++ + + L VD+S SM Sbjct: 50 VSTGRVTLKFILFCTAVTLLILAAARPQFGSKLREEKTQGVEMMLAVDVSNSMLAE---- 105 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 D + ++ K A+ D + +GLI + + + + Sbjct: 106 -----------DFEPNRLERTKYAINKLFDGL-------HQDRVGLIVFAGEPKVQLPIT 147 Query: 271 WGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + + R SL ++ T A+ QA + + + + Sbjct: 148 SDYRMAKAFAKRIDPSLVPVQGTAIGKALSQALMSFSGETEENH---------------- 191 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 + +I +TDGEN+ + E I+I TI I Sbjct: 192 SRVVILITDGENHEDDALAAARHA----AEMGIRIYTIGIGTPEG 232 >gi|168699403|ref|ZP_02731680.1| hypothetical protein GobsU_07777 [Gemmata obscuriglobus UQM 2246] Length = 354 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 45/251 (17%), Positives = 82/251 (32%), Gaps = 59/251 (23%) Query: 189 PIFLIELV--VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 P I LV +D+SGSM D AP +++ A + AL LFL Sbjct: 85 PAKGIALVVALDVSGSMGAE---DVVWTPGAP-----SVSRLEAARRALKLFLAGGAAPD 136 Query: 247 HVKEDVY----MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQA 301 D +GL+ + E + + + T+ ++ +A Sbjct: 137 GTAFDPRPGDAVGLVAFAAVPETVCPATLNHSVLFKVADALQPKGGADAGTNIGDSLAEA 196 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-------- 353 L + ++S + +I L+DGE+N K +V + Sbjct: 197 VIRLDAADQKS------------------RVLILLSDGEHNILKEDVRDAQRPGIDRTLK 238 Query: 354 ----CDKAKENFIKIVTI------SINASP-------NGQRLLKTCVS-SPEYHYNVVNA 395 A +++ TI + A P G++ LK + + + Sbjct: 239 PREAAQLAANLGVRVYTIDAGGDPPLGAPPDAVAQRFAGRKALKDVAEMTGGKSFQATSG 298 Query: 396 DSLIHVFQNIS 406 L+ ++ IS Sbjct: 299 AELLSAYREIS 309 >gi|8567336|ref|NP_059502.1| calcium-activated chloride channel regulator 1 precursor [Mus musculus] gi|81881572|sp|Q9D7Z6|CLCA1_MOUSE RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 3; Short=mCLCA3; AltName: Full=Protein gob-5; Flags: Precursor gi|3721912|dbj|BAA33743.1| gob-5 [Mus musculus] gi|15919901|dbj|BAB25815.2| unnamed protein product [Mus musculus] gi|74201990|dbj|BAE22995.1| unnamed protein product [Mus musculus] gi|109731429|gb|AAI16320.1| Chloride channel calcium activated 3 [Mus musculus] gi|109732845|gb|AAI16319.1| Chloride channel calcium activated 3 [Mus musculus] gi|148680065|gb|EDL12012.1| chloride channel calcium activated 3 [Mus musculus] Length = 913 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 79/217 (36%), Gaps = 54/217 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM D + +M + LL V++ ++ Sbjct: 310 LVLDKSGSML----------------NDDRLNRMNQA--------SRLFLLQTVEQGSWV 345 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R + + + ++ T ++ A+ ++ K+ + Sbjct: 346 GMVTFDSAAYVQSELKQLNSGADRDLLIKHLPTVSAGGTSICSGLRTAFTVI----KKKY 401 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C D K++ I T+++ Sbjct: 402 PTDGSE-------------IVLLTDGEDN-------TISSCFDLVKQSGAIIHTVAL-GP 440 Query: 373 PNGQRL--LKTCVSSPEYH-YNVVNADSLIHVFQNIS 406 + L L + + + V + L+ F +S Sbjct: 441 AAAKELEQLSKMTGGLQTYSSDQVQNNGLVDAFAALS 477 >gi|160894031|ref|ZP_02074810.1| hypothetical protein CLOL250_01586 [Clostridium sp. L2-50] gi|156864409|gb|EDO57840.1| hypothetical protein CLOL250_01586 [Clostridium sp. L2-50] Length = 1391 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/365 (9%), Positives = 118/365 (32%), Gaps = 58/365 (15%) Query: 52 SMESANNAAI-LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKR-FIKNHIKESLSGY 109 + + + + ++ +S + + + ++ IS+ + A ++ A + + + Sbjct: 406 HLHNVIDNSAKISDSSLLKDAIDEVVTQYNQISDDSYNAELNAAVNDMVDKQSSQVVPVS 465 Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSN-NTIFYNMDVMTSYDYRLQFIEHLLNQR 168 + + + + ++ +R+D+S +I +++ + D + + + + Sbjct: 466 EETINGSFNSYVATTLKYDRINIHISRIDTSAYPSIQAYININGTKDSKEELADQFTKED 525 Query: 169 YNQKIVSFIPALLRIEMG-ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + + + G E I +V+D SGSM A ++ + + + Sbjct: 526 FTVIDTQYEITDFTLNSGAESEAVSIGIVMDKSGSMEGAAIANAKQAATEAVEHITSEKM 585 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 M ++ Y + + ++ + D Sbjct: 586 M---------------------------IVSYDNEAYLEQSLTSRSGTLKNSIAAISDG- 617 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ + + A L ++K +I ++DG++ + + Sbjct: 618 --GGTNISAGLNLALDNLEAEKGSRA-------------------VILMSDGQDGGSEED 656 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNIS 406 + D+A + I + T+ + ++ + + L ++ + Sbjct: 657 MQAAT--DRAAKLGISVYTVGFGECDD--AYMQAIAEVTGGKFVKASASTELSDIYLYLQ 712 Query: 407 QLMVH 411 + +V+ Sbjct: 713 KYIVN 717 >gi|315223608|ref|ZP_07865462.1| aerotolerance protein BatA [Capnocytophaga ochracea F0287] gi|314946389|gb|EFS98384.1| aerotolerance protein BatA [Capnocytophaga ochracea F0287] Length = 340 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 39/245 (15%), Positives = 72/245 (29%), Gaps = 65/245 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I + +D+S SM D K + ALK F+ + Sbjct: 99 IDIVMAIDVSASMLSK---------------DLKPNRFEALKKVASQFVK-------DRP 136 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + +GL+ Y + + ++ I T + A L K Sbjct: 137 NDRIGLVIYAGESYTKTPVTTDKGIILNALSELTYGQIEDGTAIGMGLATAVNRLKESKA 196 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 +S + II LTDG NN + T + A E I++ TI I Sbjct: 197 KS------------------RVIILLTDGVNNTGFIDPQTAA--ELAAEYGIRVYTIGIG 236 Query: 371 ASPNGQR----------------------LLKTCVS-SPEYHYNVVNADSLIHVFQNISQ 407 ++ L+K + ++ + L ++ I++ Sbjct: 237 SNGTALSPYALNPDGSIMYRMLQVEIDEPLMKKIAEVTHGRYFRATDNQKLQQIYNEINK 296 Query: 408 LMVHR 412 + + Sbjct: 297 METTK 301 >gi|126657060|ref|ZP_01728231.1| von Willebrand factor type A domain protein [Cyanothece sp. CCY0110] gi|126621603|gb|EAZ92313.1| von Willebrand factor type A domain protein [Cyanothece sp. CCY0110] Length = 328 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 61/190 (32%), Gaps = 36/190 (18%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + L+VDLS SM Q K ++ A+K + F++ Sbjct: 88 TQVQSARDLMLLVDLSQSMEAK---------DFQDQQGNKIDRLEAVKLVVDDFIER--- 135 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + T+ + + E R + + T A+ A Q Sbjct: 136 ----REGDRIGLILFGTKAYLQVPFTQDLETARFLLDEAQIGMAGAQTMLGDAIGLAIQT 191 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 K + + +I LTDG + + + A + I I Sbjct: 192 FEDSKTEN------------------RVLILLTDGNDTGSQVPPDQAA--KFAAQRNIVI 231 Query: 365 VTISINASPN 374 TI+I Sbjct: 232 YTIAIGNPET 241 >gi|261414506|ref|YP_003248189.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261370962|gb|ACX73707.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] Length = 227 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 57/189 (30%), Gaps = 38/189 (20%) Query: 195 LVVDLSGSMHCA-MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 LV+D SGSM +N E V T + A + +++ F Sbjct: 23 LVLDTSGSMEGDSINELNEGVRLFYDAVRSDETALYAAEISVVTF--------GGHASCQ 74 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G + + + T AM A +L K Sbjct: 75 AGFSTLEHQPDAPQFYA------------------DGGTPMGEAMNMALDMLEKRK---- 112 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIK-ICDKAKENFIKIVTISINA 371 + +Q +I+ +TDG N ++ +I+ CD + + I I I Sbjct: 113 -----SEYKASGVDYYQPWIVLMTDGMPNGSQAELSRSIQRTCDMINDRKLTIFPIGIGE 167 Query: 372 SPNGQRLLK 380 + L + Sbjct: 168 DADMDVLAR 176 >gi|221369290|ref|YP_002520386.1| von Willebrand factor, type A precursor [Rhodobacter sphaeroides KD131] gi|221162342|gb|ACM03313.1| von Willebrand factor, type A precursor [Rhodobacter sphaeroides KD131] Length = 328 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 62/177 (35%), Gaps = 37/177 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +D+SGSM + T++ A+K F++ ++ Sbjct: 90 IVLTLDMSGSMLIEDFDIDGVQS----------TRLEAVKRVARSFVEE-------RQGD 132 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL + R ++ V + + + + T + A + +T S Sbjct: 133 RIGLALFANRAYVAAPLTFDLAAVGRAIEEASIGITGRSTAIADGLGLALKRVTESGAAS 192 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + I+ L+DG++N + + + A + ++I TI++ Sbjct: 193 ------------------RVIVLLSDGQDNAHQIDARQVA--GLAARHGVRIHTIAL 229 >gi|126306129|ref|XP_001365364.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Monodelphis domestica] Length = 895 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 50/239 (20%), Positives = 91/239 (38%), Gaps = 67/239 (28%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL I +++ + Sbjct: 309 LVLDKSGSMASG-------------------DRLNRLNQASKLFLLQI-----IEKGSWA 344 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E T+ R + + ++ T ++ A+ ++ K+ F Sbjct: 345 GMVTFDSSATIQSELIQIETDAQRNSLITRLPTVAGGGTSICSGLRTAFTVI----KKKF 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISI--N 370 T+ I+ LTDGE+N TI C D+ K++ I T+++ + Sbjct: 401 STDGSE-------------IVLLTDGEDN-------TISTCFDEVKQSGAIIHTVALGPS 440 Query: 371 ASPNGQRL------LKTCVSSPEYHYNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 A P ++L +KT + N N + LI F +S + R ++ KG Sbjct: 441 ADPGLEKLAEMTGGMKTTATD-----NAQN-NGLIDAFSALSSGNGAITQRSIQLVSKG 493 >gi|255570576|ref|XP_002526245.1| protein binding protein, putative [Ricinus communis] gi|223534439|gb|EEF36142.1| protein binding protein, putative [Ricinus communis] Length = 540 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 66/234 (28%), Gaps = 48/234 (20%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 ++ V RP + V+D+S SM KM Sbjct: 79 SKLKVMLELTGGDSSSYGRPGLDLVAVLDVSRSMEGD--------------------KME 118 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LI 288 +K A+L + + + ++ ++ + T K ++ ++ Sbjct: 119 KMKTAMLFIIKKLGPTD------RLSIVTFSGGANRLCPLRQTTGKSQEEFENLINGLNA 172 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T+ T ++ A ++L I+ ++DGE N Sbjct: 173 DGATNITAGLQTALKVLKGRSFNGERVVG---------------IMLMSDGEQNAGSDAT 217 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I IN P G + + S +V N DSL F Sbjct: 218 GV-----SVGNVPIHTFGFGINHEPKGLKAIAH-NSIGGTFSDVQNIDSLTKAF 265 >gi|91783676|ref|YP_558882.1| hypothetical protein Bxe_A2138 [Burkholderia xenovorans LB400] gi|91687630|gb|ABE30830.1| Conserved hypothetical protein containing von Willebrand factor type A domain [Burkholderia xenovorans LB400] Length = 337 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 68/186 (36%), Gaps = 36/186 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + P + L +DLSGSM DP ++ +++A+K + F+ Sbjct: 82 DEPARDLMLAIDLSGSMATRDFVDPAG---------ERMDRLSAVKRVVANFVAK----- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GL+ + + + VR + + + T A+ +++ Sbjct: 128 --RKGDRIGLVVFGDAAYPQAPLTLDHDSVRILLDQMQIGMAGPRTAIGDAIGLTVKLMA 185 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + +K +I LTDG N+ S + + + AK++ + + T Sbjct: 186 DSHAQ------------------EKVLILLTDG--NDTSSAIPPERAAEIAKQHKLVVHT 225 Query: 367 ISINAS 372 + I Sbjct: 226 VGIGDP 231 >gi|60477748|gb|AAH90753.1| Matn4 protein [Danio rerio] Length = 261 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 74/197 (37%), Gaps = 33/197 (16%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K + +D +D+ + +GL+ Y++ V S + + + Sbjct: 41 RPQNFELVKQFVNQVVDQLDV---SAKGTRVGLVQYSSCVRTEFPLS--MYHSKDEIKKA 95 Query: 284 M--DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + K T + A+K + S+ + + IP + +F TDG Sbjct: 96 VMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEK-----NIPRVGL----VF-TDG-- 143 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + ++ + KAKE I + + + + + L+ S P ++ + + Sbjct: 144 -RSQDDIQ--EWAKKAKEAGITMYAVGVGKAVEDE--LREIASDPVEKHFFYSAD----- 193 Query: 400 HVFQNISQLMVHRKYSV 416 F ISQ+ + K +V Sbjct: 194 --FTAISQIAENLKLNV 208 >gi|332706285|ref|ZP_08426352.1| hypothetical protein LYNGBM3L_16440 [Lyngbya majuscula 3L] gi|332354933|gb|EGJ34406.1| hypothetical protein LYNGBM3L_16440 [Lyngbya majuscula 3L] Length = 413 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 85/244 (34%), Gaps = 53/244 (21%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 NQ +Q+ +S + + + LV+D SGSMH Sbjct: 17 NQPSSQRQLSMAISAIAASSSRNVPLNLCLVLDHSGSMHGQ------------------- 57 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR---VEKNIEPSWGTEKVRQYVTR 282 + +K A + ++ + + ++ + R + +N +++++ + R Sbjct: 58 -PLETVKQAAVGLIERLQPDD------RLSIVAFDHRAKVLVRNQPMG-NLDQIKRKINR 109 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T +K + L K+ + + LTDGEN Sbjct: 110 LGAD---GGTAIDEGLKLGVKELIKAKQDTVSQ-----------------VFLLTDGENE 149 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHV 401 + +N + IK+ + A EN + I ++ A+ N Q +L+ + + + + Sbjct: 150 HG-NNESCIKLAELAAENNLTINSLGFGANWN-QDILEKIADIATGSLSYIEEPEQALSE 207 Query: 402 FQNI 405 F + Sbjct: 208 FARL 211 >gi|114557513|ref|XP_001143250.1| PREDICTED: calcium-activated chloride channel regulator 1 [Pan troglodytes] Length = 914 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 84/240 (35%), Gaps = 69/240 (28%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL + + ++ Sbjct: 309 LVLDKSGSMA-------------------TGNRLNRLNQAGQLFL--LQTVELGS---WV 344 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R + + + + T ++ A+ ++ ++ + Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKRLPAAASGGTSICSGLRLAFTVI----RKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 401 PTDGSE-------------IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 Q L L+T S V N + LI F +S + R + KG Sbjct: 440 SAAQELEELSKMTGGLQTYASD-----QVQN-NGLIDAFGALSSGNGAVSQRSIQLESKG 493 >gi|48428050|sp|Q864V9|CFAB_GORGO RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690187|gb|AAM10005.1| complement factor B precursor [Gorilla gorilla] Length = 764 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/229 (14%), Positives = 80/229 (34%), Gaps = 45/229 (19%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A K L+ ++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTG--------AKKCLVNLIEKVASYGVKPRY--- 309 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 310 GLVTYATYPKIWVKVSDPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 366 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE--------- 359 ++ + II +TDG +N + I + D+ ++ Sbjct: 367 -------SWPDDVPPEGWNRTRHVIILMTDGLHNM---GGDPITVIDEIRDLLYIGKDHK 416 Query: 360 ------NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 417 NPREDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 465 >gi|330952765|gb|EGH53025.1| von Willebrand factor, type A [Pseudomonas syringae Cit 7] Length = 262 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 67/190 (35%), Gaps = 43/190 (22%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 131 RKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRLRMR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S + ++ +TDG NN + + T A E +KI I Sbjct: 191 PATS------------------RALVLVTDGANNAGQIDPITAAR--LAAEEGVKIYPIG 230 Query: 369 INASPNGQRL 378 I + P+ L Sbjct: 231 IGSDPDKDAL 240 >gi|296159241|ref|ZP_06842067.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] gi|295890500|gb|EFG70292.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] Length = 345 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 68/186 (36%), Gaps = 36/186 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + P + L +DLSGSM DP ++ +++A+K + F+ Sbjct: 90 DEPARDLMLAIDLSGSMATRDFVDPAG---------ERMDRLSAVKRVVADFVAK----- 135 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GL+ + + + VR + + + T A+ +++ Sbjct: 136 --RKGDRIGLVVFGDAAYPQAPLTLDHDSVRILLDQMQIGMAGPRTAIGDAIGLTVKLMA 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + +K +I LTDG N+ S + + + AK++ + + T Sbjct: 194 DSHAQ------------------EKVLILLTDG--NDTSSAIPPERAAEIAKQHKLVVHT 233 Query: 367 ISINAS 372 I I Sbjct: 234 IGIGDP 239 >gi|229495775|ref|ZP_04389503.1| BatB protein [Porphyromonas endodontalis ATCC 35406] gi|229317349|gb|EEN83254.1| BatB protein [Porphyromonas endodontalis ATCC 35406] Length = 338 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 58/188 (30%), Gaps = 43/188 (22%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + GE + +D+S SM D ++ K + LD Sbjct: 81 KENPGEAKGIEAMIALDISNSMLAE---------------DLSPNRLQFAKLTIHRLLDY 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + +G++ + + + ++ V D+ P M Sbjct: 126 LAES-------KVGVVVFAGNAYMQLPITTDLAMAKKMVD-----------DANPDM--- 164 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 L++ G K II TDGEN+ + ++ KAK Sbjct: 165 ---LSNQGTAIASAIDLSLGSFSDRHDVGKAIILFTDGENHEGDA----LEAAKKAKSQG 217 Query: 362 IKIVTISI 369 +K+ TI++ Sbjct: 218 VKVYTIAV 225 >gi|219847249|ref|YP_002461682.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541508|gb|ACL23246.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 842 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 69/215 (32%), Gaps = 45/215 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +R + L++D S SM + +K K A +L L ++ Sbjct: 391 QRAPISLLLIIDRSASMSASFGV----------------SKFDLAKEAAILALTALQAGD 434 Query: 247 HVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQI 304 +G++ + T I + G + + ++ + T+ A+ Sbjct: 435 ------RIGVLAFDTDTIWVIPFQAVGEGAAVAELQTRIATMAIGGGTNIERALAVGLPA 488 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L ++ + LTDG + + ++ + A+ I + Sbjct: 489 LAAEPHSVRHA------------------VLLTDG-RSYSNNYPRYQQLVETARAAQITL 529 Query: 365 VTISINASPNGQRLLKTCVS-SPEYHYNVVNADSL 398 TI+I + LL+ +Y V +A L Sbjct: 530 STIAI-GTDADTDLLEQLARWGNGRYYFVPDAADL 563 >gi|297565073|ref|YP_003684045.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296849522|gb|ADH62537.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 308 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 76/227 (33%), Gaps = 51/227 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L +D+SGSM D K +++ A K A F++ + V Sbjct: 88 VVLAIDVSGSMMAD---------------DLKPSRLDAAKAAARSFVERMPAG------V 126 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + + V + + T + L S K Sbjct: 127 KVGLVSFAAGAVLESGLTADHQGVIERIDLLE---RRANTAIGEGL------LESLKAFP 177 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 N V +P+ +I L+DG N + + +AK +++ TI + + Sbjct: 178 TGAN---HQVAVPAT-----VILLSDGRNRIGIAPQEAAQ---EAKRRGVRVYTIGVGSD 226 Query: 373 PNGQRL---------LKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 + L+ + ++ +AD L +++ + + Sbjct: 227 DPNASVDWAGFDEAELRGIAEVTGGRYFAADSADRLQEIYRELGSQI 273 >gi|187251530|ref|YP_001876012.1| von Willebrand factor type A [Elusimicrobium minutum Pei191] gi|186971690|gb|ACC98675.1| Von Willebrand factor type [Elusimicrobium minutum Pei191] Length = 373 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 47/239 (19%), Positives = 80/239 (33%), Gaps = 57/239 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D SGSM QD ++ A K A F+ + + + Sbjct: 148 DIILAIDTSGSMAA---------------QDFDPNRITAAKVAAANFI-----ANRLSD- 186 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G++ + + + E + ++ + + T A+ + L Sbjct: 187 -RIGIVVFASDAMLQSPLTLDYESLLDFLADVRIGMVRTDGTAIGDAIAVSSVHLERSPA 245 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---FIKIVTI 367 RS K II LTDGE+N+ I D AK IK+ TI Sbjct: 246 RS------------------KVIILLTDGESNSG-----VISPLDAAKTAALYGIKVYTI 282 Query: 368 SINASPNGQRL------LKTCVS-SPEYHYNVVNADSLIHVFQNI-SQLMVHRKYSVIL 418 + + + L L+ + +Y N L ++ I S K SV++ Sbjct: 283 ATISKNSRDSLDFKPDDLEQIAKLTGGKYYRAYNEAELTKIYAEIDSLEKTEFKNSVLV 341 >gi|148656915|ref|YP_001277120.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569025|gb|ABQ91170.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 561 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 66/214 (30%), Gaps = 44/214 (20%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + A+ I + + + V+D+SGSM ++ K AL Sbjct: 365 VLAAIRSIWVENKKRVDVMAVLDVSGSMA-------------------DEARLEQAKTAL 405 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 +F++ + GL ++ S K + + R T Sbjct: 406 RIFIEQLQDDDG------FGLTIFSDSATVLTPVSPIGPKRAEILNRIAGLTPRGGTRLL 459 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 +AYQ +++ P + ++ LTDG +N + N + Sbjct: 460 DTTVEAYQEMSA----------------TPPGQRIRAVVVLTDGLDNKSQRNAQDVLNLL 503 Query: 356 KAKENF--IKIVTISINASPNGQRLLKTCVSSPE 387 + IK+ T++ LLK + Sbjct: 504 RQDREGYSIKVFTVAFGG-DADVNLLKEIAEATG 536 >gi|953237|gb|AAA99719.1| collagen type XII alpha-1 precursor [Mus musculus] Length = 3067 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 49/332 (14%), Positives = 103/332 (31%), Gaps = 54/332 (16%) Query: 91 IDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA----NNRLDSSNNTIFY 146 ++ N + LS + V +I +T MA ++ L T Sbjct: 330 GEEVIEPPSNLVVTELSSKYIRLSWDPSPSAVTGYKILLTPMAAGSRHHALSVGPQTTTL 389 Query: 147 NM---DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM 203 N+ T Y + ++ L + + + +P +++ V+ S + Sbjct: 390 NVRDLTADTEYQISVFAMKGL-------TSSEPTSVMEKTQPKTQP---MKVQVECSRGV 439 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + S I K+ A L L +S + V + L+ Y+ Sbjct: 440 DIKADIVFLVDGSYSIGT-ANFVKV----RAFLEVLAKSFEISPNR--VQISLVQYSRDP 492 Query: 264 EKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 E + + + T++ AM + + K S Sbjct: 493 HTEFTLKEFNRVEDIIKAINTFP--YRGGSTNTGKAMTYVREKIFVPNKGS--------- 541 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 K +I +TDGE+++ + K + + ++I + + + L+ Sbjct: 542 ----RSNVPKVMILITDGESSDAFRDPAI-----KLRNSDVEIFAVGVK--DAVRSELEA 590 Query: 382 CVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 S P + + V + FQ IS ++ Sbjct: 591 IASPPAETHVFTVED----FDAFQRISFVLTQ 618 >gi|281182610|ref|NP_001162037.1| matrilin-4 [Pongo abelii] gi|134093133|gb|ABO52993.1| matrilin 4 isoform 1 precursor [Pongo abelii] Length = 581 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRSEDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 372 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 429 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 430 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 474 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 475 EGIVMYAVGVGKAVEAE--LREIASKPAELHVSYAPD-----FGTMTHLLENLRGSI 524 >gi|4585469|gb|AAD25487.1|AF127036_1 calcium-activated chloride channel protein 1 [Homo sapiens] gi|119593592|gb|EAW73186.1| chloride channel, calcium activated, family member 1 [Homo sapiens] gi|189067292|dbj|BAG37002.1| unnamed protein product [Homo sapiens] Length = 914 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 84/240 (35%), Gaps = 69/240 (28%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL + + ++ Sbjct: 309 LVLDKSGSMA-------------------TGNRLNRLNQAGQLFL--LQTVELGS---WV 344 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R + + + + T ++ A+ ++ ++ + Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI----RKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 401 PTDGSE-------------IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 Q L L+T S V N + LI F +S + R + KG Sbjct: 440 SAAQELEELSKMTGGLQTYASD-----QVQN-NGLIDAFGALSSGNGAVSQRSIQLESKG 493 >gi|311254860|ref|XP_003125977.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Sus scrofa] Length = 874 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 83/234 (35%), Gaps = 56/234 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM + ++ + A FL I V+ ++ Sbjct: 255 LVLDKSGSMSSS-------------------NRLNRMNQAAKYFLMQI-----VENGSWV 290 Query: 255 GLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + + T R + + + T +++A++++ + Sbjct: 291 GMVHFDSTASIRSDLIQITGSNERDKLLGSLPTTASGGTSICSGIRRAFEVVRKLYSHTD 350 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 + I+ LTDGE+N T C D+ K++ I I++ Sbjct: 351 GSE----------------IVLLTDGEDN-------TAGACVDEVKQSGAIIHFIAL-GP 386 Query: 373 PNGQRLLKTCVSSPE-YHYNVVNADS--LIHVFQNIS---QLMVHRKYSVILKG 420 + +++ ++ + Y A++ LI F ++ + + + KG Sbjct: 387 SADKAVIEMSTATGGVHFYATDEAENNGLIDAFGALASGNTDISQQSLQLESKG 440 >gi|222616155|gb|EEE52287.1| hypothetical protein OsJ_34277 [Oryza sativa Japonica Group] Length = 367 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 73/219 (33%), Gaps = 40/219 (18%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D+SGSM DP +S ++K +++ LK+A+ + ++ Sbjct: 46 IDVVEVLDVSGSMG-----DPAMASSDFK-KNKPPSRLDVLKDAMKFIIRKLEDGD---- 95 Query: 251 DVYMGLIGYTTRVEKNIEP------SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + ++ + R K G + V T PA+++A ++ Sbjct: 96 --RLSIVAFNDRPVKEYSTGLLDISGNGRRIAEKKVDWLE---GRGGTALMPALEEAIRV 150 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L S FI+ LTDG++ + + K + Sbjct: 151 LDCRPGDSRNRVG--------------FILLLTDGDDTSGFRWSRDVINGAVGK---YPV 193 Query: 365 VTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHV 401 T + A+ + + LL S + N D + Sbjct: 194 HTFGLGAAHSSEALLYIAQESRGTYSFVDDENMDKIAGA 232 >gi|73960095|ref|XP_547299.2| PREDICTED: similar to calcium activated chloride channel 1 precursor [Canis familiaris] Length = 911 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 44/223 (19%), Positives = 83/223 (37%), Gaps = 66/223 (29%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL I V++ ++ Sbjct: 309 LVLDKSGSMA-------------------TGDRLKRLNQAGKLFLLQI-----VEQGSWV 344 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + + E + R +T+ + ++ T ++ A+ ++ K+ + Sbjct: 345 GMVTFDSAAQVQSELIQINSGTERDALTKSLPTVATGGTSICSGLRSAFAVI----KKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 401 PTDGAE-------------IVLLTDGEDN-------TISSCFNEVKQSGAVIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + L L+T S N + LI F +S Sbjct: 440 SAAKELEELSKMTGGLQTYASD-----QAQN-NGLIDAFGALS 476 >gi|311245368|ref|XP_003121804.1| PREDICTED: integrin alpha-11-like [Sus scrofa] Length = 1055 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 60/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 191 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 246 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 247 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 293 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 +++ + +++ +K S P+ + +NV + +L + + Sbjct: 294 EKDNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 353 Query: 408 LM 409 + Sbjct: 354 RI 355 >gi|288941617|ref|YP_003443857.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288896989|gb|ADC62825.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 341 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 60/178 (33%), Gaps = 37/178 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L +D+SGSM + +++A ++ F++ + Sbjct: 92 DLMLAIDVSGSMA----------QEDYELDGRPVSRLAVVRTVASAFVER-------RAG 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ V + + L + T A+ A + L + Sbjct: 135 DRLGLILFGTRAYLQTPLTFDGATVAAMLRDSVVGLAGRETAIGDAIGLAVKRLREQPEG 194 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 Q+ +I LTDG+N + A + +++ TI I Sbjct: 195 ------------------QRVLILLTDGDNTAGALDPLEAAE--LAAQAGVRVYTIGI 232 >gi|198421751|ref|XP_002123463.1| PREDICTED: similar to cartilage matrix protein [Ciona intestinalis] Length = 272 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 78/198 (39%), Gaps = 27/198 (13%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K+ + F D + D+ +G+I + + V + I VR + Sbjct: 46 RPKNFQTVKDYVKNFTDIFEAFGP--NDMQVGVIQFGSGVREEILL--NQFYVRHELMEA 101 Query: 284 MDSL--ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 +D++ + T + A+++ + G ++ + ++ +TDG++ Sbjct: 102 IDNIRYMETGTMTGLALRKLVTETLT----------VEHGARVDNPIVHTVVVIITDGKS 151 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 ++ T K +AK +I I I N + LL+ S P+ + + V N Sbjct: 152 QDYSRGGVT-KWTKEAKARGFEIFAIGIGRKANRKELLEM-ASEPKELHTFRVQN----- 204 Query: 400 HVFQNISQLMVHRKYSVI 417 F I ++ V+ K ++ Sbjct: 205 --FNAIKRVDVNLKDRIL 220 >gi|311234271|gb|ADP87125.1| Protein of unknown function DUF2134, membrane [Desulfovibrio vulgaris RCH1] Length = 440 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 37/79 (46%), Gaps = 5/79 (6%) Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKTCVSSP----EYHY 390 +T N K N ++ KAKE I++ I S + L+K+ SS +++Y Sbjct: 356 MTSHCENGGKLNAAMLEEARKAKEAGIEVFAIRFGDSDSVDVSLMKSIASSKAGTNDHYY 415 Query: 391 NVVNADSLIHVFQNISQLM 409 + +A + VF+ I + + Sbjct: 416 DAPSAYDIDDVFKKIGRQL 434 >gi|301770509|ref|XP_002920678.1| PREDICTED: integrin alpha-11-like [Ailuropoda melanoleuca] Length = 1203 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V VR V T++ Sbjct: 200 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVRDVVEAASHIEQRGGTET--- 255 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K ++ +TDGE + + K+ ++ Sbjct: 256 -RTAFGIEFARSEA---------FQKGGRKGAKKVMVVITDGE---SHDSPDLEKVIQQS 302 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 303 ERDNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 362 Query: 408 LM 409 + Sbjct: 363 RI 364 >gi|222616410|gb|EEE52542.1| hypothetical protein OsJ_34771 [Oryza sativa Japonica Group] Length = 654 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 74/218 (33%), Gaps = 39/218 (17%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 ++ + V+D+SGSM + V +A + + +++ LK ++ + + Sbjct: 62 ADLNSHVPLDVVAVLDVSGSM-------NDPVAAASPKSNLQGSRLDVLKASMKFVIRKL 114 Query: 243 DLLSHVKEDVYMGLIGYTT---RVEKNIEPSW---GTEKVRQYVTRDMDSLILKPTDSTP 296 + ++ + + + G + + R T P Sbjct: 115 ADGD------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQA---RGGTALMP 165 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+++A +IL RQG + FI+ LTDG++ Sbjct: 166 ALEEAVKIL-----------DERQGSSRNRVG---FILLLTDGDDTTGFRWTRDAIHGAV 211 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 AK + T + AS + + LL S + V + Sbjct: 212 AK---YPVHTFGLGASHDPEALLHIAQGSRGTYSFVDD 246 >gi|4009458|gb|AAC95428.1| calcium-dependent chloride channel-1 [Homo sapiens] Length = 914 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 84/240 (35%), Gaps = 69/240 (28%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL + + ++ Sbjct: 309 LVLDKSGSMA-------------------TGNRLNRLNQAGQLFL--LQTVELGS---WV 344 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R + + + + T ++ A+ ++ ++ + Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI----RKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 401 PTDGSE-------------IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 Q L L+T S V N + LI F +S + R + KG Sbjct: 440 SAAQELEELSKMTGGLQTYASD-----QVQN-NGLIDAFGALSSGNGAVSQRSIQLESKG 493 >gi|30794326|ref|NP_851361.1| epithelial chloride channel protein [Bos taurus] gi|2623763|gb|AAB86529.1| Lu-ECAM-1 [Bos taurus] Length = 905 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 78/238 (32%), Gaps = 61/238 (25%) Query: 177 IPALLRIEMGERPIFL--------IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + P F + LV+D SGSM ++ Sbjct: 286 TSPMTEMNPPTHPTFSLLKSKQRVVCLVLDKSGSMSAE-------------------DRL 326 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSL 287 + A L+L + +++ +G++ + + E + T+ V Q +T + + Sbjct: 327 FQMNQAAELYLIQV-----IEKGSLVGMVTFDSVAEIQNHLTRITDDNVYQKITAKLPQV 381 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +K +Q + + + + II LTDGE+N S Sbjct: 382 ANGGTSICRGLKAGFQAIIHSDQSTSGSE----------------IILLTDGEDNEINS- 424 Query: 348 VNTIKIC-DKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNVVNADSLIHVF 402 C + K + I TI++ + L L ++ + + L + F Sbjct: 425 ------CFEDVKRSGAIIHTIAL-GPSAAKELETLSN-MTGGYRFFANKDITGLTNAF 474 >gi|4009460|gb|AAC95429.1| calcium-dependent chloride channel-1 [Homo sapiens] Length = 914 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 42/234 (17%), Positives = 83/234 (35%), Gaps = 57/234 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL + + ++ Sbjct: 309 LVLDKSGSMA-------------------TGNRLNRLNQAGQLFL--LQTVELGS---WV 344 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R + + + + T ++ A+ ++ ++ + Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI----RKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 401 PTDGSE-------------IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVAL-GP 439 Query: 373 PNGQRL--LKTCVSSPEYH-YNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 Q L L + + + V + LI F +S + R + KG Sbjct: 440 SAAQELEELSKMTGGLQTYASDQVQNNGLIDAFGALSSGNGAVSQRSIQLESKG 493 >gi|51244490|ref|YP_064374.1| hypothetical protein DP0638 [Desulfotalea psychrophila LSv54] gi|50875527|emb|CAG35367.1| conserved hypothetical membrane protein (BatA) [Desulfotalea psychrophila LSv54] Length = 328 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 40/249 (16%), Positives = 73/249 (29%), Gaps = 59/249 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + I L VD+SGSM + K+ ++ +K+ + F+ Sbjct: 83 KSSGIDILLAVDVSGSMQAMDFTL----------NGKRTNRLEVVKDVMAKFISQ----R 128 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +GL+ + R P+ + + +I T A+ L Sbjct: 129 PNDS---IGLVAFAGRPYVVCPPTLDHNWLTLRLHSLSIGMIEDGTAIGSAIGTGVNRLR 185 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K + II LTDG NN K + AK +K+ T Sbjct: 186 EKK------------------SPSQIIILLTDGINNAGKVPPLIAA--EAAKSFKVKVYT 225 Query: 367 IS----------------------INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I + + L K + ++ + +SL V+ Sbjct: 226 IGAGTRGEAPIPITDAFGRRQLVRARVDIDDKTLSKVAQITGARYFRATDTESLEKVYAE 285 Query: 405 ISQLMVHRK 413 I+ + + Sbjct: 286 INSMETTSR 294 >gi|332828718|gb|EGK01410.1| hypothetical protein HMPREF9455_02243 [Dysgonomonas gadei ATCC BAA-286] Length = 330 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/281 (13%), Positives = 86/281 (30%), Gaps = 63/281 (22%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 + Y F+ ++ +++ ++ ++ I + +D+SG+M Sbjct: 49 STIRVYLRHFPFLLRVIAIALVIIVLARPQSVNSSDVSNSEGIDIVMALDISGTMMA--- 105 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 QD T++ A K F+ + + + +GL+ + Sbjct: 106 ------------QDFSPTRLEAAKKVAAEFI-------NDRPNDRIGLVIFGGESFTQCP 146 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + + +T +I T + + L K +S Sbjct: 147 LTTDHKVLLNLLTEVKFGMIEDGTAIGLGLANSVNRLKDSKSKS---------------- 190 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS----------INASPN---- 374 + +I LTDG NN + T + A I++ TI I Sbjct: 191 --RVVILLTDGSNNAGQIAPLTAA--ELAASYDIRVYTIGIGSRGTSTARIMTPYGLQTM 246 Query: 375 -------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + L + + ++ + SL ++ I Q+ Sbjct: 247 QVSGDFDERTLTEIAAITKGQYFRATDNTSLSAIYDEIDQM 287 >gi|296272313|ref|YP_003654944.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] gi|296096487|gb|ADG92437.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] Length = 301 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/155 (23%), Positives = 60/155 (38%), Gaps = 21/155 (13%) Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +GL+ + T V S+ ++ + ++ + T ++ + IL + K +S Sbjct: 130 IGLVVFGTSVLTASPLSFDKNSQKEIIKYIDIGIVGEQTAMFDSLATSINILKNSKAKSN 189 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 II LTDGE+N K I AK+ IKI TI I S Sbjct: 190 ------------------IIILLTDGEDNASKIPPQI--ILKLAKKYKIKIYTIGIGESN 229 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 Q L + + + D L+ V+ I++L Sbjct: 230 R-QMLSTISQETGAKSFLANSKDDLVEVYNTINKL 263 >gi|258544594|ref|ZP_05704828.1| von Willebrand factor type A domain protein [Cardiobacterium hominis ATCC 15826] gi|258520172|gb|EEV89031.1| von Willebrand factor type A domain protein [Cardiobacterium hominis ATCC 15826] Length = 563 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/250 (13%), Positives = 82/250 (32%), Gaps = 52/250 (20%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + K++ + +RP + ++D SGSM K+ Sbjct: 182 DAKLIRIAIQAADLAPEKRPPANLVFLIDTSGSMD-------------------DPDKLP 222 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 +K + F +++ + LI Y+ + + P+ G +K + Sbjct: 223 LVKKTVCHFAEALRADD------RISLITYSGSTAEILPPTAGDQK-ETIIAALKPLRAH 275 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T A++ AY + R N I+ TDG+ N S+ Sbjct: 276 GATAGGEALRMAYDA-AAKNYRKDGINR---------------ILLATDGDFNVGISDPA 319 Query: 350 T-IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD---------SLI 399 T +++ I + T+ + +++ + + +Y+ ++++ L Sbjct: 320 TLKNYVADKRKSGISLTTLGYGSGNYNDEMMEQLADAGDGNYSYIDSEAEAKKVLVRQLT 379 Query: 400 HVFQNISQLM 409 +++ + Sbjct: 380 STLATVARDI 389 >gi|84387243|ref|ZP_00990264.1| hypothetical protein V12B01_22476 [Vibrio splendidus 12B01] gi|84377890|gb|EAP94752.1| hypothetical protein V12B01_22476 [Vibrio splendidus 12B01] Length = 421 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 16/129 (12%), Positives = 46/129 (35%), Gaps = 2/129 (1%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + ++ +++ +++ FL + + + K +++A ++A LA A+ Sbjct: 1 MLYRVSQSPKKQQGLVAVMITAALLVFLAVSALAVDINHMVVNKTRLQNAVDSATLAAAT 60 Query: 67 KMVSNLSRLGDRFESIS--NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + ++ + E + N + + F I S F T + + Sbjct: 61 ILDNSKDKDAVDAEVGTALNAMAASTGNQEIDFSTASISIDYSNDPKDFTGTATFDSTDD 120 Query: 125 SRISMTHMA 133 + + A Sbjct: 121 VYVRVRVDA 129 >gi|281343950|gb|EFB19534.1| hypothetical protein PANDA_009430 [Ailuropoda melanoleuca] Length = 1112 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V VR V T++ Sbjct: 169 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVRDVVEAASHIEQRGGTET--- 224 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K ++ +TDGE + + K+ ++ Sbjct: 225 -RTAFGIEFARSEA---------FQKGGRKGAKKVMVVITDGE---SHDSPDLEKVIQQS 271 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 272 ERDNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 331 Query: 408 LM 409 + Sbjct: 332 RI 333 >gi|323450885|gb|EGB06764.1| hypothetical protein AURANDRAFT_71955 [Aureococcus anophagefferens] Length = 1008 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 69/195 (35%), Gaps = 31/195 (15%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +R IE+V+D+SGSM ++ E ++A Q + + K+A S+D Sbjct: 352 AQRSGVDIEIVLDVSGSMA----TESEVQDAAGNVQRHGFSTLDVCKHAARCVACSLD-- 405 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQI 304 + +GL+ Y + + + T V ++ L T+ ++ Sbjct: 406 ----DTCRLGLVAYDAQARVVVGLARVTPAHVAKVHAALEKLAPGTSTNLWGGLELGVDE 461 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAKENFIK 363 L + ++ LTDG NN+ + ++ + Sbjct: 462 LVGGAGDNARA-----------------VLLLTDGVPNNSPPEGEVAALRAKRLTKDGSE 504 Query: 364 IVTISINASPNGQRL 378 T+++ A+ G L Sbjct: 505 --TVAVFAAGFGYAL 517 >gi|296489197|gb|DAA31310.1| epithelial chloride channel protein [Bos taurus] Length = 905 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 78/238 (32%), Gaps = 61/238 (25%) Query: 177 IPALLRIEMGERPIFL--------IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + P F + LV+D SGSM ++ Sbjct: 286 TSPMTEMNPPTHPTFSLLKSKQRVVCLVLDKSGSMSAE-------------------DRL 326 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSL 287 + A L+L + +++ +G++ + + E + T+ V Q +T + + Sbjct: 327 FQMNQAAELYLIQV-----IEKGSLVGMVTFDSVAEIQNHLTRITDDNVYQKITAKLPQV 381 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +K +Q + + + + II LTDGE+N S Sbjct: 382 ANGGTSICRGLKAGFQAIIHSDQSTSGSE----------------IILLTDGEDNEINS- 424 Query: 348 VNTIKIC-DKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNVVNADSLIHVF 402 C + K + I TI++ + L L ++ + + L + F Sbjct: 425 ------CFEDVKRSGAIIHTIAL-GPSAAKELETLSN-MTGGYRFFANKDITGLTNAF 474 >gi|327270782|ref|XP_003220167.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 904 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 80/234 (34%), Gaps = 56/234 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D+SGSM+ ++ L+ A FL I ++ + Sbjct: 305 LVLDISGSMNGF-------------------DRIYRLRQAGEQFLLQI-----LETGSWA 340 Query: 255 GLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + ++ T VRQ ++ + + T+ +++ +Q+ + Sbjct: 341 GIVVFNSQALTKTYLKQITGDSVRQTLSAYLPTAAGGGTNICSGIREGFQVFLKKYPSTE 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 I+ LTDGE+ S C + + + I TI++ Sbjct: 401 GCE----------------IVLLTDGEDAGVSS-------CFAEVQRSGSIIHTIAL-GP 436 Query: 373 PNGQRLLKTCVSSPEYHYNVV---NADSLIHVFQNISQ---LMVHRKYSVILKG 420 + L + ++ +++ LI F IS + + + KG Sbjct: 437 SAAKELEMLADMTGGLKFSATDSLDSNGLIDAFSGISSGSGDISQQSIQLESKG 490 >gi|308068881|ref|YP_003870486.1| von Willebrand factor A [Paenibacillus polymyxa E681] gi|305858160|gb|ADM69948.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Paenibacillus polymyxa E681] Length = 600 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 43/220 (19%), Positives = 80/220 (36%), Gaps = 50/220 (22%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 ++ I+++ P + LVVD+S SM+ SDP + + Sbjct: 13 GIICTMIMTSILAWQPQMANASSPSASKVDAVLVVDVSNSMN---TSDPGKIGN------ 63 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVT 281 A+K + + D + G++ YT V++ T + +Q + Sbjct: 64 ------EAMKMFIDMLSTQNDKV---------GIVAYTDVVQREKALLNITSEADKQELK 108 Query: 282 RDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 +D L TD++ +K+A +IL K I+ L DG Sbjct: 109 TFIDGLNRGAYTDTSVGVKEALRILQDGKTAGHAP----------------MIVMLADGN 152 Query: 341 NN--------NFKSNVNTIKICDKAKENFIKIVTISINAS 372 N+ +S+ + + +AK + + I TI +NA Sbjct: 153 NDFNKTTGRTESQSDQDMAQAVAEAKNSGVPIYTIGLNAD 192 >gi|55741484|ref|NP_001006980.1| cartilage matrix protein [Rattus norvegicus] gi|54035339|gb|AAH83869.1| Matrilin 1, cartilage matrix protein [Rattus norvegicus] gi|149024105|gb|EDL80602.1| matrilin 1, cartilage matrix protein [Rattus norvegicus] Length = 498 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/170 (20%), Positives = 61/170 (35%), Gaps = 26/170 (15%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR--QYVTRDMDSLILKPTDSTPAMK 299 I L +GL+ Y + V+ T K Q V R + T + A++ Sbjct: 70 IKSLDVGPNATRVGLVNYASTVKPEFPLRAHTSKASLLQAVHRIQP--LSTGTMTGLALQ 127 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A SD + G + S K +I +TDG + +V+ ++A+ Sbjct: 128 FAITKALSDAE----------GGRSRSSDISKVVIVVTDGRPQDSVRDVS-----ERARA 172 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE----YHYNVVNA-DSLIHVFQN 404 + I++ I + + L+ S P+ + N + L FQ Sbjct: 173 SGIELFAIGVG--RVDKATLRQIASEPQDEHVDYVESYNVIEKLAKKFQE 220 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 74/190 (38%), Gaps = 29/190 (15%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K + +D++D+ + + GL+ Y++ + + G ++ + Sbjct: 288 RPENFELVKKFINQIVDTLDVSDRLAQV---GLVQYSSSIRQEFPL--GRFHTKKDIKAA 342 Query: 284 MD--SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + S + K T + A+K D + + K+ I+F TDG + Sbjct: 343 VRNMSYMEKGTMTGAALKY-----LIDNSFTVSSGARPGAQKVG-------IVF-TDGRS 389 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 ++ ++ KAK+ K+ + + + L+ S P ++++ + ++ Sbjct: 390 QDYINDAA-----RKAKDLGFKMFAVGVG--NAVEEELREIASEPVADHYFYTADFKTIN 442 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 443 QIGKKLQKKI 452 >gi|148253748|ref|YP_001238333.1| hypothetical protein BBta_2249 [Bradyrhizobium sp. BTAi1] gi|146405921|gb|ABQ34427.1| putative exported protein of unknown function [Bradyrhizobium sp. BTAi1] Length = 432 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 68/438 (15%), Positives = 128/438 (29%), Gaps = 61/438 (13%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R+ + +E N +IIFAL+++ L +G I K + ++ +AA+LA Sbjct: 6 RYLGLLSRFRRNESGNIAIIFALALLPILTFVGSAIDYSMAVRAKAKLSASLDAAMLAAT 65 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + ++ A A A + S S T ++ + Sbjct: 66 G------------YTAMRGTAADAKTS-ATNMYNGQMS-SHKLTSNSLNITVTDSVTART 111 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 + N F M V S F ++ S +M Sbjct: 112 VTGTASVVVN-TAFMYMFGFPTMTVTASSSASASFPTYMDFYVLVDNSPSQGLGATTADM 170 Query: 186 GE-RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT-KMAALKNALLLFLDSID 243 + + + + + + I ++K T ++ +++A D+ Sbjct: 171 TTLQNATSDKCAFACHDTYTSSTKKTLQTNSYYQIAKNKGVTMRIDVVRSATQSLTDTAT 230 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 V M + + G V + + M Y Sbjct: 231 SSQVVSNQYRMAVYSLGSDCGSL-----GLTTVASLSSSMSSVKSSVG--ALDLMTIPYS 283 Query: 304 ILTSDKKRSFFTNFFRQGVKIP------SLPFQKFIIFLTDG--ENNNFKSNVNTI---- 351 +D F IP S QK++ F++DG + + + T+ Sbjct: 284 GYNNDMCTDFDGAMSGMNGVIPAQGDGSSTSPQKWLFFVSDGVADYSYPTTCSKTVLSGG 343 Query: 352 --------KICDKAKENFIKI---VT-----------ISINASPNG--QRLLKTCVSSPE 387 CD K IKI T + A ++K+C SP Sbjct: 344 RCQEPLNTTTCDTLKARGIKIAVLYTTYLAITNNSWYTTYIAPWRDSISGIMKSCA-SPG 402 Query: 388 YHYNVVNADSLIHVFQNI 405 Y+Y V ++ S+ + Sbjct: 403 YYYEVDSSGSIGAALTAL 420 >gi|255009407|ref|ZP_05281533.1| aerotolerance-related membrane protein [Bacteroides fragilis 3_1_12] gi|313147166|ref|ZP_07809359.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] gi|313135933|gb|EFR53293.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] Length = 327 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/239 (16%), Positives = 72/239 (30%), Gaps = 63/239 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D K ++ A K+ F++ Sbjct: 88 IDIMLAIDVSTSMLAE---------------DLKPNRLEAAKDVAAEFIN----GRPNDN 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G+ + + + +I T + A L K Sbjct: 129 ---IGITLFAGESFTQCPLTVDHAVLLNLFQGIKCDIIEDGTAVGMGIANAVTRLKDSKA 185 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 +S K II LTDG NN K +++ + + AK I++ TI + Sbjct: 186 KS------------------KVIILLTDGTNN--KGDISPLTAAEIAKSFGIRVYTIGVG 225 Query: 370 ---NAS-----------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 A + + L + + ++ + L V++ I +L Sbjct: 226 TNGMAPYPVPVGGTVQYINTPVEIDEKTLTQIAGITDGNYFRATSNSKLKEVYEEIDKL 284 >gi|108762540|ref|YP_633801.1| BatA protein [Myxococcus xanthus DK 1622] gi|108466420|gb|ABF91605.1| batA protein [Myxococcus xanthus DK 1622] Length = 336 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 60/177 (33%), Gaps = 41/177 (23%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I + +DLS SM + + +M K L F+ + V + Sbjct: 92 IDIVVALDLSTSMEAG--------------DFRPQNRMHVAKEVLSEFI-----ANRVND 132 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + +++ V + ++ T A+ + L + Sbjct: 133 --RIGLVVFAGAAYTQAPLTLDYGVLKEVVKQLRTRVLEDGTAIGDALATSLNRLRDSEA 190 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 +S + ++ +TDG+NN+ K + + A+ + I TI Sbjct: 191 KS------------------RVVVLITDGDNNSGKISPMDSA--NMAQALKVPIYTI 227 >gi|54025448|ref|YP_119690.1| hypothetical protein nfa34780 [Nocardia farcinica IFM 10152] gi|81374389|sp|Q5YU15|Y3478_NOCFA RecName: Full=UPF0353 protein NFA_34780 gi|54016956|dbj|BAD58326.1| hypothetical protein [Nocardia farcinica IFM 10152] Length = 335 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 40/296 (13%), Positives = 83/296 (28%), Gaps = 62/296 (20%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 SN + + R I +L I + P ++ R + LV+D+ Sbjct: 40 SNMEVLEKVAPSRPSPLRHAPIALMLVGLVFLTIAAAGPTSVQKVPRNRA--TVVLVMDV 97 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 S SM D +++ + A F + + + + +G + + Sbjct: 98 SLSMEA---------------TDVPPSRLEVAQQAGKEF------VDGLTQGINLGFVTF 136 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 P+ E V+ + + T + + A Q + Sbjct: 137 AGTASVMQSPTTNREAVKAAIDNIKLAER---TATGEGILTALQSI----------ETLA 183 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFK----SNVNTIKICDK-AKENFIKIVTISI----- 369 + P I+ ++DG+ N + AK I + TIS Sbjct: 184 TVLGGAETPPPARIVLMSDGKQTVPDDKDVDNPRHAFTAARLAKSKGIPVSTISFGTEWG 243 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + L + S Y + + L V+ + + + Sbjct: 244 SVEIPDQDGQGGSQRVKVPVDNESLREIAKLSGGEFYTASSLEELTAVYDTLEEQI 299 >gi|46580532|ref|YP_011340.1| von Willebrand factor type A domain-containing protein [Desulfovibrio vulgaris str. Hildenborough] gi|46449951|gb|AAS96600.1| von Willebrand factor type A domain protein [Desulfovibrio vulgaris str. Hildenborough] Length = 420 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 37/79 (46%), Gaps = 5/79 (6%) Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKTCVSSP----EYHY 390 +T N K N ++ KAKE I++ I S + L+K+ SS +++Y Sbjct: 336 MTSHCENGGKLNAAMLEEARKAKEAGIEVFAIRFGDSDSVDVSLMKSIASSKAGTNDHYY 395 Query: 391 NVVNADSLIHVFQNISQLM 409 + +A + VF+ I + + Sbjct: 396 DAPSAYDIDDVFKKIGRQL 414 >gi|254482897|ref|ZP_05096133.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214036769|gb|EEB77440.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 330 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 41/233 (17%), Positives = 77/233 (33%), Gaps = 49/233 (21%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 ++ + + VDLSGSM + P V ++ A+K + L Sbjct: 88 QQKSGRDLMIAVDLSGSMEARDFTLPSGVTVD---------RLDAVK-------QVLKEL 131 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + +E +GLI + + + +Q + + T A+ + L Sbjct: 132 AANRESDRLGLIVFGAAAYLQTPFTDDHQVWQQLLDETEIGMAGPSTVFGDAIGLSI-KL 190 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK---ENFI 362 SD + +I LTDG +T+ D AK N + Sbjct: 191 FSDSDSDN-----------------RVLIMLTDG-----NDTGSTVPPVDAAKVAAANGV 228 Query: 363 KIVTISINASPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +I TI+I G+ L + + + ++ D + + I +L Sbjct: 229 RIYTIAIGDPATVGEDALDMDTITRVSKIADGRTFRALDQDEMRQAYITIGEL 281 >gi|34541235|ref|NP_905714.1| batB protein [Porphyromonas gingivalis W83] gi|34397551|gb|AAQ66613.1| batB protein [Porphyromonas gingivalis W83] Length = 339 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 65/186 (34%), Gaps = 45/186 (24%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 E + +D+S SM +C+D K +++ K L D + Sbjct: 85 PKEEKGIEAMICLDISNSM---------------LCEDVKPNRLSFAKQVLGKLFDGLQ- 128 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQ 303 + V GL+ + I + +Q++ +++ T A++ A + Sbjct: 129 ----NDKV--GLVVFAGNAYTQIPITTDLSAAKQFLADISPNMVTAQGTAIGAAIELASK 182 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + +K+ K II LTDGEN N I+ +A E I+ Sbjct: 183 SFSDNKEIG------------------KTIIVLTDGEN----HEGNAIEAAQQAHEAGIR 220 Query: 364 IVTISI 369 + I + Sbjct: 221 VNVIGL 226 >gi|22299719|ref|NP_682966.1| hypothetical protein tlr2176 [Thermosynechococcus elongatus BP-1] gi|22295903|dbj|BAC09728.1| tlr2176 [Thermosynechococcus elongatus BP-1] Length = 415 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 73/232 (31%), Gaps = 49/232 (21%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +R + L++D SGSM +A +K A +D + Sbjct: 34 QQRAPLNLCLILDHSGSMAWQ--------------------PLAMVKQAAASLVDRLLPS 73 Query: 246 SHVKEDVYMGLIGYTT--RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + +I + +V + W E ++ + T MK + Sbjct: 74 D------RLSVIAFDHKAKVLVPNQTVWDKEAIKAQIATLE---PGGGTAIDEGMKLGLK 124 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + + K+ + I LTDGEN + N +++ A E I Sbjct: 125 EIAAGKQGTISQ-----------------IFLLTDGENEHG-DNQRCLELAKLAAEYNIT 166 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + N L + ++ + A+ I FQ++ + Y+ Sbjct: 167 LNALGFGVHWNQDVLEQIADAAGGRLVFIEYAEQAIACFQSLFSHISSVDYT 218 >gi|149922178|ref|ZP_01910616.1| flagellar biosynthesis protein P [Plesiocystis pacifica SIR-1] gi|149816918|gb|EDM76403.1| flagellar biosynthesis protein P [Plesiocystis pacifica SIR-1] Length = 689 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 20/124 (16%), Positives = 41/124 (33%), Gaps = 13/124 (10%) Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI-IFLTDGENNNFKSNV 348 T + ++ L D + + + G + P+ + I +TDG+ N + +N Sbjct: 420 SGTYTHLGLQ-----LIKDNQVQYQADGLMDGAEFPTNDETIYFNILITDGQYNGYSTNA 474 Query: 349 NTIKICDKAKENFIKIVTISI-------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 ++ + I I A Q + + E +Y+ N L Sbjct: 475 QVQGELEEMYNDGITTYVIGFGDGVDTPAAMAQLQNMAQWGSGDSENYYDANNQAELEAA 534 Query: 402 FQNI 405 +I Sbjct: 535 LTSI 538 >gi|134093103|gb|ABO52963.1| matrilin 4 isoform 1 precursor [Lemur catta] Length = 583 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 66/170 (38%), Gaps = 24/170 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 + + L+ +G+I Y+++V+ G + + R + +L+ + T + Sbjct: 59 LVGLLHGLNVGPNATRVGVIQYSSQVQSVFPL--GAFSRPEDMERAIRALVPLAQGTMTG 116 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + S + G + P + + +TDG + ++ Sbjct: 117 LAIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAA 161 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 +A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 162 QARARGIEIYAVGVQRADVGS--LRAMASHPLDEHVFLVESF-DLIQEFG 208 Score = 43.3 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L + +GL+ +++RV G V + + + T + A++ Sbjct: 374 VDFLDVSPDGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 431 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 432 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 476 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + L+ S P + + D F ++ L+ + + S+ Sbjct: 477 EGIVMYAVGVG--KAVEEELREIASEPAELHVSYSPD-----FSTMTHLLENLRGSI 526 >gi|281357358|ref|ZP_06243847.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] gi|281316389|gb|EFB00414.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] Length = 342 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/242 (16%), Positives = 82/242 (33%), Gaps = 50/242 (20%) Query: 191 FLIELVVDLSGSMHCA---MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 I L +D+SGSM N + A + + ++ K + F++ Sbjct: 84 IDIVLALDMSGSMEAYDVPRNINDARTLIAAVKNKEVENRIEVAKKEIRRFIEQ------ 137 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + +GLIG+ + P+ + ++ + +I + T + L Sbjct: 138 -RPNDRIGLIGFADQAYSFAPPTLDHAWLLAHLEQLEPGMIGQQTGIAAPLASGVNRL-- 194 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K P ++ ++ TDG NN + + + KE + I T+ Sbjct: 195 ---------------KKSDAP-RRVLVLFTDGRNN-VDNRLTPEQAAALGKEFDVVIHTV 237 Query: 368 SI--------------------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNIS 406 I ++LL++ + +++ +AD + V I+ Sbjct: 238 GIGSRNAFVLVTDPFGRQQFQGIEDEFDEKLLRSLAEITGGTYFHAADADGMKQVMDEIN 297 Query: 407 QL 408 QL Sbjct: 298 QL 299 >gi|262403351|ref|ZP_06079911.1| protein BatA [Vibrio sp. RC586] gi|262350850|gb|EEY99983.1| protein BatA [Vibrio sp. RC586] Length = 248 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 79/235 (33%), Gaps = 55/235 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 15 DLMLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSEFIAQ-------REG 57 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + + + LI T + A + Sbjct: 58 DRIGLILFADHAYLQTPLTLDRQTVTEQLNQAVLKLIGTQTAMGEGIGLATKTFID---- 113 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 S Q+ +I L+DG N + + AK+ I T+ + Sbjct: 114 --------------SAAPQRVMILLSDGSNTAGVLDPLEAA--NIAKQYQTTIYTVGVGA 157 Query: 370 ---------------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 A ++ L+T S + ++ N L ++ I+QL Sbjct: 158 GEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRARNQQDLQSIYDTINQL 212 >gi|257883753|ref|ZP_05663406.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,501] gi|257819591|gb|EEV46739.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,501] Length = 1475 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 69/229 (30%), Gaps = 30/229 (13%) Query: 93 DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT 152 D N + + E +N+ + N + N+ + YN + Sbjct: 257 DYTNGGANWRNYDYATNNGSTAGEEDRNV----PAQTVQLWGNERNFENSYLDYNGAYIK 312 Query: 153 SYDYRLQFIEHL--LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + + L+ + + ++ E E I V+D S SM Sbjct: 313 KWVEPVLPSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGT 372 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--------TTR 262 +TK AAL A+ S DLLS D+ +G++ + Sbjct: 373 N------------SQTKNAALIEAVNEM--SKDLLSDPSLDIRIGMVNFYHNSTAINNHE 418 Query: 263 VEKN--IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + + ++ ++ + T T +K Y+ L D Sbjct: 419 QISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDN 467 >gi|307354884|ref|YP_003895935.1| hypothetical protein Mpet_2754 [Methanoplanus petrolearius DSM 11571] gi|307158117|gb|ADN37497.1| conserved hypothetical protein [Methanoplanus petrolearius DSM 11571] Length = 316 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/245 (15%), Positives = 80/245 (32%), Gaps = 60/245 (24%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + LV+D SGSM D ++ A K+A ++ +D Sbjct: 83 QTKEGVNVVLVIDDSGSMQA---------------TDYSPNRLEATKSAAEELINDLDPK 127 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 Y+G++ + + S + V + + M+ T + + Sbjct: 128 D------YVGIVVFESGASTASYLSPDKDSVIENLENIME--KDGATAIGDGLSLGINMA 179 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S R +K +I L+DG NN + + AK++ I++ Sbjct: 180 DSIPNR------------------KKVVILLSDGVNNAGVISPDEAIQ--FAKDSDIQVF 219 Query: 366 TISINASPN----------------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 TI + + + LK + ++ V+ +L ++ NI+ Sbjct: 220 TIGMGSEQPVVMGYDWFGNPQYAELDEATLKEIADETGGKYFKSVDDQTLNEIYSNINSE 279 Query: 409 MVHRK 413 + K Sbjct: 280 IKREK 284 >gi|327471789|gb|EGF17230.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK408] Length = 464 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 54/161 (33%), Gaps = 28/161 (17%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I V D SGSM +N + + +++M+ L++ + + + + +V ++ Sbjct: 199 ISFVFDKSGSMSWDLNGNNTNYWG-------PKSRMSILQDKATIMMRDLKDIGNVSVNL 251 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 I + + E GT + + T+ ++ L + + Sbjct: 252 VSFSILGSYVQKDFSELDKGTTTIEASINALQTG---GVTNPGDGLRYGMMSLQNHSAQL 308 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 K+++ LTDG N + + N Sbjct: 309 ------------------KYVVLLTDGIPNAYTVDTNDTSW 331 >gi|292627943|ref|XP_695559.4| PREDICTED: integrin alpha-11 [Danio rerio] Length = 1104 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 20/183 (10%), Positives = 59/183 (32%), Gaps = 26/183 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y RV V + V + ++ Sbjct: 170 DFLINVLQKFYIGPGQIQVGVVQYGERVVNEFRLD-DFRTVDEVVAAAKNIDQRGGEETR 228 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + +K +I +TDGE + + + Sbjct: 229 TAL-------------GINVARTQAFKHGGRPDAKKVMIVITDGE---SHDSPDLKAAVE 272 Query: 356 KAKENFIKIVTISINA--------SPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNIS 406 +++++ I + I++ R +K + P+ + ++V + +L + + Sbjct: 273 ESEKDNITLYGIAVLGYYNRRGINPEAFLREIKFIATDPDEHFFSVTDESALKDIVDALG 332 Query: 407 QLM 409 + + Sbjct: 333 EKI 335 >gi|33152377|ref|NP_873730.1| tight adherence protein G [Haemophilus ducreyi 35000HP] gi|21326716|gb|AAL92476.1| TadG [Haemophilus ducreyi] gi|33148600|gb|AAP96119.1| tight adherence protein G [Haemophilus ducreyi 35000HP] Length = 562 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 41/287 (14%), Positives = 90/287 (31%), Gaps = 21/287 (7%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M + K+ I ++ + I AL + + L+ + V K + A A Sbjct: 1 MIMRKYVITQTKRFIQNQSGVYIIFGALLTLPIVALLFVSLEVAGIIQDKARLNDALEQA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDA-----KRFIKNHIKESLSGYSAVFYN 115 +L+ ++ S + + L D + +K +K L Sbjct: 61 VLSLTAENNSGRKSYDYALTNAEKANGKYLADSEAGKRDSQIVKTFVKLYLPQIDENTMK 120 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 E ++ I+ + SS+ T + L + ++ +++ Sbjct: 121 FEPICTTQNNAITPKNGKQYAYSSSHVTCTVTGSINHR---SLFPMTVGKSKIIPEQVSL 177 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 ++ + + +V DLSGSM +N+ + + + +K+ LK L Sbjct: 178 SSGSMAQKINNVNLPLDLMVVADLSGSMDYNINN-------HKVYSNTEASKLTLLKQVL 230 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI------EPSWGTEKV 276 D L + + +I + + I W + Sbjct: 231 EELTDKYLLSEEANPNNRISMIPFAMGAQHPIRNSCVLPFEWNQSHI 277 >gi|327270784|ref|XP_003220168.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 952 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 44/224 (19%), Positives = 82/224 (36%), Gaps = 64/224 (28%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D+SGSM+ ++A LK A +FL I +++ ++ Sbjct: 305 LVLDVSGSMYGV--------------------RVARLKQAAEIFLLQI-----IEDGSWV 339 Query: 255 GLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + T +R+ +T + T ++ +++ Sbjct: 340 GIVTFNSAATIKTGLQQITSDSIRRSLTGYLPVTAGGGTRICNGVEAGFKVFKQKYASEK 399 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISI--N 370 I+ LTDGE++ TI C D+ K + I TI++ + Sbjct: 400 GCE----------------IVLLTDGEDS-------TISYCLDEVKRSGSIIHTIALGRS 436 Query: 371 ASPNGQRL------LKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 A P + L LK + ++++SLI F IS Sbjct: 437 ADPGLEELADMTGGLKFSATDS------LDSNSLIDAFTGISSS 474 >gi|293569033|ref|ZP_06680345.1| von Willebrand factor type A domain protein [Enterococcus faecium E1071] gi|291588214|gb|EFF20050.1| von Willebrand factor type A domain protein [Enterococcus faecium E1071] Length = 1502 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 69/229 (30%), Gaps = 30/229 (13%) Query: 93 DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT 152 D N + + E +N+ + N + N+ + YN + Sbjct: 284 DYTNGGANWRNYDYATNNGSTAGEEDRNV----PAQTVQLWGNERNFENSYLDYNGAYIK 339 Query: 153 SYDYRLQFIEHL--LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + + L+ + + ++ E E I V+D S SM Sbjct: 340 KWVEPVLPSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGT 399 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--------TTR 262 +TK AAL A+ S DLLS D+ +G++ + Sbjct: 400 N------------SQTKNAALIEAVNEM--SKDLLSDPSLDIRIGMVNFYHNSTAINNHE 445 Query: 263 VEKN--IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + + ++ ++ + T T +K Y+ L D Sbjct: 446 QISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDN 494 >gi|274320027|ref|NP_001162099.1| matrilin-4 [Macaca mulatta] gi|134093113|gb|ABO52973.1| matrilin 4 isoform 1 precursor [Macaca mulatta] Length = 581 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LVGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 43.3 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 372 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 429 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 430 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 474 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 475 EGIAMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLRNLRSSI 524 >gi|281183022|ref|NP_001162498.1| matrilin-4 [Papio anubis] gi|134093054|gb|ABO52914.1| matrilin 4 isoform 1 precursor [Papio anubis] Length = 581 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LVGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 372 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 429 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 430 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 474 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 475 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLQNLRGSI 524 >gi|325279871|ref|YP_004252413.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] gi|324311680|gb|ADY32233.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] Length = 341 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 71/188 (37%), Gaps = 45/188 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +++ ++ + + +D+S SM QD K +++ K A+ ++ Sbjct: 81 KLQQVKKKGVELMIALDVSNSMMA---------------QDIKPSRLEKAKMAISRMVEK 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQ 300 + S+ K +GLI + + + + +++ D + ++ T A+ Sbjct: 126 L---SNDK----IGLIVFAGDAYVQLPITTDYSSAKLFLSNISTDIVPVQGTAIGSAIDL 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + T + + S K II +TDGEN+ + + +A E Sbjct: 179 AARSFTPETETS------------------KAIIVITDGENHQDDA----VAAAKQAHEK 216 Query: 361 FIKIVTIS 368 I I TI Sbjct: 217 GIVIHTIG 224 >gi|294054316|ref|YP_003547974.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293613649|gb|ADE53804.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 678 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 59/187 (31%), Gaps = 46/187 (24%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I V+D S SM D + T++ K A+L ++ ++ Sbjct: 90 LDIVFVLDSSKSMLA---------------SDLRPTRLERAKLAILDLVEQLESD----- 129 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDK 309 +GL+ + P+ R+ + D + +D A+++A + + Sbjct: 130 --RIGLVAFAGSAFLQTPPTLDYGAFRESLDATAPDMMSRGGSDLGVALREATKAFPVEN 187 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 K ++ LTDGE+ + KA + +K+ I + Sbjct: 188 NY-------------------KAVVLLTDGEDLGGHAIDEAK----KASKEGVKVFAIGL 224 Query: 370 NASPNGQ 376 Sbjct: 225 GTPEGDY 231 >gi|209549601|ref|YP_002281518.1| hypothetical protein Rleg2_2008 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209535357|gb|ACI55292.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 429 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 64/435 (14%), Positives = 129/435 (29%), Gaps = 70/435 (16%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I+ NF I+ AL ++ L G + + + +A +AA + + Sbjct: 9 ISDRSGNFGIMTALLMVPLLGTAGMAVDFAHAMSLRTQLFAAADAAAVGS---IAEKSGA 65 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLS--GYSAVFYNTEIQNIVNSSRISMTHM 132 + N DA+ + + L+ T+ N +NS + Sbjct: 66 VAAAMTMTGNGTISLGKTDARSIFLSQVSGELADVNVDLGIDVTKTANKLNSQVSFTAVV 125 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + T+ F++ + P++ + Sbjct: 126 PTTFMRVLGKDSITISGTATAEYLTASFMDFYILLDN-------TPSMGVGATAKDVA-T 177 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 +E S + C + + + ++ ++ A + Sbjct: 178 MEKNTSDSCAFACHETENKNNYYNLAKTLGVSM-RIDVVRQATKELTLTAKSTRVSTNQF 236 Query: 253 YMGLIGYTTRVEKN-----IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 MG+ + T+ E +P+ +KVR Y TD+ M Q + Sbjct: 237 RMGVYTFGTKAEDANLTTISDPTDDLDKVRTY------------TDAVDLMTIPKQGYNN 284 Query: 308 DKKRSFFTNFFRQGVKIPSLP--------FQKFIIFLTDGENNNFKSNVNTIKI------ 353 D++ SF N Q I + P QK + F++DG ++ K T K+ Sbjct: 285 DQQTSFD-NALTQMKDIITTPGDGSTATTPQKILFFVSDGVGDSEKPKGCTKKLTGNRCQ 343 Query: 354 -------CDKAKENFIKI---VTISINASPNG-------------QRLLKTCVSSPEYHY 390 C K+ I+I T + N ++ C SP ++ Sbjct: 344 EPIDTSFCKPLKDKGIRIAVLYTTYLPLPKNSWYNTWISPFQSQIPTKMQECA-SPGLYF 402 Query: 391 NVVNADSLIHVFQNI 405 V + + + + Sbjct: 403 EVTPTEGIADAMKAL 417 >gi|115379114|ref|ZP_01466238.1| TPR domain protein [Stigmatella aurantiaca DW4/3-1] gi|310823566|ref|YP_003955924.1| Batb protein [Stigmatella aurantiaca DW4/3-1] gi|115363897|gb|EAU63008.1| TPR domain protein [Stigmatella aurantiaca DW4/3-1] gi|309396638|gb|ADO74097.1| BatB protein [Stigmatella aurantiaca DW4/3-1] Length = 352 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 71/189 (37%), Gaps = 43/189 (22%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + EM +R + +V+D S SM +D + +++ K L LD Sbjct: 95 KSEMTKRKGIDVVVVLDASKSMLA---------------RDVQPSRLERAKLELNTLLDE 139 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR-DMDSLILKPTDSTPAMKQ 300 + +GL+ + + V+ ++ D + + ++ A+K Sbjct: 140 LK-------GDRVGLVVFAGDAFIQSPLTSDYSAVKLFLRAVDPEQMPQGGSNIGAALKL 192 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A Q+L++ + + ++ ++ L+DGE+ + T + K+ Sbjct: 193 ANQVLSNADRGAK----------------ERAVVLLSDGEDLFGEVGEAT----EALKDG 232 Query: 361 FIKIVTISI 369 ++++ + + Sbjct: 233 GVQVLAVGV 241 >gi|69244819|ref|ZP_00603043.1| von Willebrand factor, type A [Enterococcus faecium DO] gi|257882064|ref|ZP_05661717.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257889959|ref|ZP_05669612.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|260560224|ref|ZP_05832401.1| von Willebrand factor [Enterococcus faecium C68] gi|314947791|ref|ZP_07851198.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecium TX0082] gi|68196173|gb|EAN10603.1| von Willebrand factor, type A [Enterococcus faecium DO] gi|257817722|gb|EEV45050.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257826319|gb|EEV52945.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|260073791|gb|EEW62116.1| von Willebrand factor [Enterococcus faecium C68] gi|313645771|gb|EFS10351.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecium TX0082] Length = 1345 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 69/229 (30%), Gaps = 30/229 (13%) Query: 93 DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT 152 D N + + E +N+ + N + N+ + YN + Sbjct: 127 DYTNGGANWRNYDYATNNGSTAGEEDRNV----PAQTVQLWGNERNFENSYLDYNGAYIK 182 Query: 153 SYDYRLQFIEHL--LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + + L+ + + ++ E E I V+D S SM Sbjct: 183 KWVEPVLPSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGT 242 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--------TTR 262 +TK AAL A+ S DLLS D+ +G++ + Sbjct: 243 N------------SQTKNAALIEAVNEM--SKDLLSDPSLDIRIGMVNFYHNSTAINNHE 288 Query: 263 VEKN--IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + + ++ ++ + T T +K Y+ L D Sbjct: 289 QISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDN 337 >gi|257876693|ref|ZP_05656346.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC20] gi|257810859|gb|EEV39679.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC20] Length = 1195 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/152 (21%), Positives = 60/152 (39%), Gaps = 41/152 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++A +K + FL+ I+ S +++ Sbjct: 355 IDVVLVVDWSGSM-------------------NEMGRIAEVKKGVDRFLNQIE-GSGIQD 394 Query: 251 DVYMGLIGYTT--RVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 VYMG +GY++ +N G +V++ + T + ++QA +L++ Sbjct: 395 SVYMGYVGYSSDGSNYQNKTCQLGKFSEVKETIRSMTPETAAGGTFTQRGLRQAGDMLST 454 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 455 QNGH------------------KKVIVLLTDG 468 >gi|258615515|ref|ZP_05713285.1| hypothetical protein EfaeD_07377 [Enterococcus faecium DO] gi|293563519|ref|ZP_06677967.1| Bee1, putative [Enterococcus faecium E1162] gi|294622786|ref|ZP_06701740.1| Bee1, putative [Enterococcus faecium U0317] gi|291597744|gb|EFF28882.1| Bee1, putative [Enterococcus faecium U0317] gi|291604521|gb|EFF34007.1| Bee1, putative [Enterococcus faecium E1162] Length = 1344 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 69/229 (30%), Gaps = 30/229 (13%) Query: 93 DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT 152 D N + + E +N+ + N + N+ + YN + Sbjct: 126 DYTNGGANWRNYDYATNNGSTAGEEDRNV----PAQTVQLWGNERNFENSYLDYNGAYIK 181 Query: 153 SYDYRLQFIEHL--LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + + L+ + + ++ E E I V+D S SM Sbjct: 182 KWVEPVLPSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGT 241 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--------TTR 262 +TK AAL A+ S DLLS D+ +G++ + Sbjct: 242 N------------SQTKNAALIEAVNEM--SKDLLSDPSLDIRIGMVNFYHNSTAINNHE 287 Query: 263 VEKN--IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + + ++ ++ + T T +K Y+ L D Sbjct: 288 QISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDN 336 >gi|325969627|ref|YP_004245819.1| von Willebrand factor type A [Vulcanisaeta moutnovskia 768-28] gi|323708830|gb|ADY02317.1| von Willebrand factor type A [Vulcanisaeta moutnovskia 768-28] Length = 498 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 50/294 (17%), Positives = 102/294 (34%), Gaps = 55/294 (18%) Query: 118 IQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV------MTSYDYRLQFIEHLLNQRYNQ 171 ++N+V ++ SM + R + Y + V +D LQ L+++ Sbjct: 241 LRNVVRNAMSSMGQVKIVRFTDIDKYPTYVVSVREYKVGDNYFDVDLQKTAMNLSRKTMM 300 Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 + ++ E I L +D+SGSM + P K+ Sbjct: 301 HKLFTNKDIVVKEYANVKTIDIVLCLDVSGSMRELSSGMP---------------KIEIA 345 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL---- 287 K+A+ ++ + + + ++ + R + WG +VR+Y + L Sbjct: 346 KDAVSQYIQFLSKTND-----RLAMVLFNFRAD----VLWGLHQVRRYWQQMNYMLKYVY 396 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ A++++ ++LT K S K +I +TDG N Sbjct: 397 AGGGTNLANALERSREVLTRSKSNS------------------KHVICVTDGRTVNSSMC 438 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 V + + N I TI+I + + + L++ + + L Sbjct: 439 VKEAV---RLRRNGTTISTIAIGENSDDELLMRLSKIGGGLFIKISSIHDLGKA 489 >gi|189230272|ref|NP_001121460.1| anthrax toxin receptor 2 [Xenopus (Silurana) tropicalis] gi|183985706|gb|AAI66225.1| LOC100158556 protein [Xenopus (Silurana) tropicalis] Length = 488 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 63/216 (29%), Gaps = 45/216 (20%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P E + F + ++D SGS+ E V R Sbjct: 28 PYTGAEEPSCQGAFDLYFILDKSGSVASNWVEIYEFVEKLTERFVSPR------------ 75 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + I ++T+ + + + ++ + + + T Sbjct: 76 --------------MRLSFIVFSTQAKIILPLTGDRYEITKGLKDLSSVIPAGETYMHEG 121 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 K A Q VK II LTDG+ + V T K + A Sbjct: 122 FKLA----------------NEQIVKAGGKSTASVIIALTDGK-LADQIPVLTEKEANIA 164 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 + ++ + + Q LK ++PE + V Sbjct: 165 RGRGARVYCVGVLDFNFDQ--LKRIAAAPENVFRVE 198 >gi|120616160|gb|ABG80452.1| collagen [Hydra vulgaris] Length = 2439 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 65/172 (37%), Gaps = 20/172 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK--PTDSTPAMK 299 ID + V +G++ Y+ + I + + ++ V + +D++ T ++ Sbjct: 1867 IDKIELSSSGVRVGVLTYSDEAKIRIRFDYSFD--KEDVKKAIDNIPYDSMGTRIDLGLE 1924 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A ++ +K +I LTDG+ + + + E Sbjct: 1925 AAKELFLEKS--------------GGRGSSKKVLILLTDGQQTYIPDAKDPVDYAKELAE 1970 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-HVFQNISQLMV 410 + I I I + L+ +S P++ + + +SLI + ++IS + Sbjct: 1971 YGVDIFAIGI-GDEINKVDLEDLISKPQHIFLSDDINSLITDLSKDISTALS 2021 >gi|332246079|ref|XP_003272177.1| PREDICTED: complement factor B-like [Nomascus leucogenys] Length = 764 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 41/267 (15%), Positives = 91/267 (34%), Gaps = 43/267 (16%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 YD + E L + + + A GE+ ++V+D SGSM+ + D D Sbjct: 224 YDTPQEVAEAFL--SSLTETIEGVDAEDGHSPGEQQK--RKIVLDPSGSMNIYLVLDGSD 279 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 A K L+ ++ + GL+ Y T ++ S Sbjct: 280 SIGASNFTG--------AKKCLVNLIEKVASYGVKPRY---GLVTYATYPRIWVKVSEQD 328 Query: 274 EKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 +VT+ ++ + + T++ A++ Y ++ ++ Sbjct: 329 SSNADWVTKQLNKINYEDHKLKSGTNTKKALQAVYSMM----------SWPDDIPPEGWN 378 Query: 328 PFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN------FIKIVTISINASPNG 375 + II +TDG +N + + K ++N + + + + Sbjct: 379 RTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVN 438 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVF 402 L + + ++ + V + ++L VF Sbjct: 439 INALASKKDNEQHVFKVKDMENLEDVF 465 >gi|119901955|ref|XP_602058.3| PREDICTED: integrin, alpha 11 [Bos taurus] gi|297479009|ref|XP_002690571.1| PREDICTED: integrin alpha 11 subunit-like [Bos taurus] gi|296483745|gb|DAA25860.1| integrin alpha 11 subunit-like [Bos taurus] Length = 1194 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 191 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 246 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 247 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 293 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 294 ERDNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 353 Query: 408 LM 409 + Sbjct: 354 RI 355 >gi|74000923|ref|XP_535527.2| PREDICTED: similar to integrin, alpha 11 precursor [Canis familiaris] Length = 1183 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 180 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 235 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 236 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 282 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 283 ERDNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 342 Query: 408 LM 409 + Sbjct: 343 RI 344 >gi|150005795|ref|YP_001300539.1| hypothetical protein BVU_3288 [Bacteroides vulgatus ATCC 8482] gi|149934219|gb|ABR40917.1| conserved hypothetical protein BatA [Bacteroides vulgatus ATCC 8482] Length = 332 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 48/281 (17%), Positives = 79/281 (28%), Gaps = 68/281 (24%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y F+ + I++ + E I L VD+S SM Sbjct: 51 YLLHTPFVLRTVAIIMVILILARPQTTDNWQNTEIEGIDIMLAVDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K F++ +GL + + Sbjct: 104 --------DLKPNRLEAAKQVASEFIN----GRPNDN---IGLTIFAGESFTQCPLTVDH 148 Query: 274 EKVRQYVTRD-----MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + LI T + A L K +S Sbjct: 149 GVLLNLFNSIKGDIAQRGLIEDGTAIGMGIANAVTRLKDSKAKS---------------- 192 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS-------------------I 369 K II LTDG NN + T + AK+ I+I TI + Sbjct: 193 --KVIILLTDGSNNRGDISPLTAA--EIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYV 248 Query: 370 NAS--PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 N + + L + ++ ++ + L V+Q I +L Sbjct: 249 NVPVEIDEKTLTEIAGTTNGNYFRATSNSKLKEVYQEIDKL 289 >gi|332298719|ref|YP_004440641.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] gi|332181822|gb|AEE17510.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] Length = 333 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/237 (14%), Positives = 66/237 (27%), Gaps = 64/237 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I V+D+S SM +++ A K A+ + + Sbjct: 93 ILFVLDVSPSMAAK--------------DIAGMSRLEAAKQAVRVIV-------PEAGGT 131 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 GL+ + + P+ E + + + + A L S Sbjct: 132 AFGLVALASEAALMVPPTLDREAFFARLNSLQAGELGDGSAIGMGVSTAAYHLIS----- 186 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA- 371 S +K I+ +TDGENN + T A EN I + + + Sbjct: 187 -------------SAAPKKSIVLITDGENNAGSVHPGTAAQ--LAFENGITLYVLGVGTR 231 Query: 372 ---------SPNGQRL------------LKTCV-SSPEYHYNVVNADSLIHVFQNIS 406 G+ L+ ++ ++ V + L ++ Sbjct: 232 GSVPLEYVDPATGKTYSGYLDSRFDESPLQEIALTAGGRYFGVESMGELTAAVSAVT 288 >gi|149709406|ref|XP_001496048.1| PREDICTED: similar to calcium-activated chloride channel [Equus caballus] Length = 904 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 71/211 (33%), Gaps = 51/211 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM D + +M A L + I+ S V Sbjct: 312 LVLDKSGSMDS----------------DDRLLRMNQA--AELYLIQIIEKESLV------ 347 Query: 255 GLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + T+ Q + + T +K +Q + + + Sbjct: 348 GMVTFDSSAEIQNNLTKITDDNAYQNIIAKLPQFAGGGTSICNGLKAGFQAIVYSNQSTS 407 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 + II LTDGE+N S C ++AK + I +I++ Sbjct: 408 GSE----------------IILLTDGEDNQMSS-------CFEEAKASGAIIHSIAL-GP 443 Query: 373 PNGQRLLKTCVSSPEYHYNVV-NADSLIHVF 402 + L + ++ + + LI F Sbjct: 444 SAAKELETLSNMTGGLRFSANKDINGLIDAF 474 >gi|300776751|ref|ZP_07086609.1| aerotolerance protein BatA [Chryseobacterium gleum ATCC 35910] gi|300502261|gb|EFK33401.1| aerotolerance protein BatA [Chryseobacterium gleum ATCC 35910] Length = 330 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 39/259 (15%), Positives = 79/259 (30%), Gaps = 64/259 (24%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + + + I L +D+S SM D ++ ALK Sbjct: 70 MARPRTFTISQDRDDTKGVDIMLSIDVSLSMLAK---------------DLNPDRITALK 114 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + + F + + +G++ Y + + + V + + + T Sbjct: 115 DIAVKF---VQKRPND----RIGVVAYAAEAFTKVPVTSDHQVVIDEIKNLNSAGLEPGT 167 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A L K +S K +I +TDG +N + + Sbjct: 168 AIGEGLSVAVNHLVKSKAKS------------------KVVILMTDGVSNIQNAIPPQVA 209 Query: 353 ICDKAKENFIKIVTISI-----------------------NASPNGQRLLKTCVSSPEYH 389 + AK N IK+ I I + + L + ++ + Sbjct: 210 A-ELAKNNNIKVYAIGIGTNGYALMPTSQDIFGDLVFTETEVTIDENTLREIAQTTGGKY 268 Query: 390 YNVVNADSLIHVFQNISQL 408 + + SL V+ I+QL Sbjct: 269 FRATSNSSLEEVYDEINQL 287 >gi|242097078|ref|XP_002439029.1| hypothetical protein SORBIDRAFT_10g030210 [Sorghum bicolor] gi|241917252|gb|EER90396.1| hypothetical protein SORBIDRAFT_10g030210 [Sorghum bicolor] Length = 607 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 73/224 (32%), Gaps = 51/224 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 R + +V+D+SGSM + ++ LK+A+ + + Sbjct: 99 TAGRAALDLVVVLDVSGSM-------------------RDFGRLDKLKSAMRFIIKKLAP 139 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQ 303 + + ++ + + +E +T +D L+ T+ +K Q Sbjct: 140 MD------RLSVVTFNGGATRECPLRAMSEDAVPVLTDIVDGLVARGGTNIEAGLKMGLQ 193 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 +L + T +I ++DGE N+ + Sbjct: 194 VLDGRRYTGARTAG---------------VILMSDGEQNSGDA-TRVRN------PQNYP 231 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNAD--SLIHVFQNI 405 + T+S S LL+ + V+++ S++ VF + Sbjct: 232 VYTLSF-GSNADMNLLQKLAGGGGTYNPVLDSGGMSMLDVFSQL 274 >gi|47212423|emb|CAF93579.1| unnamed protein product [Tetraodon nigroviridis] Length = 688 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 63/171 (36%), Gaps = 25/171 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDS 294 +D ++ L +G++ Y+++V + V + + L T + Sbjct: 28 RFMIDILNTLDIGLNSTRVGVVQYSSQVRSEFSLR-SHASLDSMVKAIQEMVPLAQGTMT 86 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + + + K+P++ ++ +TDG + ++ Sbjct: 87 GLAIRYTMNVAFTAAEGD--------RPKVPNV-----VVIVTDG---RPQD--RVAEVA 128 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 +A+E ++I + + L+ S P ++ + V + LIH F Sbjct: 129 AEARERGMEIYAVGVA--RADMTSLRAMASPPFEDHVFLVESF-DLIHQFG 176 >gi|47168593|pdb|1Q0P|A Chain A, A Domain Of Factor B Length = 223 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 80/226 (35%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A K +L+ ++ + Sbjct: 8 IVLDPSGSMNIYLVLDGSDSIGASNFTG--------AKKSLVNLIEKVASYGVKPRY--- 56 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 57 GLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 113 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN-- 360 ++ + II +TDG +N + + K ++N Sbjct: 114 -------SWPDDVPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPR 166 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 167 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 212 >gi|332208765|ref|XP_003253479.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-4-like [Nomascus leucogenys] Length = 448 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 >gi|193786651|dbj|BAG51974.1| unnamed protein product [Homo sapiens] Length = 581 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 43.3 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 372 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 429 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 430 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 474 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 475 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRSSI 524 >gi|13699836|ref|NP_085095.1| matrilin-4 isoform 3 precursor [Homo sapiens] Length = 499 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 290 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 347 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 348 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 392 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 393 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 442 >gi|134093095|gb|ABO52955.1| matrilin 4 isoform 1 precursor [Gorilla gorilla gorilla] Length = 581 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 372 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 429 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 430 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 474 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 475 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 524 >gi|119596273|gb|EAW75867.1| matrilin 4, isoform CRA_b [Homo sapiens] Length = 620 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 411 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 468 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 469 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 513 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 514 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 563 >gi|13699830|ref|NP_003824.2| matrilin-4 isoform 1 precursor [Homo sapiens] gi|4499937|emb|CAB39280.1| matrilin 4 [Homo sapiens] Length = 581 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 372 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 429 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 430 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 474 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 475 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 524 >gi|13699834|ref|NP_085080.1| matrilin-4 isoform 2 precursor [Homo sapiens] gi|119596274|gb|EAW75868.1| matrilin 4, isoform CRA_c [Homo sapiens] Length = 540 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 331 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 388 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 389 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 433 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 434 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 483 >gi|73920229|sp|O95460|MATN4_HUMAN RecName: Full=Matrilin-4; Flags: Precursor gi|5419632|emb|CAB46380.1| matrilin 4 [Homo sapiens] Length = 622 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 413 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 470 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 471 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 515 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 516 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 565 >gi|325570952|ref|ZP_08146571.1| von Willebrand factor type A domain protein [Enterococcus casseliflavus ATCC 12755] gi|325156278|gb|EGC68462.1| von Willebrand factor type A domain protein [Enterococcus casseliflavus ATCC 12755] Length = 1176 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 31/153 (20%), Positives = 56/153 (36%), Gaps = 43/153 (28%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ +K + FL+ I+ S +++ Sbjct: 336 IDVVLVVDWSGSM-------------------NEMGRITEVKKGVDRFLNQIE-GSGIQD 375 Query: 251 DVYMGLIGY----TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 VYMG +GY K + +V++ + T + ++QA +L+ Sbjct: 376 SVYMGYVGYSSDGNNYQNKTCQLG-KFSEVKETIRTMTPETAAGGTFTQRGLRQAGDMLS 434 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + +K I+ LTDG Sbjct: 435 TQNGH------------------KKVIVLLTDG 449 >gi|212634559|ref|YP_002311084.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556043|gb|ACJ28497.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 328 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 68/220 (30%), Gaps = 43/220 (19%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + VDLSGSM D + + ++ +K+ LL FL S Sbjct: 95 MMVAVDLSGSMEARDFVDLKGDKTR---------RIDGVKSLLLDFLAQ--RASD----- 138 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + + + + + T A+ A + Sbjct: 139 RVGLIAFGDAAYLQAPFTEDKGALSLLLKEMDVRMAGAGTALGDAIGVAVNHFSH----- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 S K ++ LTDG N+ S ++ A + I I I+I Sbjct: 194 -------------SDTDNKVLLLLTDG--NDTSSEFPPLEAARYAAQQGIVIYPIAIGDP 238 Query: 373 PN------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 N +L+ + + + ++ V+ I Sbjct: 239 ANVGEDSLDIEMLQQIADITYGQVFEAQDGEAFTQVYSII 278 >gi|315444579|ref|YP_004077458.1| Mg-chelatase subunit ChlD [Mycobacterium sp. Spyr1] gi|315262882|gb|ADT99623.1| Mg-chelatase subunit ChlD [Mycobacterium sp. Spyr1] Length = 335 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 71/234 (30%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D ++ A + A F + + Sbjct: 99 VMLVIDVSQSM---------------RATDVAPNRLTAAQEAAKQF------ADQLTPGI 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ E + + + + T + + A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNRESTKTAIDKLQLADR---TATGEGIFTALQAIATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICD-KAKENFIKIVTISI 369 + P ++ ++DG+ N AK+ + I T+S Sbjct: 194 ---------IGGGDEPPPARVVLMSDGKETVPSNPDNPKGAYTAARTAKDQGVPISTVSF 244 Query: 370 NAS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P +LK S + + + L VF N+ + + Sbjct: 245 GTPYGYVEINEQRQPVPVDDEMLKKIADLSGGEAFTASSLEQLKQVFTNLQEQI 298 >gi|195978918|ref|YP_002124162.1| fimbrial structural subunit protein FszD [Streptococcus equi subsp. zooepidemicus MGCS10565] gi|195975623|gb|ACG63149.1| fimbrial structural subunit protein FszD [Streptococcus equi subsp. zooepidemicus MGCS10565] Length = 967 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 46/265 (17%), Positives = 94/265 (35%), Gaps = 44/265 (16%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF--LDSI 242 G++ + +VVD S SM ++ + + A AL A L +I Sbjct: 409 TGKKQPLDVLVVVDRSASMKEGISQNDIPRDQAVKN---------ALTGAGGLLQKFINI 459 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK--- 299 + + + + G + Y +R K SW + + + + D+ +LK +S+ A+ Sbjct: 460 NAENKLSVIGFQGSLNYNSREGKPERISWRSIIYQPSINNNKDADVLKNWESSSALNRDD 519 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-------------------- 339 +Y+ + K+ +K ++F++DG Sbjct: 520 LSYKDKNGTNYHAALVKADEMLNKVADDGHRKIMVFVSDGVPTFYFGSDHYRAGNGTSDA 579 Query: 340 ENNNFKSNVNTIKICD-KAKENFIKIVTISINASPNGQRL------LKTCVSSPEYHYNV 392 N + I D K K + I ++ ++ N LK +S +++Y + Sbjct: 580 SNIKSSQDGTRAAIDDFKKKHPNLSIYSLGVSKDINSDTASSSPVVLKY-LSGEDHYYGI 638 Query: 393 VNADSLIHVFQNISQL--MVHRKYS 415 N L + I + + H + S Sbjct: 639 TNTVELEKIANKIVEDSKVSHLEIS 663 >gi|48428051|sp|Q864W1|CFAB_PONPY RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690183|gb|AAM10003.1| complement factor B precursor [Pongo pygmaeus] Length = 764 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 79/226 (34%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A K L+ ++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGAGNFTG--------AKKCLVNLIEKVASYGVKPRY--- 309 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 310 GLVTYATYPKIWVKVSEPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 366 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN-- 360 ++ + II +TDG +N + + K ++N Sbjct: 367 -------SWPDDIPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPR 419 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 420 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 465 >gi|145224243|ref|YP_001134921.1| hypothetical protein Mflv_3659 [Mycobacterium gilvum PYR-GCK] gi|189040172|sp|A4T9I4|Y3659_MYCGI RecName: Full=UPF0353 protein Mflv_3659 gi|145216729|gb|ABP46133.1| von Willebrand factor, type A [Mycobacterium gilvum PYR-GCK] Length = 335 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 71/234 (30%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D ++ A + A F + + Sbjct: 99 VMLVIDVSQSM---------------RATDVAPNRLTAAQEAAKQF------ADQLTPGI 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ E + + + + T + + A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNRESTKTAIDKLQLADR---TATGEGIFTALQAIATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICD-KAKENFIKIVTISI 369 + P ++ ++DG+ N AK+ + I T+S Sbjct: 194 ---------IGGGDEPPPARVVLMSDGKETVPSNPDNPKGAYTAARTAKDQGVPISTVSF 244 Query: 370 NAS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P +LK S + + + L VF N+ + + Sbjct: 245 GTPYGYVEINEQRQPVPVDDEMLKKIADLSGGEAFTASSLEQLKQVFTNLQEQI 298 >gi|254882023|ref|ZP_05254733.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294776174|ref|ZP_06741663.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|319640969|ref|ZP_07995677.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] gi|254834816|gb|EET15125.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294449997|gb|EFG18508.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|317387414|gb|EFV68285.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] Length = 332 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 48/281 (17%), Positives = 79/281 (28%), Gaps = 68/281 (24%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y F+ + I++ + E I L VD+S SM Sbjct: 51 YLLHAPFVLRTVAIIMVILILARPQTTDNWQNTEIEGIDIMLAVDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K F++ +GL + + Sbjct: 104 --------DLKPNRLEAAKQVASEFIN----GRPNDN---IGLTIFAGESFTQCPLTVDH 148 Query: 274 EKVRQYVTRD-----MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + LI T + A L K +S Sbjct: 149 GVLLNLFNSIKGDIAQRGLIEDGTAIGMGIANAVTRLKDSKAKS---------------- 192 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS-------------------I 369 K II LTDG NN + T + AK+ I+I TI + Sbjct: 193 --KVIILLTDGSNNRGDISPLTAA--EIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYV 248 Query: 370 NAS--PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 N + + L + ++ ++ + L V+Q I +L Sbjct: 249 NVPVEIDEKTLTEIAGTTNGNYFRATSNSKLKEVYQEIDKL 289 >gi|156408065|ref|XP_001641677.1| predicted protein [Nematostella vectensis] gi|156228817|gb|EDO49614.1| predicted protein [Nematostella vectensis] Length = 1418 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/123 (19%), Positives = 47/123 (38%), Gaps = 19/123 (15%) Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ A + AY I+ G K P+ + + LTDG+ + Sbjct: 86 FRSGMTNMHDAFQAAYDIIFGK----------YSGHKRPTHQVKTAVFLLTDGQ---WNW 132 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 N + I + K+ I+I TI + N L ++SP +++ + F+ ++ Sbjct: 133 NGDPWPIAKRLKDRGIEIFTIGVTNGVNVNTLRS--LASPNNYFHYNDFTQ----FRELA 186 Query: 407 QLM 409 + Sbjct: 187 TCI 189 >gi|311033467|sp|A8K7I4|CLCA1_HUMAN RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Short=hCLCA1; AltName: Full=Calcium-activated chloride channel protein 1; Short=CaCC-1; Short=hCaCC-1; Flags: Precursor gi|56203695|emb|CAI22169.1| CLCA family member 1, chloride channel regulator [Homo sapiens] Length = 914 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 85/240 (35%), Gaps = 69/240 (28%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL + + ++ Sbjct: 309 LVLDKSGSMA-------------------TGNRLNRLNQAGQLFL--LQTVELGS---WV 344 Query: 255 GLIGYTTRV-EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + +N + R + + + + T ++ A+ ++ ++ + Sbjct: 345 GMVTFDSAAHVQNELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI----RKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 401 PTDGSE-------------IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 Q L L+T S V N + LI F +S + R + KG Sbjct: 440 SAAQELEELSKMTGGLQTYASD-----QVQN-NGLIDAFGALSSGNGAVSQRSIQLESKG 493 >gi|257867801|ref|ZP_05647454.1| von Willebrand factor type A/Cna B-type domain-containing protein [Enterococcus casseliflavus EC30] gi|257874128|ref|ZP_05653781.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC10] gi|257801884|gb|EEV30787.1| von Willebrand factor type A/Cna B-type domain-containing protein [Enterococcus casseliflavus EC30] gi|257808292|gb|EEV37114.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC10] Length = 1191 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/152 (21%), Positives = 60/152 (39%), Gaps = 41/152 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++A +K + FL+ I+ S +++ Sbjct: 351 IDVVLVVDWSGSM-------------------NEMGRIAEVKKGVDRFLNQIE-GSGIQD 390 Query: 251 DVYMGLIGYTT--RVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 VYMG +GY++ +N G +V++ + T + ++QA +L++ Sbjct: 391 SVYMGYVGYSSDGSNYQNKTCQLGKFSEVKETIRSMTPETAAGGTFTQRGLRQAGDMLST 450 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 451 QNGH------------------KKVIVLLTDG 464 >gi|222081474|ref|YP_002540837.1| von Willebrand factor, type A [Agrobacterium radiobacter K84] gi|221726153|gb|ACM29242.1| von Willebrand factor, type A [Agrobacterium radiobacter K84] Length = 329 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 75/223 (33%), Gaps = 43/223 (19%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P + L +DLS SM D ++ A++ + F+ Sbjct: 91 EPQRDLMLGLDLSQSM---------DTKDFRAPDGNLEARVDAVRKVVGDFVAR------ 135 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + +GLI + + E VR+ ++ + + T A+ A Sbjct: 136 -RPGDRIGLIAFGDAPYPLAPFTMDHELVREIISGTLPGIAGPRTSLGDAVGLA------ 188 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + ++P +K +I LTDG N+ S + +K + AK N + + T+ Sbjct: 189 -----------IKMFEKTTVP-EKVLIVLTDG--NDTASKMPPLKAAEIAKRNGVVVHTV 234 Query: 368 SINASPN-GQRLLKTCV------SSPEYHYNVVNADSLIHVFQ 403 I G+ L ++ ++ + L + Sbjct: 235 GIGDPQATGEDRLDATALEKIAETTGGRYFFGGDQAQLAAAYD 277 >gi|110611231|ref|NP_001276.2| calcium-activated chloride channel regulator 1 precursor [Homo sapiens] gi|146327635|gb|AAI41452.1| Chloride channel accessory 1 [synthetic construct] gi|162318850|gb|AAI56806.1| Chloride channel accessory 1 [synthetic construct] Length = 914 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 85/240 (35%), Gaps = 69/240 (28%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL + + ++ Sbjct: 309 LVLDKSGSMA-------------------TGNRLNRLNQAGQLFL--LQTVELGS---WV 344 Query: 255 GLIGYTTRV-EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + +N + R + + + + T ++ A+ ++ ++ + Sbjct: 345 GMVTFDSAAHVQNELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI----RKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 401 PTDGSE-------------IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 Q L L+T S V N + LI F +S + R + KG Sbjct: 440 SAAQELEELSKMTGGLQTYASD-----QVQN-NGLIDAFGALSSGNGAVSQRSIQLESKG 493 >gi|291402773|ref|XP_002718214.1| PREDICTED: integrin, alpha 11 [Oryctolagus cuniculus] Length = 1188 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y + V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGVVQYGEDAVHEFHL-NDYKSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLKKVIRQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 288 ERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|319784280|ref|YP_004143756.1| hypothetical protein Mesci_4597 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170168|gb|ADV13706.1| hypothetical protein Mesci_4597 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 643 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 68/206 (33%), Gaps = 21/206 (10%) Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 L GA M + + + +++SN A A+R + + L Y+ F+N + I Sbjct: 9 LMGALAMAVDFTEMSREKQAVSNALDAANFATARRLTEGATDDQLRAYALDFFNANLNKI 68 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 ++ + +N M Y + L + I + Sbjct: 69 NPANTTLTVTLPSNTTGGG----LLKMTARLDYKPYFYPVFGQLVGKSETDANQRISFNI 124 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD- 240 E+ + + LV+D SGSM +T++ LK A +D Sbjct: 125 TSEVRLKNTLEVALVLDNSGSMT-------------KTGTGSGQTRIDLLKTAAKQLVDT 171 Query: 241 ---SIDLLSHVKEDVYMGLIGYTTRV 263 ++ V V GL+ + V Sbjct: 172 LAQQAAMIKQVDRPVQFGLVPFAASV 197 Score = 39.9 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 35/97 (36%), Gaps = 20/97 (20%) Query: 338 DGENNNFKS--NVNTIKICDKAKENFIKIVTISI------NASPNGQRLLKTC------- 382 D N N+ + N +C+ AK I ++T+++ A LK+C Sbjct: 545 DYSNGNYTNALNEQMATLCNNAKAANIMVMTVALDLSTTKTADQQAIEALKSCSSNSRFR 604 Query: 383 -----VSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 S P + SL + F+ I + + + Sbjct: 605 KDPTDASKPAKLFWNATGASLSNDFKEIGNELSNLRI 641 >gi|260769129|ref|ZP_05878062.1| putative hemolysin [Vibrio furnissii CIP 102972] gi|260614467|gb|EEX39653.1| putative hemolysin [Vibrio furnissii CIP 102972] Length = 1476 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/162 (20%), Positives = 59/162 (36%), Gaps = 20/162 (12%) Query: 191 FLIELVVDLSGSMHCAM-NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I L++D SGSM + +D NS ++M + +AL + V Sbjct: 865 YNIALIIDTSGSMKFDLAGNDNGFSNSYQSQSQYNASRMKLVIDALTNLATDLVNHDGV- 923 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + + LIG+ + + + ++Q +T D T+ A A Sbjct: 924 --ININLIGFESSAHSALTLQLTADNLQQLLTEIQDMDAEGGTNYEAAFDLA-------- 973 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 +N+F P+ ++ FLTDG+ S N Sbjct: 974 -----SNWFSHQ---PTEGYENLTYFLTDGDPTFSNSGDNGA 1007 >gi|332844134|ref|XP_510503.3| PREDICTED: integrin alpha-11 [Pan troglodytes] Length = 1188 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 288 ERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|332236004|ref|XP_003267196.1| PREDICTED: integrin alpha-11 [Nomascus leucogenys] Length = 1188 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 288 ERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|297296749|ref|XP_001083531.2| PREDICTED: integrin alpha-11 [Macaca mulatta] Length = 1149 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 288 ERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|296213559|ref|XP_002753319.1| PREDICTED: integrin alpha-11 [Callithrix jacchus] Length = 1188 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 288 ERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|193788254|dbj|BAG53148.1| unnamed protein product [Homo sapiens] Length = 437 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 288 ERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|158258322|dbj|BAF85134.1| unnamed protein product [Homo sapiens] Length = 1188 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 288 ERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|119598226|gb|EAW77820.1| integrin, alpha 11, isoform CRA_a [Homo sapiens] Length = 1189 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 288 ERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|5915662|gb|AAD51919.2|AF137378_1 integrin alpha 11 subunit precursor [Homo sapiens] gi|119598227|gb|EAW77821.1| integrin, alpha 11, isoform CRA_b [Homo sapiens] Length = 1188 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 288 ERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|6013141|gb|AAF01258.1|AF109681_1 integrin alpha-11 subunit precursor [Homo sapiens] Length = 1189 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 288 ERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|52485853|ref|NP_001004439.1| integrin alpha-11 precursor [Homo sapiens] gi|313104119|sp|Q9UKX5|ITA11_HUMAN RecName: Full=Integrin alpha-11; Flags: Precursor gi|189442879|gb|AAI67840.1| Integrin, alpha 11 [synthetic construct] Length = 1188 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 288 ERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|310823567|ref|YP_003955925.1| Bata protein [Stigmatella aurantiaca DW4/3-1] gi|309396639|gb|ADO74098.1| BatA protein [Stigmatella aurantiaca DW4/3-1] Length = 302 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 76/247 (30%), Gaps = 64/247 (25%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I + +DLS SM + + ++ K L F+ + ++ Sbjct: 58 IDIVVALDLSTSMEAG--------------DFRPQNRLHVAKEVLAEFIS--NRVND--- 98 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + VR+ + + ++ T A+ + L + Sbjct: 99 --RIGLVVFAGAAYTQAPLTLDYGVVREVLKQIRTRVLEDGTAIGDALATSLNRLRDSEA 156 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 +S + ++ +TDG+NN K + A+ I I TI + Sbjct: 157 KS------------------RVVVLITDGDNNAGKISPLDAA--SMAESLKIPIYTILVG 196 Query: 370 ---NAS-PNGQ-----------------RLLKTCVS-SPEYHYNVVNADSLIHVFQNISQ 407 P GQ LL+ S + +Y + + L Q + Sbjct: 197 KGGKVPFPQGQDLFGNTVWRDTEIPINPELLQDIASRTGGEYYRATDPEGLKQGLQKVLD 256 Query: 408 LMVHRKY 414 + K Sbjct: 257 SLERSKL 263 >gi|257879128|ref|ZP_05658781.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] gi|257813356|gb|EEV42114.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] Length = 1258 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 69/229 (30%), Gaps = 30/229 (13%) Query: 93 DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT 152 D N + + E +N+ + N + N+ + YN + Sbjct: 40 DYTNGGANWRNYDYATNNGSTAGEEDRNV----PAQTVQLWGNERNFENSYLDYNGAYIK 95 Query: 153 SYDYRLQFIEHL--LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + + L+ + + ++ E E I V+D S SM Sbjct: 96 KWVEPVLPSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGT 155 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--------TTR 262 +TK AAL A+ S DLLS D+ +G++ + Sbjct: 156 N------------SQTKNAALIEAVNEM--SKDLLSDPSLDIRIGMVNFYHNSTAINNHE 201 Query: 263 VEKN--IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + + ++ ++ + T T +K Y+ L D Sbjct: 202 QISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDN 250 >gi|115379116|ref|ZP_01466240.1| von Willebrand factor type A domain, putative [Stigmatella aurantiaca DW4/3-1] gi|115363899|gb|EAU63010.1| von Willebrand factor type A domain, putative [Stigmatella aurantiaca DW4/3-1] Length = 284 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 76/247 (30%), Gaps = 64/247 (25%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I + +DLS SM + + ++ K L F+ + ++ Sbjct: 40 IDIVVALDLSTSMEAG--------------DFRPQNRLHVAKEVLAEFIS--NRVND--- 80 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + VR+ + + ++ T A+ + L + Sbjct: 81 --RIGLVVFAGAAYTQAPLTLDYGVVREVLKQIRTRVLEDGTAIGDALATSLNRLRDSEA 138 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 +S + ++ +TDG+NN K + A+ I I TI + Sbjct: 139 KS------------------RVVVLITDGDNNAGKISPLDAA--SMAESLKIPIYTILVG 178 Query: 370 ---NAS-PNGQ-----------------RLLKTCVS-SPEYHYNVVNADSLIHVFQNISQ 407 P GQ LL+ S + +Y + + L Q + Sbjct: 179 KGGKVPFPQGQDLFGNTVWRDTEIPINPELLQDIASRTGGEYYRATDPEGLKQGLQKVLD 238 Query: 408 LMVHRKY 414 + K Sbjct: 239 SLERSKL 245 >gi|332185455|ref|ZP_08387203.1| hypothetical protein SUS17_655 [Sphingomonas sp. S17] gi|332014433|gb|EGI56490.1| hypothetical protein SUS17_655 [Sphingomonas sp. S17] Length = 530 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 52/182 (28%), Gaps = 41/182 (22%) Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 W + +++Y+ T M + S K Sbjct: 358 AASMQGWSRDDLKKYLNTL---TPDGGTYHDNGMMWG-----ARWASSGGIFGGNNPEKY 409 Query: 325 PSLPFQKFIIFLTDG---------------ENNNFKSNVNTIK----------------I 353 +P +K+IIF+TDG E + ++ + Sbjct: 410 NMMPVKKYIIFMTDGLFETGYSRLYSSYGVEQLDARATPGGAYSNQDDQLARHKQRFNLL 469 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 C KAK I + + + L C S+P N +L+ F I + + + Sbjct: 470 CSKAKSMGYSIWVLGFATTLD--ASLTNCASTPSQASTSSNQAALMARFVEIGKNIGALR 527 Query: 414 YS 415 + Sbjct: 528 LT 529 >gi|149692293|ref|XP_001495918.1| PREDICTED: integrin, alpha 11 [Equus caballus] Length = 1188 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 288 ERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|300871001|ref|YP_003785873.1| aerotolerance-like membrane protein [Brachyspira pilosicoli 95/1000] gi|300688701|gb|ADK31372.1| aerotolerance-related membrane protein [Brachyspira pilosicoli 95/1000] Length = 328 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 46/282 (16%), Positives = 90/282 (31%), Gaps = 66/282 (23%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 Y + FI +L ++ ++ + I +VVD+S SM Sbjct: 46 SKYYAKDIPFILLMLGLTFSIIGLARPATVDSSANINGEGIYISMVVDISPSMMAE---- 101 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 D T++ A K + F+ + + L+ + R + Sbjct: 102 -----------DMLPTRLEASKKTMADFIKK-------RNFDKISLVAFALRASVLSPST 143 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + + + + ++ T + A +L S K + + Sbjct: 144 FDYTSLEKEI-GNIKIDEEGSTSIGLGIATAVDMLRSVKDDA-----------------E 185 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG--------------- 375 K II LTDGENN+ + + + A IKI TI I + Sbjct: 186 KVIILLTDGENNSGEIDPKLAS--EIASNFNIKIYTIGIGDAAGSHAWVTYTDPNYGKRR 243 Query: 376 ---------QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + L++ + ++N + +L +V+ I +L Sbjct: 244 IRADFTLNEKALIEIASITGGKYFNAKTSSALDNVYNTIDRL 285 >gi|240169099|ref|ZP_04747758.1| hypothetical protein MkanA1_07284 [Mycobacterium kansasii ATCC 12478] Length = 741 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 72/224 (32%), Gaps = 59/224 (26%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV---------------VDLSGSMHCA 206 + +L R ++ +S L+ G+ + + LV +D SGSM Sbjct: 252 DFVLRFRLDRGALSSSALLVPDADGDEGTWSVTLVPPAEPSSAPRDVVVVLDRSGSMGGW 311 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 KM A + A +D +D + ++ + R++ Sbjct: 312 --------------------KMVAARRAAGRIVDMLDTVD------RFCVLAFDDRIDTP 345 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + G + R + L S + A + L + ++ RQ Sbjct: 346 TDMAPGLVEGSDQ-NRFAAASWLGSLRSRGGTEMA-EPLRRAVELLAGSDEGRQAS---- 399 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF-IKIVTISI 369 ++ +TDG+ + ++ A+ I+I + I Sbjct: 400 ------VVLVTDGQITGEDHLLRSL-----AQSLGRIRIYCVGI 432 >gi|325108274|ref|YP_004269342.1| von Willebrand factor A [Planctomyces brasiliensis DSM 5305] gi|324968542|gb|ADY59320.1| von Willebrand factor type A [Planctomyces brasiliensis DSM 5305] Length = 1584 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/197 (20%), Positives = 67/197 (34%), Gaps = 39/197 (19%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 L QR + I A + + + + +V+D S SM+ + P + + + Sbjct: 1029 LGQRLSAAIADETSAQVTVIDSQAKPVSVMVVLDCSQSMN---QTIPWEASQ------QD 1079 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ------ 278 TK A A L+++ L + +GL+ Y RV W T Q Sbjct: 1080 LTKFDAAITATNQILNALSRLPN----ARVGLVLYGHRV------GWNTASPSQILRQTG 1129 Query: 279 YVTRDMDSL-ILKPTDSTPAM-------KQAYQILTSDKK------RSFFTNFFRQGVKI 324 Y DSL + T+ + A + L K + Sbjct: 1130 YANPIPDSLMPYEDTEIVLPFGRFARAERDAVEKLLKTVKPWGETPLYLSIRNALNEMNS 1189 Query: 325 PSLPFQKFIIFLTDGEN 341 SL ++ I+ +TDG N Sbjct: 1190 KSLGGKQQIVVITDGVN 1206 >gi|301768026|ref|XP_002919432.1| PREDICTED: calcium-activated chloride channel regulator 1-like [Ailuropoda melanoleuca] Length = 913 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 44/223 (19%), Positives = 81/223 (36%), Gaps = 66/223 (29%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL I V++ ++ Sbjct: 309 LVLDKSGSMS-------------------NGDRLKRLNQAGKLFLLQI-----VEQGSWV 344 Query: 255 GLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E R +T+ + ++ T ++ A+ ++ ++ F Sbjct: 345 GMVTFDSAAHVQSELVQINGATERDALTKSLPTVASGGTSICSGLRSAFAVI----RKKF 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 401 STDGSE-------------IVLLTDGEDN-------TISSCFNEVKQSGAVIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + L L+T S N + LI F +S Sbjct: 440 SAAKELEELSKMTGGLQTYASD-----QAQN-NGLIDAFGALS 476 >gi|188994392|ref|YP_001928644.1| putative aerotolerance-related exported protein BatB [Porphyromonas gingivalis ATCC 33277] gi|188594072|dbj|BAG33047.1| putative aerotolerance-related exported protein BatB [Porphyromonas gingivalis ATCC 33277] Length = 339 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 65/186 (34%), Gaps = 45/186 (24%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + +D+S SM +C+D K +++ K L D + Sbjct: 85 PKKEKGIEAMICLDISNSM---------------LCEDVKPNRLSFAKQVLGKLFDGLQ- 128 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQ 303 + V GL+ + I + +Q++ +++ T A++ A + Sbjct: 129 ----NDKV--GLVVFAGNAYTQIPITTDLSAAKQFLADISPNMVTAQGTAIGAAIELASK 182 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + +K+ K II LTDGEN N I+ +A E I+ Sbjct: 183 SFSDNKEIG------------------KTIIVLTDGEN----HEGNAIEAAQQAHEAGIR 220 Query: 364 IVTISI 369 + I + Sbjct: 221 VNVIGL 226 >gi|149188995|ref|ZP_01867284.1| hypothetical protein VSAK1_21554 [Vibrio shilonii AK1] gi|148837181|gb|EDL54129.1| hypothetical protein VSAK1_21554 [Vibrio shilonii AK1] Length = 266 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 83/240 (34%), Gaps = 43/240 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ + + VDLSGSM D + +N ++ +K L FL Sbjct: 24 EKSGREMMVAVDLSGSMEAKDFVDQQGINVR---------RIDGVKLLLESFLQQ----- 69 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GLI + + + + + + + T A+ A Sbjct: 70 --RTGDRVGLIAFGDDAYLQAPVTEDFDTLSLLLEQMDVRMAGAGTALGDAIGVA----- 122 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 N F S K ++ LTDG++ + A E + I Sbjct: 123 --------VNHFEH-----SESNNKVLLLLTDGKDTTSQFPPVDAA--HFAGERGVTIYP 167 Query: 367 ISI-NASPNGQR------LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 I+I +A+ G+ L + + + ++ D+L V+Q +++L S +++ Sbjct: 168 IAIGDATNVGEEAIDLEMLARIASYTGGQVFEALDGDALAAVYQTLNELEASLFESYVIR 227 >gi|194388296|dbj|BAG65532.1| unnamed protein product [Homo sapiens] Length = 650 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIQQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 + + + +++ +K S P+ + +NV + +L + + Sbjct: 288 ERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|78186535|ref|YP_374578.1| hypothetical protein Plut_0657 [Chlorobium luteolum DSM 273] gi|78166437|gb|ABB23535.1| putative membrane protein [Chlorobium luteolum DSM 273] Length = 356 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 19/81 (23%), Positives = 39/81 (48%), Gaps = 4/81 (4%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + + + S++ +I+FA+ + L + + H K +++A +AA L GA Sbjct: 6 HSSRRLQSQRGGTAILFAIVLPVLLGFAALAVDLARIHLVKVELQNAADAASLGGA---- 61 Query: 70 SNLSRLGDRFESISNHAKRAL 90 +LS G + + S + +AL Sbjct: 62 RSLSDPGGQPYNWSAASIKAL 82 >gi|320353059|ref|YP_004194398.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121561|gb|ADW17107.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 336 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 79/240 (32%), Gaps = 50/240 (20%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + M + LV+D+SGSM +K+ A+K A Sbjct: 53 AITGQAGASGMTGGTGLDLMLVIDVSGSMSG--------------------SKLTAVKAA 92 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTR---VEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + ++S+ +G++ Y++ VE + + + + S Sbjct: 93 AVALVNSLP-----NNTTQVGIVKYSSSANMVEMLQDLTSNKSDLIATINGLSAS---GS 144 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A++ A L S + + + L+DGE + ++ Sbjct: 145 TATGTAIQVATAELLSSRAIAGHAKME---------------VVLSDGE---YNVGIDPK 186 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY-NVVNADSLIHVFQNISQLMV 410 +A I + T+ + G ++ + + NV N + L+ +F +V Sbjct: 187 IAAAQAHAQGITVHTVGVQLYGTGYTSMQQTAVAGGGIFTNVNNLNDLVALFSGTGGNLV 246 >gi|314918209|gb|EFS82040.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA1] Length = 320 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 38/241 (15%), Positives = 78/241 (32%), Gaps = 51/241 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + +D+S SM + D + T+++A K A FL + Sbjct: 86 RDRATVVVAIDVSRSM---------------VATDVEPTRLSAAKTAAKDFLGDL---PP 127 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 L+ + + + P+ V +T +L T + Y L Sbjct: 128 RFNV---SLVKFAASAQVVVPPTTDRAAVSTAITNLQ---VLPSTAIGEGI---YSSL-- 176 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + P I+ L+DG N + ++ K +A + + TI Sbjct: 177 -NALKLVPDDPKHPGQKPPAA----IVLLSDGATNVGRPSLEAAK---EAGRQHVPVYTI 228 Query: 368 SIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + N L +S ++ + L V+++I+Q + + K Sbjct: 229 AYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEK 288 Query: 414 Y 414 Sbjct: 289 V 289 >gi|260786070|ref|XP_002588082.1| hypothetical protein BRAFLDRAFT_83083 [Branchiostoma floridae] gi|229273239|gb|EEN44093.1| hypothetical protein BRAFLDRAFT_83083 [Branchiostoma floridae] Length = 528 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 ++ ++ +G++ Y+ R K + S T++ A+ Sbjct: 19 VNVVNTFDISPTATRVGVVQYSDRNSLVFNLGNKVNKPSTVSAINGISYQSGGTNTGAAL 78 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 K Q + +P K II LTDG++++ S + + Sbjct: 79 KYVRQY------------AAWREGNVP-----KVIIVLTDGKSSDSVSGPSRDLV----- 116 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 +++ I + +GQ LL+ + + N ++L IS + V Sbjct: 117 AAGVEVYAIGVGKFDHGQ-LLQIANNKQNNVIELNNFNALATKIDMISTNVCSYALHV 173 >gi|327543524|gb|EGF29943.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 274 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 44/221 (19%), Positives = 77/221 (34%), Gaps = 43/221 (19%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 E P + L+VDLSGSM KK +++ A+K L FL Sbjct: 86 TKEIPTRDLLLLVDLSGSMA---------QEDFQNDAGKKVSRLDAVKEVLDGFLAK--- 133 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ +GL+ + + + ++ + + T A+ + Sbjct: 134 ----RKGDRVGLVVFGDAAYLQAPFTTDLQLSQELLGECEVGMAGPRTAFGDAIGLGVNL 189 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 D +R+ K II LTDG N+ KS V ++ A + IKI Sbjct: 190 FDEDTERA------------------KTIIALTDG--NDTKSKVPPVEAARVATQRDIKI 229 Query: 365 VTISINASPN------GQRLLKTCVS-SPEYHYNVVNADSL 398 T++I ++ LK S + ++ +L Sbjct: 230 YTVAIGDPTTVGEDKLDEQSLKDVASETGGKYFFRRGPRAL 270 >gi|281352224|gb|EFB27808.1| hypothetical protein PANDA_008060 [Ailuropoda melanoleuca] Length = 911 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 44/223 (19%), Positives = 81/223 (36%), Gaps = 66/223 (29%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL I V++ ++ Sbjct: 309 LVLDKSGSMS-------------------NGDRLKRLNQAGKLFLLQI-----VEQGSWV 344 Query: 255 GLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E R +T+ + ++ T ++ A+ ++ ++ F Sbjct: 345 GMVTFDSAAHVQSELVQINGATERDALTKSLPTVASGGTSICSGLRSAFAVI----RKKF 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 401 STDGSE-------------IVLLTDGEDN-------TISSCFNEVKQSGAVIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + L L+T S N + LI F +S Sbjct: 440 SAAKELEELSKMTGGLQTYASD-----QAQN-NGLIDAFGALS 476 >gi|194374787|dbj|BAG62508.1| unnamed protein product [Homo sapiens] Length = 677 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 42/234 (17%), Positives = 83/234 (35%), Gaps = 57/234 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL + + ++ Sbjct: 72 LVLDKSGSMA-------------------TGNRLNRLNQAGQLFL--LQTVELGS---WV 107 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R + + + + T ++ A+ ++ ++ + Sbjct: 108 GMVTFDSAAHVQSELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI----RKKY 163 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 164 PTDGSE-------------IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVAL-GP 202 Query: 373 PNGQRL--LKTCVSSPEYH-YNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 Q L L + + + V + LI F +S + R + KG Sbjct: 203 SAAQELEELSKMTGGLQTYASDQVQNNGLIDAFGALSSGNGAVSQRSIQLESKG 256 >gi|126306100|ref|XP_001362237.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 959 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 45/233 (19%), Positives = 82/233 (35%), Gaps = 55/233 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL I +++ + Sbjct: 309 LVLDKSGSMAGG-------------------DRLNRLNQASQLFLLQI-----IEKGSWT 344 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + T+ R + + + T ++ A+ ++ K F Sbjct: 345 GMVTFDSSATIQSALIQIETDAQRNSLISRLPTAAGGGTSICSGLRTAFTVI----KNKF 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISI--N 370 T+ I+ LTDGE++ TI C D+ K++ I T+++ + Sbjct: 401 STDGSE-------------IVLLTDGEDS-------TISSCFDEVKQSGAIIHTVALGPS 440 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL---MVHRKYSVILKG 420 A P + L K + + LI F +S + R + KG Sbjct: 441 ADPGLEELAKMTGGMKTSATDNAQNNGLIDAFSALSSENGAITQRSIQLDSKG 493 >gi|327270778|ref|XP_003220165.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 925 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 73/213 (34%), Gaps = 53/213 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D+SGSM ++A LK A F+ ++++ ++ Sbjct: 306 LVLDVSGSMSGN-------------------NRIARLKQAAETFILQ-----NIEDGSWV 341 Query: 255 GLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + VR+ + + T+ +++ +Q+ +S + Sbjct: 342 GIVTFNNAATIQTGLQQVVSDTVRKTLNGYLPISANGGTNICAGVQKGFQVFSSKYASTE 401 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 I+ LTDGE++ S C + + + I TI++ Sbjct: 402 GCE----------------IVLLTDGEDSGLSS-------CFAEVQRSGSVIHTIAL-GP 437 Query: 373 PNGQRLLKTCVSSPEYHYNV---VNADSLIHVF 402 + L + ++ V++ SL F Sbjct: 438 SAAKELEMLADMTGGLKFSATDSVDSSSLEDAF 470 >gi|310657870|ref|YP_003935591.1| hypothetical protein CLOST_0560 [Clostridium sticklandii DSM 519] gi|308824648|emb|CBH20686.1| exported protein of unknown function [Clostridium sticklandii] Length = 873 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 45/248 (18%), Positives = 80/248 (32%), Gaps = 57/248 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + LV+D SGSM + D + AP+ K A + F +SI + Sbjct: 397 EKPVDVILVIDTSGSMGTRIPGD----SKAPLYY---------AKLAAINFANSIIDENP 443 Query: 248 VKEDVYM----GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + G GY + I + + + + T+ + AY Sbjct: 444 DSRVGVIEFSGGYYGYASDASTVINLTNNKANLASSINGLTTHNM---TNIQAGFRLAYN 500 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG----------ENNNFKSNVNTIKI 353 +++ K ++FLTDG +N N +TI Sbjct: 501 KISAISSTRDSV---------------KSVVFLTDGVANVSIGNWSSSNPVVHNTHTIAA 545 Query: 354 CDKAKENFIKI----VTISI-------NASPNGQRLLKTCVSSP-EYHYNVVNADSLIHV 401 + + + I TI + + + L+ V E +Y +A L V Sbjct: 546 YTEGQSLYSYINGNLFTIGLFGAISNSSVKSIARDTLQKAVYDDLEKYYEASSAVDLGPV 605 Query: 402 FQNISQLM 409 ++ ISQ + Sbjct: 606 YETISQKL 613 >gi|149923516|ref|ZP_01911918.1| hypothetical protein PPSIR1_08092 [Plesiocystis pacifica SIR-1] gi|149815646|gb|EDM75176.1| hypothetical protein PPSIR1_08092 [Plesiocystis pacifica SIR-1] Length = 716 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 18/123 (14%), Positives = 36/123 (29%), Gaps = 10/123 (8%) Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + ++ + + + +L F I +TDG+ N + +N Sbjct: 446 SGTYTHLGLQLVKDYQQAYSGSTMNNDMAPYPTADETLYFN---ILITDGQYNGYSTNAQ 502 Query: 350 TIKICDKAKENFIKIVTISI-------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ + I I A Q + + S +Y+ N L Sbjct: 503 VQGELEEMYNSGITTYVIGFGDGVDTAAAQAQLQNMAQWGSGSQNNYYDANNQTELEQAL 562 Query: 403 QNI 405 I Sbjct: 563 TTI 565 >gi|20306196|gb|AAH28343.1| Chloride channel calcium activated 3 [Mus musculus] Length = 913 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 68/186 (36%), Gaps = 51/186 (27%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM D + +M + LL V++ ++ Sbjct: 310 LVLDKSGSML----------------NDDRLNRMNQA--------SRLFLLQTVEQGSWV 345 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R + + + ++ T ++ A+ ++ K+ + Sbjct: 346 GMVTFDSAAYVQSELKQLNSGADRDLLIKHLPTVSAGGTSICSGLRTAFTVI----KKKY 401 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C D K++ I T+++ Sbjct: 402 PTDGSE-------------IVLLTDGEDN-------TISSCFDLVKQSGAIIHTVAL-GP 440 Query: 373 PNGQRL 378 + L Sbjct: 441 AAAKEL 446 >gi|296208407|ref|XP_002751080.1| PREDICTED: calcium-activated chloride channel regulator 1 [Callithrix jacchus] Length = 914 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 76/217 (35%), Gaps = 54/217 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL + + ++ Sbjct: 309 LVLDKSGSMA-------------------TGNRLNRLNQAGQLFL--LQTVELGS---WV 344 Query: 255 GLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R + + + + T ++ A+ ++ ++ + Sbjct: 345 GMVTFDSAAYVQSELVQINSGSDRDTLAKRLPTAAAGGTSICTGLRSAFTVI----RKKY 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 401 PTDGSE-------------IVLLTDGEDN-------TISGCFNEVKQSGAVIHTVAL-GP 439 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADS---LIHVFQNIS 406 Q L + + + + LI F +S Sbjct: 440 SAAQELEQLSKMTGGFQTYASDQAQNNGLIDAFGALS 476 >gi|57114202|ref|NP_001009169.1| complement factor B precursor [Pan troglodytes] gi|38502961|sp|Q864W0|CFAB_PANTR RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690185|gb|AAM10004.1| complement factor B precursor [Pan troglodytes] Length = 764 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 79/226 (34%), Gaps = 39/226 (17%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A K L+ ++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTG--------AKKCLVNLIEKVASYGVKPRY--- 309 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSD 308 GL+ Y T + ++ S +VT+ ++ + + T++ A++ Y ++ Sbjct: 310 GLVTYATHPKIWVKVSDPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--- 366 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN-- 360 ++ + II +TDG +N + + K ++N Sbjct: 367 -------SWPDDIPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPR 419 Query: 361 ----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + + ++ + V + ++L VF Sbjct: 420 EDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVF 465 >gi|126309708|ref|XP_001376394.1| PREDICTED: similar to B-factor, properdin [Monodelphis domestica] Length = 764 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 33/227 (14%), Positives = 72/227 (31%), Gaps = 41/227 (18%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D +GSM+ + D D K L +D + Sbjct: 265 IVLDPAGSMNIYLVLDASDSIGKNNFTG--------AKKCLSSLIDKVASYGVEPRY--- 313 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK------PTDSTPAMKQAYQ-ILTS 307 ++ Y T + ++ S +V ++++ + T++ A+ Y+ ++ Sbjct: 314 AVVTYATEAKAVVKLSDKESSNADWVKQELEKIKYSDHRLKAGTNTKKALTMLYEMMILQ 373 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC----DKAKEN--- 360 + + N R I+ +TDG N V I+ D + Sbjct: 374 ESQNDINWNKTRH-----------VIVLMTDGNYNMGGDPVAAIEQIREFLDIGRNRKNP 422 Query: 361 -----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + I L + ++ + V + + L +VF Sbjct: 423 RENYLDVYVFGIGPLVDQEKINALASKKDGEKHVFKVKDMEDLENVF 469 >gi|90408685|ref|ZP_01216835.1| hypothetical protein PCNPT3_08475 [Psychromonas sp. CNPT3] gi|90310199|gb|EAS38334.1| hypothetical protein PCNPT3_08475 [Psychromonas sp. CNPT3] Length = 349 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 54/273 (19%), Positives = 93/273 (34%), Gaps = 40/273 (14%) Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 T + V T + L ++ K P +R ++G I +VVDLSGS Sbjct: 57 TQGASQLVATRWQKYLLCFTWIMILFALTKPTLLGPPQVREQIGRD----IMVVVDLSGS 112 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 M Q K +++ A+K L F+ + ++ +GLI + Sbjct: 113 M---------QKKDFVNQQGHKISRLDAVKEVLTDFIKT-------RQGDRLGLILFGDA 156 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 + + + + ++ K T A+ + K Sbjct: 157 AFVQTPFTADHDVWLDLLMQTRVNMAGKSTHLGDAIGLTIKRFNEATKN----------- 205 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP--NGQRL-- 378 + +K I L+DG N+ S V I AK N ++I I+I Q L Sbjct: 206 QTSEKTREKVAIILSDG--NDTGSYVPPIDAAMVAKVNAVRIYMIAIGDPKSVGEQSLDM 263 Query: 379 --LKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + S S Y +N L++ + I +L Sbjct: 264 QTINKIASVSGGQAYQALNQQELLNAYAKIDKL 296 >gi|320102039|ref|YP_004177630.1| VWFA-like domain-containing protein [Isosphaera pallida ATCC 43644] gi|319749321|gb|ADV61081.1| VWFA-related domain protein [Isosphaera pallida ATCC 43644] Length = 784 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 82/230 (35%), Gaps = 64/230 (27%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+D SGSM ++ ALK A+ +FL L + Sbjct: 530 TVVLVLDTSGSMLQD-------------------NRIGALKEAVGVFLG---TLPPGSKV 567 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEK---------VRQYVTRDMDSLILKPTDSTPAMKQAY 302 +I + + V +G V+ V R + T A+ +A Sbjct: 568 ---AVIEFNSFVN---PLVFGPANEIFTTRFDDVKSQVNRFRAN---GGTSYYDAVDRAL 618 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-SNVNTIKICDKAKENF 361 +++ + R ++ LTDGE+ + + + ++++ + KA+ Sbjct: 619 ELIANQTGRRA-------------------VLALTDGEDTSSRLAGLDSVIL--KARNLG 657 Query: 362 IKIVTISINASPNGQ--RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + T+ + + L + + ++ +A L +F ++Q + Sbjct: 658 LPVHTLGVGREDEIEVGELQRLARETRGRYFPARDATKLRVIFAELAQSL 707 >gi|156383644|ref|XP_001632943.1| predicted protein [Nematostella vectensis] gi|156220006|gb|EDO40880.1| predicted protein [Nematostella vectensis] Length = 982 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 38/221 (17%), Positives = 74/221 (33%), Gaps = 38/221 (17%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + +V+D SGS+ K K+ +++ + +HV Sbjct: 82 IDLAMVLDSSGSIG-----------------KKDWVKLLEFTKSVVDAYSVSEEATHVGV 124 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 Y +K + +++ + L T A++ A +L ++ + Sbjct: 125 ITYSTEATLDIAFDKYSGVEMNSVNLKKDIDIIPQKNNL--TFMDKALELANSVLFTEAR 182 Query: 311 RSFFTNFFRQGVKIPSLPFQKFI-IFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTIS 368 P +K + +FLTDG + + K K+ I + T+ Sbjct: 183 --------------GMRPNKKQVCLFLTDGIQTFDQGPYTKPSIVSQKLKDRGIDVYTVG 228 Query: 369 INASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQL 408 + + LL +SS + + Y+ N D L Q I Q Sbjct: 229 VGDDVDLFELLS--ISSGDKYTYSAKNFDELQAKVQEILQE 267 >gi|261212659|ref|ZP_05926943.1| protein BatA [Vibrio sp. RC341] gi|260837724|gb|EEX64401.1| protein BatA [Vibrio sp. RC341] Length = 232 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 49/233 (21%), Positives = 80/233 (34%), Gaps = 55/233 (23%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 LVVDLS SM ED+ S D ++ A+K L F+ +E Sbjct: 1 MLVVDLSYSMS------QEDMQSGQQMVD----RLTAVKQVLSDFIAK-------REGDR 43 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +GLI + + E V Q + + + LI T + A Sbjct: 44 IGLILFADHAYLQTPLTLDRETVTQQLNQAVLKLIGTQTAIGEGIGLA------------ 91 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI---- 369 + I S Q+ II L+DG N + + AK+ I T+ + Sbjct: 92 ------TKIFIDSDAPQRVIILLSDGSNTAGVLDPLEAA--NIAKQYHSTIYTVGVGAGE 143 Query: 370 -------------NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 A ++ L+T S + ++ N L +++ I+QL Sbjct: 144 MVVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRARNQQDLQNIYDTINQL 196 >gi|146327011|gb|AAI41812.1| Matrilin 4 [Homo sapiens] Length = 540 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 66/169 (39%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ + I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARASGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 331 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 388 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 389 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 433 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 434 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 483 >gi|3927992|emb|CAA07569.1| matrilin-4 [Homo sapiens] gi|124297520|gb|AAI31764.1| Matrilin 4 [Homo sapiens] gi|153217472|gb|AAI51220.1| Matrilin 4 [Homo sapiens] Length = 581 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 66/169 (39%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ + I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARASGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 372 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 429 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 430 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 474 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 475 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 524 >gi|22760136|dbj|BAC11081.1| unnamed protein product [Homo sapiens] Length = 488 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 66/169 (39%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 5 LMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 63 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 64 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 108 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ + I+I + + + G L+ S P E+ + V + LI F Sbjct: 109 ARASGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 154 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 279 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 336 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 337 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 381 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 382 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 431 >gi|301606773|ref|XP_002932992.1| PREDICTED: integrin alpha-11-like [Xenopus (Silurana) tropicalis] Length = 1188 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 55/182 (30%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + + +G++ Y V V V T++ A Sbjct: 188 LISILQKFYIAPGQIQVGVLQYGETVVHEFYL-NNYRSVTDVVEAAKRIEQRGGTETRTA 246 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + +K +I +TDGE+++ I + + Sbjct: 247 L-------------GIEKAVTEAFQRGGRKGAKKVMIVITDGESHDSPDLQRVI---ESS 290 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 +++ I +++ +K S P+ + +NV + +L + + + Sbjct: 291 EKDNITRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGE 350 Query: 408 LM 409 + Sbjct: 351 RI 352 >gi|55380211|ref|YP_138060.1| calcium-binding protein-like [Haloarcula marismortui ATCC 43049] gi|55232936|gb|AAV48354.1| calcium-binding protein-like [Haloarcula marismortui ATCC 43049] Length = 1562 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 62/221 (28%), Gaps = 55/221 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+D SGSM ++ L+N ++ + + Sbjct: 501 DVTLVMDTSGSMSSSVK----------------------LRNTAGQ-----RFVAGLLDV 533 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ + + + + + TD + A S+ Sbjct: 534 DRAAVVDFDSSAYVAQDLTSDFGAANSTLDNLGSG---GGTDIGSGLSTANSQFASNSND 590 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + +I LTDG N S T A + T+ + Sbjct: 591 SRA----------------QVMILLTDGRGNGGISEAQT------AANQNTTVYTVGFDN 628 Query: 372 SPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 + + L+ + + V + L +VF I++ Sbjct: 629 ANRDK--LRDIANITDGEFNYVTDRSELPNVFSRIAENTTE 667 >gi|312196190|ref|YP_004016251.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311227526|gb|ADP80381.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 319 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/241 (18%), Positives = 81/241 (33%), Gaps = 52/241 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R I L +D+S SM D +++AA K F+D + Sbjct: 84 RERATIVLAIDVSNSMAA---------------TDISPSRLAAAKQGAQAFVDQL----- 123 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILT 306 + +GL+ ++ + P+ R V ++ L L P T + Y L+ Sbjct: 124 -PPKINLGLVSFSGTAAVLVPPT----TDRDAVKAGINGLQLGPATAIGEGI---YAGLS 175 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + Q V I+ L+DGE + N + AK+ I + T Sbjct: 176 AIN------TVSSQFVNSGQAVPPAAIVLLSDGETTRGRPNNQAAQ---AAKDAHIPVST 226 Query: 367 ISINASP-----NGQ---------RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I+ GQ L + + H+ + D L +++ + + +R Sbjct: 227 IAYGTPNGTLDVGGQLIPVPVNEPALSQIAEQTGGSHHRATSGDELTSIYKGLGSSIGYR 286 Query: 413 K 413 K Sbjct: 287 K 287 >gi|242247116|ref|NP_081039.2| collagen alpha-4(VI) chain precursor [Mus musculus] gi|189082905|sp|A2AX52|CO6A4_MOUSE RecName: Full=Collagen alpha-4(VI) chain; Flags: Precursor Length = 2309 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 53/144 (36%), Gaps = 22/144 (15%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMKQAY 302 L + V +GL+ Y+ + ++ V + +L + T + A++ Sbjct: 265 LDVRSDQVQVGLVQYSDNIYPAFPLK--QSSLKSAVLDRIRNLPYSMGGTSTGSALEF-- 320 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + T K + ++ +TDGE+++ +V D+ K + + Sbjct: 321 ------IRANSLTEMSGSRAKDG---VPQIVVLVTDGESSDEVQDVA-----DQLKRDGV 366 Query: 363 KIVTISINASPNGQRLLKTCVSSP 386 + + IN + L+ + P Sbjct: 367 FVFVVGINIQDVQE--LQKIANEP 388 >gi|123718334|emb|CAJ77150.1| procollagen type VI alpha 4 [Mus musculus] Length = 762 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 53/144 (36%), Gaps = 22/144 (15%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMKQAY 302 L + V +GL+ Y+ + ++ V + +L + T + A++ Sbjct: 265 LDVRSDQVQVGLVQYSDNIYPAFPLK--QSSLKSAVLDRIRNLPYSMGGTSTGSALEF-- 320 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + T K + ++ +TDGE+++ +V D+ K + + Sbjct: 321 ------IRANSLTEMSGSRAKDG---VPQIVVLVTDGESSDEVQDVA-----DQLKRDGV 366 Query: 363 KIVTISINASPNGQRLLKTCVSSP 386 + + IN + L+ + P Sbjct: 367 FVFVVGINIQDVQE--LQKIANEP 388 >gi|307720884|ref|YP_003892024.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978977|gb|ADN09012.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 304 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 50/274 (18%), Positives = 101/274 (36%), Gaps = 40/274 (14%) Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 +S FY + + + ++ I +S +I+ + Sbjct: 26 FKQEKESYYMPHFYQLLPHANKKHYVREILKWTMFTCMIFALSDPIITKKIKAVKSNAVD 85 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +D S SM + K++++ +K + F++S + Sbjct: 86 IVLALDTSDSMS----------TYGFNEKKYKQSRLNVVKEVVQNFINS-------RVKD 128 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + T S+ E + V ++ K T + + Q+L + K +S Sbjct: 129 RIGLVVFGTTAGIASPLSFDKEAQKNIVGNINVGVLGKSTALIDGIVSSIQLLKNSKSKS 188 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 K II L+DGE++ K + AK+ IKI TI+I Sbjct: 189 ------------------KIIILLSDGEDSASKIPLE--FALKLAKKYNIKIYTITI--D 226 Query: 373 PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 + ++K + + ++ V N + L+ V+++I Sbjct: 227 KSYSDMMKVIANKNGAKNFEVQNKEDLVKVYKSI 260 >gi|19031199|gb|AAL17973.1| proximal thread matrix protein 1b [Mytilus edulis] Length = 444 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 38/219 (17%), Positives = 76/219 (34%), Gaps = 34/219 (15%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VVD +G A D ++ +K+ + +D + + Sbjct: 236 VVDCAGHADIAFVFD-----ASSSINANNPNNYGLMKDFMKDIVDRFNKTGP--DGTQFA 288 Query: 256 LIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 ++ + R K +++ + + S I+ T ++ A + ++ Sbjct: 289 VVTFADRATKQFGLKDYSSKAEIKGAIDKVTPS-IIGQTAIGDGLENARLEVFPNRNGG- 346 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 QK +I LTDG+NN KS + + ++ + IV I + Sbjct: 347 -----------GREEVQKVVILLTDGQNNGHKSPEHESSL---LRKEGVVIVAIGV---- 388 Query: 374 NGQRLLK----TCVSSPEYHYNVVNADSLIHVFQNISQL 408 G LK SS EY + + + L + +N+ +L Sbjct: 389 -GTGFLKSELINIASSEEYVFTTSSFNKLSKIMENVVKL 426 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 57/161 (35%), Gaps = 26/161 (16%) Query: 224 KRTKMAALKNALLLFLDSIDLL-SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + +KN + +DS + + + G++ ++ V+ K ++ + + Sbjct: 60 NKENYQMMKNFVKELVDSFTTVGVNGRNGSQFGVVQFSQGVKTAFPL--NKFKTKEDIKK 117 Query: 283 DMDSLI---LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + ++ T+ ++ + S + + QK +I +TDG Sbjct: 118 GIQDMVPRNGGQTEIGTGLQHVRENSFSGAEGG------------GNPDKQKIVILMTDG 165 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 ++N + K K + ++ I I GQ +K Sbjct: 166 KSNAGAPPQHEAH---KLKAEGVTVIAIGI-----GQGFVK 198 >gi|90422080|ref|YP_530450.1| hypothetical protein RPC_0556 [Rhodopseudomonas palustris BisB18] gi|90104094|gb|ABD86131.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 453 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 54/372 (14%), Positives = 112/372 (30%), Gaps = 65/372 (17%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 + R F + N +I+FA S++ L+ IG I ++ M+SA +AA + Sbjct: 1 MAQRLSGLFDSFHHDRRGNIAILFAFSLIPLLVAIGCAIDYARATQIRSKMQSAADAASV 60 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 SK G ++ DA ++ + + +++ V Sbjct: 61 GSVSKASPAFLAAGSMT---TDGPIAVGSTDATNIFNGNMA-----SQSGYTLSKLDAAV 112 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 S ++T +T+ ++ V T++ K I Sbjct: 113 TKSGATLT-----------STVTFSASVATTF------------LTIIGKTALAIGGTSV 149 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR----------------- 225 L++D S SM + + P DV + K Sbjct: 150 STSSMPVYIDFYLLLDNSPSMG--VGATPTDVATMVDNTSDKCAFACHDVNDEHNYYELA 207 Query: 226 ------TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE----PSWGTEK 275 T++ L++A +D+ + M + + + + Sbjct: 208 KTLGVKTRIDVLRDATQQLMDTAAATATYPNQFRMAIYDFGASAQSAALRRLFALSSSLS 267 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + +D + +K + +Y L G P QK+++F Sbjct: 268 SAKTAAGAIDLMTVKGQNDNDDRDTSYSKLLPA----IDKQITAAGAGTSDAP-QKYLLF 322 Query: 336 LTDGENNNFKSN 347 ++DG + + Sbjct: 323 VSDGVADETNAG 334 >gi|319786316|ref|YP_004145791.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] gi|317464828|gb|ADV26560.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] Length = 340 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 39/246 (15%), Positives = 77/246 (31%), Gaps = 61/246 (24%) Query: 183 IEMGERPIFLIELVVDLSGSMH---CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + ++ + L VDLSGSM + P D ++ A K + FL Sbjct: 89 VAPAQQAR-QMLLAVDLSGSMSEVDMMLGMQPVD-------------RLTAAKAVIADFL 134 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 +GL+ + R + VR+ + + L + T A+ Sbjct: 135 QR-RSGD------RVGLLVFGQRAYMLTPLTLDLSAVREQLRDTVAGLAGRETALGDAIG 187 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + L + + Q+ ++ LTDG N A++ Sbjct: 188 LAVKRLRTQPEG------------------QRVLVLLTDGVNTTGVLQPLKAAELAAAEQ 229 Query: 360 NFIKIVTISINASPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHVF 402 +++ TI+ G L L+ + + +A+ L ++ Sbjct: 230 --VRVYTIAFGGDGGGFSLFGVQVPVQGDEVDEATLRKVAEITGGRFFRAHDANQLAGIY 287 Query: 403 QNISQL 408 + +L Sbjct: 288 AELERL 293 >gi|303235701|ref|ZP_07322308.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484148|gb|EFL47136.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 322 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 79/264 (29%), Gaps = 48/264 (18%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 L + L +++ I+ G+ I L VD+S SM Sbjct: 53 IHLPMLLRCLVYALVVIVLARPQTTTPIDNGQVEGINIMLAVDISASMLSD--------- 103 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 D ++ K+ F+ S +GL + + T+ Sbjct: 104 ------DVDPNRIEVAKDVAKEFIAS----RPNDN---IGLTIFAGEAFTQCPMT--TDH 148 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 +L + ++ Q T+ + VK S K +I Sbjct: 149 AS-------LLNLLAGIRADLSVNHLIQDGTAIGMGLANAVGRLKDVKEGS----KVVIL 197 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG-----------QRLLKTCVS 384 LTDG NN + T A++ +++ TI + + L + Sbjct: 198 LTDGSNNVGDISPLTAA--SIARKFGVRVYTIGLGTDGKDIQGRPVGEIDYKTLQDIAMQ 255 Query: 385 SPEYHYNVVNADSLIHVFQNISQL 408 + Y + L ++++I +L Sbjct: 256 TDGEFYRAQSRAELSQIYKDIDKL 279 >gi|251791982|ref|YP_003006702.1| TadG [Aggregatibacter aphrophilus NJ8700] gi|247533369|gb|ACS96615.1| TadG [Aggregatibacter aphrophilus NJ8700] Length = 592 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 52/367 (14%), Positives = 114/367 (31%), Gaps = 59/367 (16%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 ++ + K+ EK ++++ AL L LI F + + + A A Sbjct: 5 NMTMSLFYMLKRFYHDEKGVYAVMTALLAFPLLFLIAFAVDGSGILLDRARLAQATEQAA 64 Query: 62 LAGASKM------VSNLSRLGDRFESISNH-------AKRALIDDAKRFIKNHIKESLSG 108 L ++ SNLS + E I N R A + + ++ + Sbjct: 65 LLLTTENNQYRADKSNLSNVQVTDEEIKNAKGSFKTAQDRKKGAQALKRNQELVQGMVKL 124 Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 Y + + + + + + N ++ + D + +F Sbjct: 125 YLRSYDKEQKSSSPITIPKDFVAECRTQTSTRTNGESSSVACLVEGDVKRKF-----WLP 179 Query: 169 YNQKIVSFIPALLRIEMGERPI---------FLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 ++ + S + I G+ + LV D+S SM P Sbjct: 180 WSYTLTSNNRNTVDINSGKSYAVKEKDILIPIDLMLVNDISTSM-----------FKPPK 228 Query: 220 CQDKKRTKMAALK---NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + K+ +LK A+ L + ++ + +G+ + ++ + ++ Sbjct: 229 DDPQGPKKIDSLKTVVKAVANILIPDEPPKNISKYNRIGITSFGLGAQQAGKQD--ANQI 286 Query: 277 RQYVTRDMDSLILKPTDSTPAMK----------QAYQILTSDKKRSFFTNFFRQGVKIPS 326 ++ V L + D T +K AY +L K F + P Sbjct: 287 KKCV------LPFRGVDKTVNIKVKYDGKVYPGSAYNVLKEQHKLKLFEDSSHTQEFNPQ 340 Query: 327 LPFQKFI 333 +++ Sbjct: 341 KTTVRYM 347 >gi|20093632|ref|NP_613479.1| Mg-chelatase subunit ChlI /Chld [Methanopyrus kandleri AV19] gi|19886501|gb|AAM01409.1| Mg-chelatase subunit ChlI and Chld (MoxR-like ATPase and vWF domain) [Methanopyrus kandleri AV19] Length = 818 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 70/221 (31%), Gaps = 47/221 (21%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + E I V+D SGSM D D + Sbjct: 627 SFDREEEVCLDIVYVIDTSGSMS----GDRIDAAKRAAIALAHFS--------------- 667 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + +G++G+ T+ E ++ T V + +T+ M TD A++ Sbjct: 668 VKAGD------RVGIVGFNTKAEIVVDI---TSDVEEIITKVMSLKPGGATDIGDAIRVG 718 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD--KAKE 359 ++ + + +I LTDG + + T + + A Sbjct: 719 TELFRRCGRPDRDWH----------------MILLTDGVPTKGEPDPETKALSEATAASR 762 Query: 360 NFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLI 399 + I TI I G RL++ S +++ + + L Sbjct: 763 MGVTISTIGIKLPEEGIRLIEHIAGISGGRSHHITDPEELT 803 >gi|47208832|emb|CAF90336.1| unnamed protein product [Tetraodon nigroviridis] Length = 443 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 67/189 (35%), Gaps = 31/189 (16%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVT 281 + ++ +K L +D L + + ++ Y + V+ + +++ ++ Sbjct: 234 RPSEFEKVKIFLADM---VDTLDVGADATRVAVVNYASTVKTEFLLKDHFNKPNLKKAIS 290 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 R + T + A+K A +++ G + K I +TDG Sbjct: 291 RIEP--LATGTMTGLAIKTAVSEAFTEQ----------SGARPRPRNIAKVAIIVTDG-- 336 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSP--EYHYNVVN---A 395 + V + A+ + ++I + + A +L+ S P ++ + V Sbjct: 337 -RPQDQVEEVS--AAARASGVEIYAVGVDRADMRSLQLM---ASVPLEDHVFYVETYGVI 390 Query: 396 DSLIHVFQN 404 + L F+ Sbjct: 391 EKLTSKFRE 399 >gi|297583258|ref|YP_003699038.1| von Willebrand factor type A [Bacillus selenitireducens MLS10] gi|297141715|gb|ADH98472.1| von Willebrand factor type A [Bacillus selenitireducens MLS10] Length = 978 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 37/248 (14%), Positives = 78/248 (31%), Gaps = 50/248 (20%) Query: 170 NQKIVSFIPALLRIEMGERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 N + ++ + + V+D SG+ +N D D + Sbjct: 39 NSHTGASETGQFGVDYCQPGAGLDLMFVLDNSGT----VNLDDTDSIRSSTVS------- 87 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 D ++ G+I + T + E S + ++ D Sbjct: 88 --------------DYAENMLPGDRGGIISFNTEADMLQEMSDNRYDLLDALSALPD--P 131 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD + M+ A + K + ++ ++ +TDG + + Sbjct: 132 SGGTDLSQGMRAANEQFVQTKGAN-----------------KQIMVLITDGA--DTINLA 172 Query: 349 NTIKICDKAKENFIKIVTISINASPNG--QRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 +A+ N I I T+ + + G + LL+ + + V NA + V Q+I Sbjct: 173 EVYNQVREARMNGITIFTLGLGSLATGLDEALLQDIADQTRGQYRQVPNATVIESVLQDI 232 Query: 406 SQLMVHRK 413 + + Sbjct: 233 RSSLEGMR 240 >gi|194291599|ref|YP_002007506.1| hypothetical protein RALTA_B0833 [Cupriavidus taiwanensis LMG 19424] gi|193225503|emb|CAQ71449.1| conserved hypothetical protein, Von Willebrand factor type A domain (vwa), putative exported protein [Cupriavidus taiwanensis LMG 19424] Length = 356 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 42/258 (16%), Positives = 85/258 (32%), Gaps = 60/258 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+DLSGSM QD K +++ A + A L++ Sbjct: 97 TVLLVMDLSGSM---------------RAQDVKPSRLRAAQQAATTLLEA------QPAG 135 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD--K 309 V +G++ P+ E V + R T M A L + Sbjct: 136 VSVGVVAMAGTAAVAQAPTRAREAVATAIERLQ---PQGGTALGNGMLIALTTLLPELTP 192 Query: 310 KRSFFTNFFRQGVKIPSLPFQ-----------KF----IIFLTDGENNNFKSNVNTIKIC 354 N + P + I+ +DGE+N + + + Sbjct: 193 DAERLMNDDTPPPRKPRALANPPADTEPVKPGSYTSGAIVLFSDGESNAGPAALRAAQ-- 250 Query: 355 DKAKENFIKIVTISINASPN--------------GQRLLKTCV-SSPEYHYNVVNADSLI 399 A E+ ++I T+ + +++LK ++ ++ + +A L Sbjct: 251 -LAAEHGVRIYTVGVGTPEGVVLSVDGWSARVRLDEKVLKEVADATSAEYFRLEDAAELK 309 Query: 400 HVFQNI-SQLMVHRKYSV 416 V++ + ++L ++ V Sbjct: 310 RVYRALNARLAFDKRSQV 327 >gi|188580137|ref|YP_001923582.1| hypothetical protein Mpop_0869 [Methylobacterium populi BJ001] gi|179343635|gb|ACB79047.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 477 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 52/372 (13%), Positives = 105/372 (28%), Gaps = 66/372 (17%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + + +I+FAL+++ L L+G I K +++A +AA+LAG ++ Sbjct: 18 LASDRGGSINIMFALALLPTLGLVGLGIDYGMAITSKTRLDNAADAAVLAGVVTAKEYIA 77 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++ + K F N K + V+ SR+ +T Sbjct: 78 SNAKQGDATAAGLTAGRNQATKAFAINTGKVPFA-------------TVSVSRLDVTRSG 124 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + T+ Y + ++ + + + Sbjct: 125 Q----TLTATVIYTATIQNTFG------------KILGLSSTTFTNTITASADLASYLDF 168 Query: 194 ELVVDLSGSMH---------------------CAMNSDPEDVNSAPICQDKKRTKMAALK 232 L+VD+SGSM C K + + A+ Sbjct: 169 YLMVDVSGSMGLPTAAADAEKLASITKEDQGNCQFACHFPGRKGWNNAAGKIQLRSDAVN 228 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV----------RQYVTR 282 NA+ L V +G + R+ S T + + + Sbjct: 229 NAVCELLKRAAT-PVVPNQYRIGFYPFINRLATLSPLSDTTTSMTALRTAAQCDKTWPLA 287 Query: 283 DMDSLILKPT-----DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + L T ++ + G + + F+ +T Sbjct: 288 FTNLLDTGSTQLFTGNNPTTGTGSGGTHFEKALPQMKATIQPYGDGSSTTNSKPFVFLIT 347 Query: 338 DGENNNFKSNVN 349 DG N+ + N Sbjct: 348 DGMQNSQSYSTN 359 >gi|19033105|gb|AAL83537.1|AF414454_1 proximal thread matrix protein 1 variant a [Mytilus edulis] Length = 441 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 38/219 (17%), Positives = 76/219 (34%), Gaps = 34/219 (15%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VVD +G A D ++ +K+ + +D + + Sbjct: 233 VVDCAGHADIAFVFD-----ASSSINANNPNNYGLMKDFMKDIVDRFNKTGP--DGTQFA 285 Query: 256 LIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 ++ + R K +++ + + S I+ T ++ A + ++ Sbjct: 286 VVTFADRATKQFGLKDYSSKAEIKGAIDKVTPS-IIGQTAIGDGLENARLEVFPNRNGG- 343 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 QK +I LTDG+NN KS + + ++ + IV I + Sbjct: 344 -----------GREEVQKVVILLTDGQNNGHKSPEHESSL---LRKEGVVIVAIGV---- 385 Query: 374 NGQRLLK----TCVSSPEYHYNVVNADSLIHVFQNISQL 408 G LK SS EY + + + L + +N+ +L Sbjct: 386 -GTGFLKSELINIASSEEYVFTTSSFNKLSKIMENVVKL 423 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 57/161 (35%), Gaps = 26/161 (16%) Query: 224 KRTKMAALKNALLLFLDSIDLL-SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + +KN + +DS + + + G++ ++ V+ K ++ + + Sbjct: 57 NKENYQMMKNFVKELVDSFTTVGVNGRNGSQFGVVQFSQGVKTAFPL--NKFKTKEDIKK 114 Query: 283 DMDSLI---LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + ++ T+ ++ + S + + QK +I +TDG Sbjct: 115 GIQDMVPRNGGQTEIGTGLQHVRENSFSGAEGG------------GNPDKQKIVILMTDG 162 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 ++N + K K + ++ I I GQ +K Sbjct: 163 KSNAGAPPQHEAH---KLKAEGVTVIAIGI-----GQGFVK 195 >gi|198274642|ref|ZP_03207174.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] gi|198272089|gb|EDY96358.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] Length = 332 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/314 (14%), Positives = 87/314 (27%), Gaps = 73/314 (23%) Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 ++ + T + + + Y F+ +L +++ Sbjct: 23 VMRRKKAEPTMQVSTTRMYMKAPRSWKI-----YLLHAPFVLRILTFVMIVLVLARPQTT 77 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + E I L VD+S SM D K ++ A K F++ Sbjct: 78 DNWQNTEIEGIDIMLAVDVSTSMLAE---------------DLKPNRIEAAKQVAAEFIN 122 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-----RQYVTRDMDSLILKPTDST 295 +GL + + + +I T Sbjct: 123 ----GRPNDN---IGLTIFAGEAFTQCPLTVDHGVLLNLFQSVSCDMVQRGMIEDGTALG 175 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + A L K +S K +I LTDG NN + T + Sbjct: 176 MGLANAVSRLKDSKAKS------------------KVVILLTDGVNNRGDISPLTAA--E 215 Query: 356 KAKENFIKIVTISI----NAS-----------------PNGQRLLKTCVSSPEYHYNVVN 394 AK+ I++ TI + A + Q + + ++ ++ + Sbjct: 216 IAKQFGIRVYTIGVGTNGTAPYPMQTYAGVQYVQMPVEIDEQTMSQIAGTTNGNYFRATS 275 Query: 395 ADSLIHVFQNISQL 408 L V++ I +L Sbjct: 276 NTKLKEVYREIDKL 289 >gi|281420095|ref|ZP_06251094.1| BatA protein [Prevotella copri DSM 18205] gi|281405895|gb|EFB36575.1| BatA protein [Prevotella copri DSM 18205] Length = 332 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 71/244 (29%), Gaps = 68/244 (27%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D + +M A K+ F+ Sbjct: 88 IDIMLAMDVSTSMLAE---------------DLRPNRMEAAKDVATEFI----SGRPNDN 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-----SLILKPTDSTPAMKQAYQIL 305 +GL + + + + + LI T + A L Sbjct: 129 ---IGLTIFAGEAFTQCPMTTDHASLLRLLQATRTDIAARGLIDDGTAVGMGLANAVSRL 185 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 K +S K +I LTDG NN + + T + AK I++ Sbjct: 186 KDSKSKS------------------KVVILLTDGSNNMGEISPMTAA--EIAKSYGIRVY 225 Query: 366 TISI---------NASPNG------------QRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 TI + G + L ++ Y N + L ++++ Sbjct: 226 TIGVGTNKVAPYPMPVAGGVQYVNIPVEIDTKTLSDIAQTTDGNFYRATNNNELKKIYRD 285 Query: 405 ISQL 408 I +L Sbjct: 286 IDKL 289 >gi|22760140|dbj|BAC11083.1| unnamed protein product [Homo sapiens] Length = 540 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGLQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 43.3 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 331 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 388 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 389 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 433 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 434 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRSSI 483 >gi|309791336|ref|ZP_07685859.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226646|gb|EFO80351.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 853 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 69/241 (28%), Gaps = 46/241 (19%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 L N + +P L+ + + L++D S SM + Sbjct: 367 SFTLGSYQNTPLADVLPVLMEPPPRPQRSDVALLLIMDRSASMLASFGV----------- 415 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV--EKNIEPSWGTEKVRQ 278 +K K A L +S+ +GL+ + T + G V Q Sbjct: 416 -----SKFDMAKEAAQLATESLQPED------RIGLLAFDTETLWVVPFQLISGGLSVAQ 464 Query: 279 YVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + SL T A++ L + + LT Sbjct: 465 -IQEQIASLPSGGGTRIERALEVGLPALAEQPTKVRHA------------------VLLT 505 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG + N ++ + A+ I + TI+I + L + +Y Sbjct: 506 DG-RSFMNDNALYQRLVETARSQQITLSTIAIGLDSDTALLKQLAAWGGGRYYYADQPAD 564 Query: 398 L 398 + Sbjct: 565 I 565 >gi|242076422|ref|XP_002448147.1| hypothetical protein SORBIDRAFT_06g022130 [Sorghum bicolor] gi|241939330|gb|EES12475.1| hypothetical protein SORBIDRAFT_06g022130 [Sorghum bicolor] Length = 697 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 46/316 (14%), Positives = 96/316 (30%), Gaps = 66/316 (20%) Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNT---IFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 +E + + + N SN T Y S L + L++ + Sbjct: 182 SEPADYNDDEPLQHIEAFENLNSGSNKTAEISSYPESQAVSQSSCLDGFDILIHVKAPTS 241 Query: 173 IVSFIPALL----RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + L + R + V+D+SGSM TK+ Sbjct: 242 SSNDVTGSLVNESSMRSSRRVPIDLVTVLDVSGSMAG--------------------TKL 281 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL- 287 A LK A+ + + + +I +++ + + RQ + + SL Sbjct: 282 ALLKQAMGFVIQHLRPSD------RLSVIAFSSTARRLFPLQRMSHHGRQQALQAISSLG 335 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ A+K+A +++ ++ + II L+DG++ N Sbjct: 336 AGGGTNIADALKKAVKVIEDRNYKNSVCS----------------IILLSDGQDTF---N 376 Query: 348 VNTIKICDKAKENF------------IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 +++ A + + T A + L +S + + Sbjct: 377 ISSNFQGTSAGRRSLVPPSILNELHMVPLHTFGFGADHDSDTLHSISEASGGTFSFIEDE 436 Query: 396 DSLIHVFQN-ISQLMV 410 + F I L+ Sbjct: 437 GVMQDAFAQCIGGLLS 452 >gi|212693197|ref|ZP_03301325.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|237709939|ref|ZP_04540420.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725394|ref|ZP_04555875.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753591|ref|ZP_06088946.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664302|gb|EEB24874.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|229436081|gb|EEO46158.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456032|gb|EEO61753.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235305|gb|EEZ20829.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 332 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 48/281 (17%), Positives = 79/281 (28%), Gaps = 68/281 (24%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 Y F+ + I++ + E I L VD+S SM Sbjct: 51 YLLHAPFVLRTVAIIMVILILARPQTTDNWQNTEIEGIDIMLAVDVSTSMLAE------- 103 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D K ++ A K F++ +GL + + Sbjct: 104 --------DLKPNRLEAAKQVASEFIN----GRPNDN---IGLTIFAGESFTQCPLTVDH 148 Query: 274 EKVRQYVTRD-----MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + LI T + A L K +S Sbjct: 149 GVLLNLFNSIKGDIAQRGLIEDGTAIGMGIANAVTRLKDSKAKS---------------- 192 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS-------------------I 369 K II LTDG NN + T + AK+ I+I TI + Sbjct: 193 --KVIILLTDGSNNRGDISPLTAA--EIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYV 248 Query: 370 NAS--PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 N + + L + ++ ++ + L V+Q I +L Sbjct: 249 NVPVEIDEKTLTEIAGTTNGNYFRATSNSKLKEVYQEIDKL 289 >gi|124002443|ref|ZP_01687296.1| OmpA family protein [Microscilla marina ATCC 23134] gi|123992272|gb|EAY31640.1| OmpA family protein [Microscilla marina ATCC 23134] Length = 756 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 61/408 (14%), Positives = 127/408 (31%), Gaps = 67/408 (16%) Query: 17 SEKANFSIIFALSVMSFL------LLIGFLIYVLDWHYKKNSMESANNAAILAGASKM-- 68 + K NF + V LL + + K+ + ++ L G+ ++ Sbjct: 222 NRKVNFVKLPIKVVPVTFKGKKDILLGSKTLIRWNAKGMKSVIYRG-DSLPLVGSMEVAP 280 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKR-FIKNHIKESL--SGYSAVFYNTEIQNIVNSS 125 + N + + + K + +R FIKN K++ G F Sbjct: 281 LRNTTYKFLIDDGEKSFVKSFRVRVIRRKFIKNVAKKTTLKKGQKLDFDIFATDRSKFPK 340 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 + + + + +S N I + SY +F ++ ++ + I Sbjct: 341 EVKLYVLVTD---TSGNFISHLAPPYVSYRESRKFFRKIVESVKGRRYAINNFKVREIHE 397 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + I LV+D SGSM + E I K++ +K Sbjct: 398 MISKPYDISLVMDYSGSMAGNIKKLEEATRKF-ILTKHPNDKISVVK------------- 443 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + R+E + + + + + + L Sbjct: 444 -------------FDERLETELRLT--AQGSKTDCVKFDGLTRYGGS----------TAL 478 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-----KSNVNTIKICDKAKEN 360 + + Q K ++ TDGE N+ K ++ KA+E Sbjct: 479 YAGADEGLESLKNAQN--------NKVMLLFTDGEENSSLQYFGKRAFRASEVVKKAREK 530 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I++ TI+ N + L + + Y + N D + V++ + ++ Sbjct: 531 GIRVFTIAYGTGVNNKTLNALSMLTDGKTYFIENPDEIKQVYEELPRI 578 >gi|194335401|ref|YP_002017195.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] gi|194307878|gb|ACF42578.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] Length = 336 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 74/247 (29%), Gaps = 59/247 (23%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 L R E + L +D+S SM + D +++ A + F+ Sbjct: 88 LFRQSEAESRGIDVILALDISESM---LQKDVGGT-----------SRLDAAREVSRNFV 133 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAM 298 + + +GL+ + + + E + + R +I T A+ Sbjct: 134 --LRRSND-----RIGLVVFRGKGYTQCPLTLDHEVLAMLLDRLSPGVIQDDGTAIGTAI 186 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A L S K +I +TDGENN + T A Sbjct: 187 LIAVNRL------------------KASESLHKVLILVTDGENNAGEVGPGTAA--SIAA 226 Query: 359 ENFIKIVTI--SINA-------SPNGQRLLKT--------CVSSPEYHYNVVNADSLIHV 401 + ++I I R ++ ++ ++ V + + Sbjct: 227 RSGVRIYVINAGFKVVEDRIDPPEESGRYIQKDEESLQGIARTTGGGYFRVEDPAAFDQT 286 Query: 402 FQNISQL 408 ++I +L Sbjct: 287 IRSIDRL 293 >gi|312879450|ref|ZP_07739250.1| von Willebrand factor type A [Aminomonas paucivorans DSM 12260] gi|310782741|gb|EFQ23139.1| von Willebrand factor type A [Aminomonas paucivorans DSM 12260] Length = 813 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 48/313 (15%), Positives = 103/313 (32%), Gaps = 43/313 (13%) Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + S++ + +T + + S + A ++ S +T V S+ + Sbjct: 232 GWLNHSIAAFYVTSTDTAQKRMYGKSCWQVLLQAPSQDGVSGDTTVKPSRVQYSFSKPVT 291 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 LN ++ A L+I + ++V+D SGSM + Sbjct: 292 TSGDYLNAVTPGTHLAAAQANLKILWLAEATMVYQIVLDRSGSMGTNPD----------- 340 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK---- 275 + T ++ K A +DS+ ++VY+G++ + + + Sbjct: 341 -KPDDPTPLSYAKTAACNLVDSL------PKNVYVGIVQFDDSTSQVYPITLIASNDAAA 393 Query: 276 --VRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 R ++ L T A A + K S Sbjct: 394 AATRAAAKAAINGLTSGGSTAIYDAASYALSQFVAQKTAL-------------SADLLGV 440 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA--SPNGQRLLKTCVSSPEYHY 390 LTDGE+N+ +V ++ + + + ++T+ A L + + ++ Sbjct: 441 TYLLTDGEDNSSSKSVG--EVIGEYQAQKVPLITVGYGAGGQAGSFALTQLADGTGGQYF 498 Query: 391 -NVVNADSLIHVF 402 + V+ +L VF Sbjct: 499 ASPVDQAALQQVF 511 >gi|332667371|ref|YP_004450159.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332336185|gb|AEE53286.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 425 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 36/242 (14%), Positives = 77/242 (31%), Gaps = 49/242 (20%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + N + + +R + LV+D SGSM Sbjct: 20 PKQNFYLYLELQGSEAPASADRVPLNLSLVIDRSGSMAGD-------------------- 59 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT--RVEKNIEPSWGTEKVRQYVTRDM 284 K+A K A +D++ + ++ Y V P +++RQ + Sbjct: 60 KIAYAKKAAQFIVDNLSPED------RVSIVQYDDIVEVLSPSAPVLNKQELRQRIALME 113 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T+ + M Y+ Q + F ++ L+DG N+ Sbjct: 114 ARNM---TNLSGGMLAGYE----------------QVERTKQARFVNRVLLLSDGLANHG 154 Query: 345 KSNVNTIK--ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ ++ + +K + I + T + A N + +Y + + D + +F Sbjct: 155 ITDPTVLQQMVQEKFRNAGIAVSTFGVGADFNELLMTSLSEYGGANYYFIESPDKIPGIF 214 Query: 403 QN 404 Sbjct: 215 AE 216 >gi|158257430|dbj|BAF84688.1| unnamed protein product [Homo sapiens] Length = 914 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 80/240 (33%), Gaps = 69/240 (28%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL + + ++ Sbjct: 309 LVLDKSGSMA-------------------TGNRLNRLNQAGQLFL--LQTVELGS---WV 344 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R + + + + T ++ A + K + Sbjct: 345 GMVTFDSAAHVQSELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSA-STVIRKKYPTD 403 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 + I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 404 GSE----------------IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVAL-GP 439 Query: 373 PNGQRL---------LKTCVSSPEYHYNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 Q L L+T S V N + LI F +S + R + KG Sbjct: 440 SAAQELEELSKMTGGLQTYASD-----QVQN-NGLIDAFGALSSGNGAVSQRSIQLESKG 493 >gi|90420796|ref|ZP_01228702.1| conserved hypothetical protein with von Willebrand factor domain [Aurantimonas manganoxydans SI85-9A1] gi|90335087|gb|EAS48848.1| conserved hypothetical protein with von Willebrand factor domain [Aurantimonas manganoxydans SI85-9A1] Length = 320 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 72/225 (32%), Gaps = 48/225 (21%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L +DLSGSM + + +++ A+K A+ Sbjct: 91 DLVLAIDLSGSME----------QEDFVLDGRTVSRLDAVK-AVARDFVRARAGD----- 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ E V + + + + + T + + A + L R Sbjct: 135 -RVGLVIFAETAYFAAPLTFDVEAVGRLIDQATIGISGRSTAISDGLGLALKRLARSDAR 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS--- 368 S + ++ L+DG NN A+ I++ TI+ Sbjct: 194 S------------------RVVVLLSDGVNNAGAVQPRDAG--SLAERLGIRVHTIALGP 233 Query: 369 --INASPNGQRL-----LKTCV-SSPEYHYNVVNADSLIHVFQNI 405 + P + L+ +S + V D L V + I Sbjct: 234 ADLETDPKSRDAVDTATLRAIAETSGGETFRVRTTDDLRQVARAI 278 >gi|218442094|ref|YP_002380423.1| von Willebrand factor A [Cyanothece sp. PCC 7424] gi|218174822|gb|ACK73555.1| von Willebrand factor type A [Cyanothece sp. PCC 7424] Length = 412 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 67/215 (31%), Gaps = 45/215 (20%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LV+D SGSM + +K A + + +++ Sbjct: 41 LNLCLVLDHSGSMAGK--------------------PLETVKQAAIELVKQLNVED---- 76 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + +I + R K + P+ G + + + + T +K Q + KK Sbjct: 77 --RLSIIAFDHR-AKVLVPNQGIDNLNTIIEQINSLKPAGGTAIDEGLKLGIQESANGKK 133 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 I LTDGEN + N +K+ A + I + T+ Sbjct: 134 DRVSQ-----------------IFLLTDGENEHG-DNERCLKLAHVASDYNITLNTLGFG 175 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 N L K S+ + D I F + Sbjct: 176 NHWNQDVLEKISDSAGGTLCYIETPDKAIEEFSRL 210 >gi|168698108|ref|ZP_02730385.1| protein containing a von Willebrand factor type A domain [Gemmata obscuriglobus UQM 2246] Length = 821 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 51/267 (19%), Positives = 85/267 (31%), Gaps = 59/267 (22%) Query: 111 AVFYNTEIQNIVNSS--RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 V+ T N V S +S+T L + +FY D L + + Q Sbjct: 193 NVYSPTHAVNTVRKSDKEVSVTFERKQALLDKDFQLFYG---HGDKDIGLSPLVYKPIQT 249 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + I + E +R + LV+D S SM KM Sbjct: 250 EDGYFMFLISPQVEAE-KKRVARDLVLVLDTSSSMSDI--------------------KM 288 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKV---RQYVTRD 283 K A+ L + G++ ++T V K + T+ + +++ Sbjct: 289 QQAKKAVKFCLSQLQPED------RFGVVRFSTTVTKFRSELVAANTDYLDLATKWIDGL 342 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 S T PA+ A PS PF ++F TDG+ Sbjct: 343 KTS---GGTAIWPALNDA----------------LAMRSSDPSRPFT--MVFFTDGQPTV 381 Query: 344 FKSNVNTIKICDKAKENF-IKIVTISI 369 ++N + I AK +I T + Sbjct: 382 DETNADKIVKNVLAKNTGNTRIFTFGV 408 >gi|120403735|ref|YP_953564.1| hypothetical protein Mvan_2751 [Mycobacterium vanbaalenii PYR-1] gi|166988604|sp|A1T8Q8|Y2751_MYCVP RecName: Full=UPF0353 protein Mvan_2751 gi|119956553|gb|ABM13558.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1] Length = 335 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 71/234 (30%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D ++ A + A F + + Sbjct: 99 VMLVIDVSQSM---------------RATDVAPNRLVAAQEAAKQF------ADQLTPGI 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ E + + + + T + + A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNREATKAAIDKLQLADR---TATGEGIFTALQAVATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICD-KAKENFIKIVTISI 369 + P I+ ++DG+ N AK+ + I T+S Sbjct: 194 ---------IGGGDEPPPARIVLMSDGKETVPSNPDNPKGAYTAARTAKDQGVPISTVSF 244 Query: 370 NAS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P +LK S + + + L VF N+ + + Sbjct: 245 GTPYGYVEINDQRQPVPVDDEMLKKIADLSGGDAFTASSLEQLKQVFTNLQEQI 298 >gi|226315301|ref|YP_002775197.1| hypothetical protein BBR47_57160 [Brevibacillus brevis NBRC 100599] gi|226098251|dbj|BAH46693.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 597 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 77/227 (33%), Gaps = 52/227 (22%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 LVVD+S SM DK + A+K + + + + Sbjct: 39 MDAVLVVDVSNSM---------------TQSDKNKVSNEAMKMFVDMTSIQANKV----- 78 Query: 251 DVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSD 308 G++ YT ++E+ +E+ + + +DSL TD + +A +IL + Sbjct: 79 ----GVVAYTDKIEREKALLEINSEEDKNDIKAFIDSLQKGAYTDIAVGVTEAVKILDAG 134 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN--------NNFKSNVNTIKICDKAKEN 360 + + I+ L DG N KS+ + +AK+ Sbjct: 135 RNPNNAP----------------IIVLLADGNNFLNKASSRTQAKSDQELQQAVKEAKDK 178 Query: 361 FIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + TI +NA N L + + + AD L + I Sbjct: 179 GYPVYTIGLNADGQLNRTTLQQIAAETNGKFFETSTADKLPQILSEI 225 >gi|156741348|ref|YP_001431477.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156232676|gb|ABU57459.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 972 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/241 (14%), Positives = 75/241 (31%), Gaps = 55/241 (22%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 ++ ++P + +V+D SGSM + +R ++ K A+ + Sbjct: 401 LDTKQQPDLALVMVIDRSGSMSELVGG-------------SRRNRLDLAKEAVYQASLGL 447 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQA 301 + +GL+ + + V + R + S I T+ P ++Q Sbjct: 448 TPID------QVGLVVFDDAANWVLPLQRLPSVV--EIERALGSFGIGGGTNIRPGIEQ- 498 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKE 359 + K +I LTDG E+N + + + Sbjct: 499 -----------------AAQALASADAKVKHVILLTDGIAESNYSD-------LIAQMRA 534 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ-----NISQLMVHRKY 414 + I T++I L+ + Y V + + +F + +V + Sbjct: 535 AGVTISTVAI-GEDANPNLVDVANAGGGRSYRVTRIEDVPRIFLQETIIAAGRDIVEERI 593 Query: 415 S 415 Sbjct: 594 E 594 >gi|308270598|emb|CBX27210.1| hypothetical protein N47_A12390 [uncultured Desulfobacterium sp.] Length = 312 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 63/190 (33%), Gaps = 45/190 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + I ++VD+S SM + +D K ++ + +L FL Sbjct: 65 TQKGVDIMVLVDVSPSM---------------MVEDIKPNRLERARREVLDFL------- 102 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQIL 305 +V + +GLI ++ + ++ ++ L + TD A++ Sbjct: 103 NVVQGDRIGLIAFSGVAFVQCPLTLDYGAIQMFLDELKPELIPVAGTDLGAAIEAGISSF 162 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 K I+ +TDGE+N K + KAKE +KI Sbjct: 163 DFKSVT------------------DKVIMLITDGEDNEGKG----LIAAQKAKEKGVKIF 200 Query: 366 TISINASPNG 375 + G Sbjct: 201 VFGMGDPSGG 210 >gi|300716700|ref|YP_003741503.1| von Willebrand factor, type A domain protein [Erwinia billingiae Eb661] gi|299062536|emb|CAX59653.1| von Willebrand factor, type A domain protein [Erwinia billingiae Eb661] Length = 325 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 66/182 (36%), Gaps = 38/182 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 +P+ + L++D+SGSM D T++ A++ ++ F+ Sbjct: 93 KPMRNMVLILDVSGSMEKNDAQDGI-------------TRLQAVQRSVRAFVAQ------ 133 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GL+ + + S + + + + +I + T A+ A ++L S Sbjct: 134 -RKTDRIGLVIFASSAWPFAPISEDKQALLARINQLAPGMIGQQTAIGDALGVAVKLLDS 192 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 R + I LTDG + + + A + +++ TI Sbjct: 193 SLDRDA----------------SRLAILLTDGNDTASQLSPALAAQ--LAASHHVQVHTI 234 Query: 368 SI 369 + Sbjct: 235 AF 236 >gi|295106190|emb|CBL03733.1| hypothetical protein [Gordonibacter pamelaeae 7-10-1-b] Length = 929 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 48/258 (18%), Positives = 80/258 (31%), Gaps = 63/258 (24%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV---------YMGLIGY--- 259 E V D+ T++AALK+A F+ S+ S ++ + + +IG+ Sbjct: 250 EGVTFYSFQTDQTVTRIAALKDAANSFVQSVRSNSLGEDGIAGTVDDVPHRIAVIGFASG 309 Query: 260 --TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD----STPAMKQAYQILTSDKKRSF 313 T + +G QY + D D S + L D Sbjct: 310 DNTELFVGSASYPYGQSAQSQYGSAFQDMTTEMGFDNALASIGQLSADGGTLVDDGLDMA 369 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGE-----NNNFKSNVNTIKICDKAKENFIKIVTIS 368 F + L + + LTDG N+ +N I + K + +I Sbjct: 370 NGVFSANPLVTGELR-NRVTVVLTDGAPGLYGNDRGVAN-EAISQASELKTAGSTVFSIG 427 Query: 369 I--------------------------------NASPN------GQRLLKTCVSSPEYHY 390 I N G R + SSP+Y+ Sbjct: 428 IFPGADASGDLPDQSAMGWGDNDSNRFMHLLSSNYPDASSMGSPGARFVDEEGSSPDYYL 487 Query: 391 NVVNADSLIHVFQNISQL 408 + ++ L +FQ+ISQ Sbjct: 488 SASDSAGLNSIFQSISQS 505 >gi|315080701|gb|EFT52677.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL078PA1] Length = 320 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 78/241 (32%), Gaps = 51/241 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + +D+S SM + D + ++++A K A FL + Sbjct: 86 RDRATVVVAIDVSRSM---------------VATDVEPSRLSAAKTAAKDFLGDL---PP 127 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 L+ + + + P+ V +T +L T + Y L Sbjct: 128 RFNV---SLVKFAASAQVVVPPTTDRAAVSTAITNLQ---VLPSTAIGEGI---YSSL-- 176 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + P I+ L+DG N + ++ K +A + + TI Sbjct: 177 -NALKLVPDDPKHPGQKPPAA----IVLLSDGATNVGRPSLEAAK---EAGRQHVPVYTI 228 Query: 368 SIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + N L +S ++ + L V+++I+Q + + K Sbjct: 229 AYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEK 288 Query: 414 Y 414 Sbjct: 289 V 289 >gi|314981157|gb|EFT25251.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA3] gi|315091980|gb|EFT63956.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA4] Length = 320 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 78/241 (32%), Gaps = 51/241 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + +D+S SM + D + ++++A K A FL + Sbjct: 86 RDRATVVVAIDVSRSM---------------VATDVEPSRLSAAKTAAKDFLGDL---PP 127 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 L+ + + + P+ V +T +L T + Y L Sbjct: 128 RFNV---SLVKFAASAQVVVPPTTDRAAVSTAITNLQ---VLPSTAIGEGI---YSSL-- 176 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + P I+ L+DG N + ++ K +A + + TI Sbjct: 177 -NALKLVPDDPKHPGQKPPAA----IVLLSDGATNVGRPSLEAAK---EAGRQHVPVYTI 228 Query: 368 SIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + N L +S ++ + L V+++I+Q + + K Sbjct: 229 AYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEK 288 Query: 414 Y 414 Sbjct: 289 V 289 >gi|314923047|gb|EFS86878.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL001PA1] gi|314966819|gb|EFT10918.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA2] gi|315093261|gb|EFT65237.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL060PA1] gi|315103481|gb|EFT75457.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA2] gi|327327645|gb|EGE69421.1| von Willebrand factor, type A [Propionibacterium acnes HL103PA1] Length = 320 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 78/241 (32%), Gaps = 51/241 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + +D+S SM + D + ++++A K A FL + Sbjct: 86 RDRATVVVAIDVSRSM---------------VATDVEPSRLSAAKTAAKDFLGDL---PP 127 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 L+ + + + P+ V +T +L T + Y L Sbjct: 128 RFNV---SLVKFAASAQVVVPPTTDRAAVSTAITNLQ---VLPSTAIGEGI---YSSL-- 176 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + P I+ L+DG N + ++ K +A + + TI Sbjct: 177 -NALKLVPDDPKHPGQKPPAA----IVLLSDGATNVGRPSLEAAK---EAGRQHVPVYTI 228 Query: 368 SIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + N L +S ++ + L V+++I+Q + + K Sbjct: 229 AYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEK 288 Query: 414 Y 414 Sbjct: 289 V 289 >gi|289425049|ref|ZP_06426826.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|289154027|gb|EFD02715.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|313764512|gb|EFS35876.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA1] gi|313772105|gb|EFS38071.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL074PA1] gi|313801850|gb|EFS43084.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA2] gi|313807459|gb|EFS45946.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA2] gi|313809969|gb|EFS47690.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA1] gi|313812999|gb|EFS50713.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA1] gi|313816053|gb|EFS53767.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA1] gi|313818504|gb|EFS56218.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA2] gi|313820270|gb|EFS57984.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA1] gi|313822921|gb|EFS60635.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA2] gi|313825147|gb|EFS62861.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA1] gi|313827718|gb|EFS65432.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA2] gi|313830298|gb|EFS68012.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL007PA1] gi|313833672|gb|EFS71386.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL056PA1] gi|314915506|gb|EFS79337.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA4] gi|314920024|gb|EFS83855.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA3] gi|314925157|gb|EFS88988.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA3] gi|314932038|gb|EFS95869.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL067PA1] gi|314955908|gb|EFT00308.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA1] gi|314958220|gb|EFT02323.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA1] gi|314960059|gb|EFT04161.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA2] gi|314962858|gb|EFT06958.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA1] gi|314967774|gb|EFT11873.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA1] gi|314973303|gb|EFT17399.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA1] gi|314975981|gb|EFT20076.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL045PA1] gi|314978482|gb|EFT22576.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA2] gi|314988184|gb|EFT32275.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA2] gi|314989987|gb|EFT34078.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA3] gi|315078073|gb|EFT50124.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA2] gi|315084373|gb|EFT56349.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA2] gi|315085714|gb|EFT57690.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA3] gi|315088866|gb|EFT60842.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA1] gi|315096218|gb|EFT68194.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL038PA1] gi|315098476|gb|EFT70452.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA2] gi|315101164|gb|EFT73140.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA1] gi|315105440|gb|EFT77416.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA1] gi|315108385|gb|EFT80361.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA2] gi|327326130|gb|EGE67920.1| von Willebrand factor, type A [Propionibacterium acnes HL096PA2] gi|327330198|gb|EGE71947.1| von Willebrand factor, type A [Propionibacterium acnes HL097PA1] gi|327331995|gb|EGE73732.1| von Willebrand factor, type A [Propionibacterium acnes HL096PA3] gi|327443197|gb|EGE89851.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA2] gi|327445982|gb|EGE92636.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA2] gi|327448038|gb|EGE94692.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA1] gi|327450840|gb|EGE97494.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA3] gi|327453083|gb|EGE99737.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL092PA1] gi|327453814|gb|EGF00469.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA2] gi|328753528|gb|EGF67144.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL020PA1] gi|328754259|gb|EGF67875.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA1] gi|328754490|gb|EGF68106.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA2] gi|328760648|gb|EGF74215.1| von Willebrand factor, type A [Propionibacterium acnes HL099PA1] Length = 320 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 78/241 (32%), Gaps = 51/241 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + +D+S SM + D + ++++A K A FL + Sbjct: 86 RDRATVVVAIDVSRSM---------------VATDVEPSRLSAAKTAAKDFLGDL---PP 127 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 L+ + + + P+ V +T +L T + Y L Sbjct: 128 RFNV---SLVKFAASAQVVVPPTTDRAAVSTAITNLQ---VLPSTAIGEGI---YSSL-- 176 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + P I+ L+DG N + ++ K +A + + TI Sbjct: 177 -NALKLVPDDPKHPGQKPPAA----IVLLSDGATNVGRPSLEAAK---EAGRQHVPVYTI 228 Query: 368 SIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + N L +S ++ + L V+++I+Q + + K Sbjct: 229 AYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEK 288 Query: 414 Y 414 Sbjct: 289 V 289 >gi|282854077|ref|ZP_06263414.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|282583530|gb|EFB88910.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] Length = 318 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 78/241 (32%), Gaps = 51/241 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + +D+S SM + D + ++++A K A FL + Sbjct: 84 RDRATVVVAIDVSRSM---------------VATDVEPSRLSAAKTAAKDFLGDL---PP 125 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 L+ + + + P+ V +T +L T + Y L Sbjct: 126 RFNV---SLVKFAASAQVVVPPTTDRAAVSTAITNLQ---VLPSTAIGEGI---YSSL-- 174 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + P I+ L+DG N + ++ K +A + + TI Sbjct: 175 -NALKLVPDDPKHPGQKPPAA----IVLLSDGATNVGRPSLEAAK---EAGRQHVPVYTI 226 Query: 368 SIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + N L +S ++ + L V+++I+Q + + K Sbjct: 227 AYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEK 286 Query: 414 Y 414 Sbjct: 287 V 287 >gi|50842462|ref|YP_055689.1| aerotolerance protein BatA [Propionibacterium acnes KPA171202] gi|289427042|ref|ZP_06428758.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|295130539|ref|YP_003581202.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|50840064|gb|AAT82731.1| conserved protein, putative BatA (bacteroides aerotolerance operon) [Propionibacterium acnes KPA171202] gi|289159511|gb|EFD07699.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|291375600|gb|ADD99454.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|332675379|gb|AEE72195.1| hypothetical protein PAZ_c10200 [Propionibacterium acnes 266] Length = 318 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 78/241 (32%), Gaps = 51/241 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + +D+S SM + D + ++++A K A FL + Sbjct: 84 RDRATVVVAIDVSRSM---------------VATDVEPSRLSAAKTAAKDFLGDL---PP 125 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 L+ + + + P+ V +T +L T + Y L Sbjct: 126 RFNV---SLVKFAASAQVVVPPTTDRAAVSTAITNLQ---VLPSTAIGEGI---YSSL-- 174 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + P I+ L+DG N + ++ K +A + + TI Sbjct: 175 -NALKLVPDDPKHPGQKPPAA----IVLLSDGATNVGRPSLEAAK---EAGRQHVPVYTI 226 Query: 368 SIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + N L +S ++ + L V+++I+Q + + K Sbjct: 227 AYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEK 286 Query: 414 Y 414 Sbjct: 287 V 287 >gi|116695550|ref|YP_841126.1| hypothetical protein H16_B1611 [Ralstonia eutropha H16] gi|113530049|emb|CAJ96396.1| conserved hypothetical protein [Ralstonia eutropha H16] Length = 354 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/259 (13%), Positives = 80/259 (30%), Gaps = 70/259 (27%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + L +D S SM D T++ A K A DL+ + Sbjct: 86 ITLVLAMDTSRSMAA---------------TDVAPTRIGASKQAAR------DLIVGLPA 124 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD-- 308 V +G++ + + P+ + + + R L T + + QA +L D Sbjct: 125 SVRLGMVSFAATATVVLPPTDNRQDMLDAIDRFQLQL---GTATGSGLIQALAVLFPDDG 181 Query: 309 ----------------------KKRSFFTNFFRQGVKIPSLPFQKF----IIFLTDGENN 342 + + ++ + P+ + +I L+DG Sbjct: 182 IDLEAILFSGESLAPGPGGRSLTEAAAADAVRKREQERPAAQPGSYRHGAVILLSDGRRT 241 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQR--------------LLKTCVS-SPE 387 ++ ++ A + +++ T+ G L+ + + Sbjct: 242 TGPDPLDAARM---AAQRGVRVYTVGFGTPQGGAAAESGLSYYMQLDEPALRAVAAITNG 298 Query: 388 YHYNVVNADSLIHVFQNIS 406 ++ +A L V++ +S Sbjct: 299 EYFQAGSAADLSQVYRQLS 317 >gi|326918160|ref|XP_003205359.1| PREDICTED: hypothetical protein LOC100539194 [Meleagris gallopavo] Length = 1584 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 60/161 (37%), Gaps = 23/161 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMK 299 ++ + +G++ Y+ R + G K R+ + + T++ A++ Sbjct: 65 VETFEIGPDKTRVGVVRYSDRPTTEFDL--GKYKTREEIKEAARKIRYYGGNTNTGDALR 122 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 S + G ++ +K I LTDG + ++ + + A++ Sbjct: 123 YINTYSFSKEA----------GGRLSDRTVKKVAILLTDGRSQDYVLDPA-----NAARQ 167 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 I+I + + + L S P + ++V + +++ Sbjct: 168 AGIRIFAVGVG--EALKEELDEIASEPKSAHVFHVSDYNAI 206 >gi|21224547|ref|NP_630326.1| secreted protein [Streptomyces coelicolor A3(2)] gi|3559963|emb|CAA20601.1| putative secreted protein [Streptomyces coelicolor A3(2)] Length = 421 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 75/227 (33%), Gaps = 33/227 (14%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 A GE ++LV+D+SGSM T+MAA K A Sbjct: 25 PAGASAGEPTGEDAP-KVDLVLDVSGSM--------------RTRDIDGGTRMAAAKQAF 69 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L E+V +G+ + + + + Y + P D T Sbjct: 70 NEVL------DATPEEVRLGIRTLGADYPGDDRKTGCKDTAQLY--------PVGPLDRT 115 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A K A L+ + + K I+ ++DGE+ + + Sbjct: 116 EA-KTAVATLSPTGWTPIGPALLKAADDLDGGDGSKRIVLISDGEDTCAPLDPCEVAREI 174 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 AK + I T+ + + ++ L C++ + +V + D L Sbjct: 175 AAKGIGLTIDTLGLVPNTKMRQQLS-CIAEATGGTYTSVEHTDELTD 220 >gi|113475854|ref|YP_721915.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110166902|gb|ABG51442.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 460 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 78/207 (37%), Gaps = 32/207 (15%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 D+ ++ I +++ + V +P R + I +++D+SGSM Sbjct: 81 ESDFSVETISEFGSKKTIKPTV-ILPTDTRAKTTPA---DIIIMLDMSGSM--------- 127 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY----------TTR 262 ++R K NA+ F+D+ + V +GL + Sbjct: 128 --KFRDSSPGRRRIKFKGAINAIYKFIDAAN--DKPNLTVRIGLAPFGKGGNQFKVSNKS 183 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 ++ N PS +EK+++ + + + T+ ++ A + L + + +N Sbjct: 184 LDANFYPS-NSEKLKEKIEELANQELSASTNLYQPLETAVKYLINSVNSTSDSNNKTDDS 242 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVN 349 + L +I L+DG +N+ + Sbjct: 243 QSKQL----VVIVLSDGFHNHDRDTEE 265 >gi|313892786|ref|ZP_07826367.1| von Willebrand factor type A domain protein [Veillonella sp. oral taxon 158 str. F0412] gi|313442717|gb|EFR61128.1| von Willebrand factor type A domain protein [Veillonella sp. oral taxon 158 str. F0412] Length = 230 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 83/232 (35%), Gaps = 38/232 (16%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + E + L++D+SGSM A K+ +L +A + +++ Sbjct: 9 KAPEAKKLPVVLLLDISGSMSGA--------------------KIDSLYDATIDMIETFS 48 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 ++ + + +I + T VE + + + + + + + S T A++ A Sbjct: 49 AAQAKEQVIDVAIITFGTHVELHTKYTPVKDLQAKGICKFSAS---GLTPMGTALRMAKD 105 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 ++ PS ++ ++ ++DG N+ + I + + + Sbjct: 106 MIEDKDV-------------TPSRIYRPAVVLVSDGAPNDDWKSPMDKFI-NDGRSAKCQ 151 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++I + + +L+ P +A + F+ IS + + Sbjct: 152 RFAVAI-GNDADRSILERFTQDPNAVLFAEDAKDISEQFKTISMSISTMAVA 202 >gi|162448865|ref|YP_001611232.1| hypothetical protein sce0595 [Sorangium cellulosum 'So ce 56'] gi|161159447|emb|CAN90752.1| hypothetical protein sce0595 [Sorangium cellulosum 'So ce 56'] Length = 656 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 64/200 (32%), Gaps = 43/200 (21%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 +V R+ + ER + +VD SGSM + K+ Sbjct: 279 HHLVRVAVQGKRVPVKERTPVHLVYLVDTSGSM-------------------QSPDKIEL 319 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K +L + D++ V Y G + V G E + + D Sbjct: 320 AKKSLKMLTDTLKPGDTVALCTYAGSV---REVLAPT----GIESKGKILAALADLTAGG 372 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVN 349 T + + AY + + + + I+ L+DG+ N S+ Sbjct: 373 STAMSSGIDLAYS-------------LAERTLVKGHV--NRVIV-LSDGDANVGPTSHDE 416 Query: 350 TIKICDKAKENFIKIVTISI 369 +K +A++ I + T+ Sbjct: 417 ILKTIKRARDKGITLSTVGF 436 >gi|116623631|ref|YP_825787.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226793|gb|ABJ85502.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 589 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 52/355 (14%), Positives = 99/355 (27%), Gaps = 59/355 (16%) Query: 16 ASEKANFSIIFALSVMSFLL-LIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + ++F L V S L+ ++G I + + + SA + LA A + S + Sbjct: 9 RRQGGQAIVMFTLLVSSVLIPMVGLAIDGGRGYLVRLKLSSAVDGGALAAARLLGSGSN- 67 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + S++ ++ F SLSG + V + + Sbjct: 68 -AAQQLSMAKATAAQFVN--ANFPAKFFGASLSGAANVCVDPGTD----------SSDPC 114 Query: 135 NRLDSSNNTIF--YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + S + + + V + F+ + I + R Sbjct: 115 GVGNGSGISTYKVRTVAVKATATMPTLFM---------RIIGMPTVTVSGSGTASRRDVR 165 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL--SHVKE 250 + LV+D S SM + + F + + V Sbjct: 166 VILVMDRSSSMG----------TYYSGINQTPPSINDMALKFVNSFSGAGEFGGRDEVGL 215 Query: 251 DVY--MGLIGY-------TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQ 300 VY G++ Y P K + + + + T + A+ Sbjct: 216 VVYGGSGIVAYPPRDITKDYTDYTKFTPPDNNFKASGNIPKYIADITSGSNTGTAEALYL 275 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 AY L +D + I+ TDG N + N I + Sbjct: 276 AYMTLRADAATNPDLATKLN-----------VIVLFTDGIPNGVTAMANDKTIAN 319 >gi|329893975|ref|ZP_08269994.1| BatA [gamma proteobacterium IMCC3088] gi|328923374|gb|EGG30692.1| BatA [gamma proteobacterium IMCC3088] Length = 323 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 32/181 (17%), Positives = 62/181 (34%), Gaps = 34/181 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV+D S SM A A +++ A+K ++ LD+ E Sbjct: 90 DIMLVLDTSQSMEIADIEGQSGQAMA-------LSRLDAVKQGVMSLLDA-------SEG 135 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + VR V++ T + A +L Sbjct: 136 NRIGLIAFGEQSFVMSDLTAYGDTVRYMVSQLETGFAGDSTRLGDGVGYAVSLLADVDSE 195 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ +TDG N+ S++ ++ AK +K+ ++ A Sbjct: 196 RA------------------IVVLITDG--NDTGSDLPPVEAARLAKALDVKLYVAAVGA 235 Query: 372 S 372 Sbjct: 236 D 236 >gi|224072363|ref|XP_002303700.1| predicted protein [Populus trichocarpa] gi|222841132|gb|EEE78679.1| predicted protein [Populus trichocarpa] Length = 587 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 45/326 (13%), Positives = 106/326 (32%), Gaps = 48/326 (14%) Query: 98 IKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 SL + + + +Q + D S ++ + + V T +Y Sbjct: 64 PNFQRAGSLHAHVSPYNVPPVQIEAEHFSDDEVLSDVSP-DQSLSSRPHAITVKTFTEYP 122 Query: 158 -LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + E N +I++ R + V+D+SGSM Sbjct: 123 AVSASESFSNFGVLVRILAPPLDNTLPHHRARAPIDVVTVLDVSGSMA------------ 170 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 +K+ LK A+ + ++ + ++ +++ + + + Sbjct: 171 ---------SKLILLKRAVNFIIQNLGPSD------RLSIVTFSSSARRMLPLRRMSGSG 215 Query: 277 RQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 R+ T +DS+ + T+ +K+ Q+L ++ + II Sbjct: 216 REDATSVVDSISAIGGTNIVAGLKKGVQVLEERRQHNSVAT----------------IIL 259 Query: 336 LTDG-ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 L+DG + + + + +++K+ + T + + + +S + + Sbjct: 260 LSDGCDTQSHNAQNRLDYLKEESKQPTFPVYTFGFGSDHDSAAMHAISDASRGTFSFIES 319 Query: 395 ADSLIHVFQN-ISQLMVHRKYSVILK 419 + L F I L+ V LK Sbjct: 320 INILQDAFARCIGGLISIVARDVQLK 345 >gi|153876525|ref|ZP_02003802.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152067011|gb|EDN66198.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 180 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 45/119 (37%), Gaps = 9/119 (7%) Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 +++A + + +I LTDGENN + ++ + A Sbjct: 23 LQKAVIGMAGRDTAIGDAIGLAVKKLRERPEGSRILILLTDGENNAGA--LKPLQAAELA 80 Query: 358 KENFIKIVTIS-------INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 K+ I+I TI + N L K + ++ N +L +V+++I + + Sbjct: 81 KQYDIRIYTIGVGGKGGMFSRGLNETELKKIAQLTNGAYFPATNLGALNNVYEHIDKTL 139 >gi|307133505|dbj|BAJ19017.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 37/274 (13%), Positives = 80/274 (29%), Gaps = 36/274 (13%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 +L ++ K+ +E ++II AL LL + F + K + A + A Sbjct: 6 NLTTKLLSTVKQFSKNEHGVYAIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAA 65 Query: 62 LAGASKM-----------VSNLSRLGDRFESISNHAKRALIDDA-----KRFIKNHIKES 105 L ++ V + E + A + + ++ K Sbjct: 66 LLLIAEDNQYRKNKDHSDVKRQNVSQQEIEREGRNFSNAKVQAQWKKRNQELVQGVCKLY 125 Query: 106 LSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 L + + ++ + + N T ++ + + +F L Sbjct: 126 LRSDDSKGQKNSSPVTIKEPFLAECLEEKTQPKNQNGTAK-SVACVVQGSVQRKFW--LP 182 Query: 166 NQRYNQKIVSFIPALLRIEMGERPI---------FLIELVVDLSGSMHCAMNSDPEDVNS 216 + + I G+ + +V DLSGSM ++ Sbjct: 183 WGQTLVSSNQLHDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSGSMKWYID-------- 234 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 K ++ AL + + + +KE Sbjct: 235 RKGDAHKPNRRIDALVEVVGEVQNILVTPRKIKE 268 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/136 (16%), Positives = 45/136 (33%), Gaps = 26/136 (19%) Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS- 346 L T T M L + + + ++ ++ L+DGE+N Sbjct: 392 PLGGTAVTSGM------LIGINLMTSKNSEPEAAPSKLNTNTRRVLLILSDGEDNQPSEK 445 Query: 347 ---NVNTIKICDKAKEN---------FIKIVTISINA-----SPNGQRLLKTCVSSPEYH 389 N+ +C + K+ ++ A N K CV +++ Sbjct: 446 TLVNLMGAGLCREIKDKMNSLQDPKYGQVEPRVAFIAFGTNLPDNQLNAWKQCV--GKHY 503 Query: 390 YNVVNADSLIHVFQNI 405 Y+V + L+ F+ I Sbjct: 504 YSVFSKQGLLDAFKQI 519 >gi|311746225|ref|ZP_07720010.1| BatA protein [Algoriphagus sp. PR1] gi|126576455|gb|EAZ80733.1| BatA protein [Algoriphagus sp. PR1] Length = 347 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 70/235 (29%), Gaps = 62/235 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I LV+D+S SM QD K ++ A K + F+ + + Sbjct: 107 IDIMLVMDISESMD---------------LQDFKPNRLEAAKATAIDFI-------NGRF 144 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILTSDK 309 +G++ + + + + + + + K T A+ A Sbjct: 145 GDRIGMVVFAGEAYSLAPLTNDYKLLTDLIQDISFNMMEAKGTAIGSAIASATNR----- 199 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 S K +I L+DGE+N NV+ + A IKI TI++ Sbjct: 200 -------------MKESESASKVLILLSDGESNA--GNVDPLFAAQLASALDIKIYTIAV 244 Query: 370 ----NASPNGQRL--------------LKTCVS-SPEYHYNVVNADSLIHVFQNI 405 L+ + + +L ++F I Sbjct: 245 GKDGMVPYGTDFFGRPQMVESYLDETNLREIAKIGNGEFFRASDGGTLNNIFDRI 299 >gi|73960093|ref|XP_855328.1| PREDICTED: similar to calcium activated chloride channel 4 [Canis familiaris] Length = 938 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 78/234 (33%), Gaps = 56/234 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM+ ++ + A FL + + + ++ Sbjct: 308 LVLDKSGSMNGF-------------------NRLNRMNQAAKHFL--LQTIENGS---WV 343 Query: 255 GLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + K R + + + T +K A+Q++ + Sbjct: 344 GMVHFDSTAYIKSNLIQIISSKERNNLLESLPTTANGGTSICAGIKSAFQVIGEIYPQID 403 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 + I+ LTDGE+N T K C + K++ I I++ Sbjct: 404 GSE----------------IVLLTDGEDN-------TAKNCIGEVKQSGAIIHLIAL-GP 439 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADS---LIHVFQNIS---QLMVHRKYSVILKG 420 Q +++ + H+ + LI F ++ + + + KG Sbjct: 440 SADQAVIEMSTITGGNHFFASDEAQNNGLIDAFGALASGNTDLSQQPLQLESKG 493 >gi|224077994|ref|XP_002192008.1| PREDICTED: similar to matrilin 4 [Taeniopygia guttata] Length = 580 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 65/173 (37%), Gaps = 24/173 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTD 293 +D I L +G+I Y+++V+ T R + R ++S+I + T Sbjct: 55 RFMMDIIGNLDVGPNATRVGVIQYSSQVQNIFSLK--TFFTRADMERAINSIIPLAQGTM 112 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ A + + ++ + + + I +TDG + V + Sbjct: 113 TGLAIQYAMNVAFTTQEGARPLHK----------RIPRIAIVVTDG---RPQDRVTEVAT 159 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQN 404 +A+ I+I + I L+ S P E+ + V + LI F Sbjct: 160 --QARNAGIEIYAVGIQ--RADMNSLRAMASPPLEEHVFLVESF-ELIQQFAK 207 >gi|260810222|ref|XP_002599902.1| hypothetical protein BRAFLDRAFT_74022 [Branchiostoma floridae] gi|229285186|gb|EEN55914.1| hypothetical protein BRAFLDRAFT_74022 [Branchiostoma floridae] Length = 1201 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 ++ ++ +G++ Y+ R K + T++ A+ Sbjct: 693 VNVVNTFDVSLTATRVGVVQYSDRNTLVFNLGNKVNKPSTVSAINNIVYQSGGTNTGAAL 752 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + Q +G +P K II LTDG++++ S + Sbjct: 753 QYVRQY------------AAWRGGNVP-----KVIIVLTDGKSSDSVSGPS-----QNLV 790 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 +++ I + + +GQ LL+ + + N ++L IS + V Sbjct: 791 AAGVEVYAIGVGSFDHGQ-LLQIANNKQNNVIELNNFNALATKIDMISTNVCSYALHV 847 >gi|218781309|ref|YP_002432627.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218762693|gb|ACL05159.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 336 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 63/192 (32%), Gaps = 47/192 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R I + VD+S SM + +D + ++ K + + Sbjct: 85 SRKGVDIMVCVDISNSM---------------MVEDAQPNRLERAKREVADLIR------ 123 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI---LKPTDSTPAMKQAYQ 303 V +GL+ ++ + + ++ ++ + L+ + TD A++ Sbjct: 124 -VATGDRLGLVAFSGVAFTQCPLTLDYQAIQMFLDQLTVDLLPLRFQGTDLGAAIEMGMT 182 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 P K I+ +TDGE+N +K +KA + I+ Sbjct: 183 AF------------------DPKSSTDKVILLITDGEDNE----EAGLKAAEKASDEGIR 220 Query: 364 IVTISINASPNG 375 I + I G Sbjct: 221 IFVLGIGDPAGG 232 >gi|94969533|ref|YP_591581.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551583|gb|ABF41507.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 362 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 47/262 (17%), Positives = 88/262 (33%), Gaps = 61/262 (23%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 L + + + IP +R E I ++ D+SGSM Sbjct: 112 LEKENFEVLDQNIPQQIRHFSSEDAPVSIGVIFDMSGSMS-------------------- 151 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K+ + A++ F + E ++ + + E + + E +++ + Sbjct: 152 -NKIDKSREAIVEFF---KTANPDDEFF---VVAFNDKPEVLQDFTNRIEDIQEKL---- 200 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 IL+P D T + Y N RQ +K ++ ++DG +N+ Sbjct: 201 --TILQPKDRTSLLDAIY----------LGMNKMRQAKYE-----RKALLIISDGGDNHS 243 Query: 345 KSNVNTIKICDKAKENFIKIVTISI--NASP-----NGQRLLKTCVS-SPEYHYNVVNAD 396 + N IK +E ++I I I A G LL + + + N + Sbjct: 244 RYTENEIK--SMVREADVQIYAIGIYDLAPTTTEEMAGPALLGEISDWTGGRMFPIDNVN 301 Query: 397 SLIHVFQNISQLMVHRKYSVIL 418 L V I V + +L Sbjct: 302 ELADVATKIG---VELRNQYVL 320 >gi|297265499|ref|XP_002799207.1| PREDICTED: matrilin-3-like [Macaca mulatta] Length = 445 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 62/168 (36%), Gaps = 22/168 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ID L D + ++ Y + V+ + T+K + + T S A++ A Sbjct: 111 IDTLDIGPADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGQITPLSTGTMSGLAIQTA 170 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + + G + PS K I +TDG + VN + +A+ + Sbjct: 171 MDEVFTAEA----------GARGPSSNIPKVAIIVTDG---RPQDQVNEVA--ARARASG 215 Query: 362 IKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 I++ + + + LK S P E+ + V + L FQ Sbjct: 216 IELYAVGV--DRADMQSLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 261 >gi|120407060|ref|NP_766396.2| anthrax toxin receptor-like precursor [Mus musculus] Length = 641 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 69/207 (33%), Gaps = 43/207 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + IF + LV+D SGS+ + S KK T Sbjct: 72 QGIFDLYLVLDKSGSVADNW----IHIYSFAEGLVKKFT--------------------- 106 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ + +I Y+T E + + ++++ + + + + T +++A Sbjct: 107 -NPNLRISIITYSTEAEVILPLTSDSKEINKSLLVLKNIVPQGLTHMQKGLRKA------ 159 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 N + + II LTDG K ++T++ KA+ + T+ Sbjct: 160 --------NEQIRKSTLGGRIVNSVIIALTDGL-LLLKPYLDTMEEAKKARRMGAIVYTV 210 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVN 394 + Q L P+ + V Sbjct: 211 GVFMYSKQQ--LVNIAGDPDRCFGVDE 235 >gi|315636668|ref|ZP_07891900.1| conserved hypothetical protein [Arcobacter butzleri JV22] gi|315479050|gb|EFU69751.1| conserved hypothetical protein [Arcobacter butzleri JV22] Length = 1209 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 44/244 (18%), Positives = 85/244 (34%), Gaps = 27/244 (11%) Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + +NI T +N + T + T + ++N + I+ Sbjct: 649 DAKNINLQIEAKATEARDNEDGQNFKTATDSTTDKTPTLVVGSNKDSVINGGAGKDILIG 708 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + + I LVVD SGSM ++ + + +++ LK+AL Sbjct: 709 DTGGTELNVQAGKNYNIALVVDTSGSM--------KEASGSKTAWGTTISRIDLLKDALK 760 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK-VRQYVTRDMDSLILKPTDST 295 DS+ + + +I + T ++ I + T K + +T+ T+ Sbjct: 761 NLADSLKGHDG---KINVSIIDFDTNAKEPITFNDLTSKNISDLITKIDALKAEGGTN-- 815 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE----NNNFKSNVNTI 351 Y+ F T G ++ FLTDG+ N + K+N +T Sbjct: 816 ------YEDAFLKTTSWFDTQSVTYGKAQG---YENLTYFLTDGDPTFSNRDTKNNGSTT 866 Query: 352 KICD 355 + D Sbjct: 867 EYSD 870 >gi|291401974|ref|XP_002717657.1| PREDICTED: inter-alpha trypsin inhibitor heavy chain precursor 5 [Oryctolagus cuniculus] Length = 940 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 44/222 (19%), Positives = 74/222 (33%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM A K+ K+AL L + Sbjct: 296 VVFVLDSSASMVGA--------------------KLRQTKDALFTILHDLRPQD------ 329 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S +R TD A++ A ++L + Sbjct: 330 RFNIIGFSNRIKVWKDNLISVTPNSIRDGKIYIHHMSPTGGTDINGALQTAIRLLNNYVA 389 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 + + I+FLTDG+ +T+KI + KE + I TI Sbjct: 390 HNDIEDRSVS-----------LIVFLTDGK--PTVGETHTLKILNNTKEAAQGRVCIFTI 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGNDVDFKLLEKLSLENCGLTRRVHEEENAGAQLIGFYDEI 478 >gi|90406741|ref|ZP_01214934.1| hypothetical protein PCNPT3_01875 [Psychromonas sp. CNPT3] gi|90312194|gb|EAS40286.1| hypothetical protein PCNPT3_01875 [Psychromonas sp. CNPT3] Length = 404 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 68/416 (16%), Positives = 131/416 (31%), Gaps = 58/416 (13%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS- 73 I +++ + SI+F + + L ++ I + A++ + +A A +N+S Sbjct: 5 IKAQQGSISIVFIFLLPAMLAMLALSILTAMYLLSVTRASQASDVSSIACAYSQRANVSL 64 Query: 74 ------RLGDRFESISNHAKRALIDDAKRFIK-NHIKESLSGYSAVFYNTEIQNIVNSSR 126 F S N L + I+ + L + + + Sbjct: 65 TQGFAQYYKPNFISHVNAQSTFLSGQKQCKIQIGYAFTPLLKDLLPASSQNKVHASVQIQ 124 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSY---------DYRLQFIEHLLNQRYNQKIVSFI 177 + T ++ + + ++ ++ S + +Q IE + Q S + Sbjct: 125 STSTLTVHSEIKPMDLSLVLDISGSMSGRIGLLKRIINQAIQNIEQQNTKNNTQIRFSIV 184 Query: 178 PALLRIEMGERPIFLI----ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 P + + P L VD +M N A I D +K+ Sbjct: 185 PFSSGVSISNAPWLAKSKGKALCVD---AMSYPGNVLNTAQTVADI--DTHPSKLNIRAK 239 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 L ++ + VY L+ + KVR++V IL T Sbjct: 240 EPLSLINDCN--------VYSLLL----------PLTNNLSKVRKHVDSL---SILGSTA 278 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK- 352 S + L + S + +I TDGE+++ ++ Sbjct: 279 SYQGFIWGVRTLLPN------WQKAWNLQPETSSLLSQRLILFTDGEDDSRDQFDKLVRS 332 Query: 353 -ICDKAKENF-IKIVTISINASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQN 404 +C + +++F I I I SP K C+ S Y+ N L F Sbjct: 333 GMCQRIQDDFNIDISFIGFGLSPRRLDQFKKCIGSNGKGVVYDAKNGSDLEKFFAE 388 >gi|301058344|ref|ZP_07199377.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447580|gb|EFK11312.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 598 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 56/190 (29%), Gaps = 45/190 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + I + +D+S SM + +D K ++ K + FL + Sbjct: 85 TQKGVDIVIALDVSPSM---------------LVEDIKPDRLERAKREITDFLKVVK--- 126 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQIL 305 +GL+ ++ + + ++ + TD A+ A + Sbjct: 127 ----GDRVGLVAFSGAAYTQCPLTLDYGALMMFLNILHPNNIPHPGTDLGAAVLGAIKAF 182 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 P K I+ +TDGE+N + +A IKI Sbjct: 183 ------------------DPKSETDKVILLITDGEDNE----KRGLDAAREAVRKGIKIF 220 Query: 366 TISINASPNG 375 + G Sbjct: 221 VFGMGDPAGG 230 >gi|310286822|ref|YP_003938080.1| von Willebrand factor type A domain [Bifidobacterium bifidum S17] gi|309250758|gb|ADO52506.1| conserved hypothetical protein containing von Willebrand factor type A domain [Bifidobacterium bifidum S17] Length = 1156 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 31/266 (11%), Positives = 71/266 (26%), Gaps = 76/266 (28%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + P I +V D SGSM M +++ K A+ Sbjct: 600 TQSGSSQTTVSPA-DIVVVFDTSGSMSNPMG---------------HNSRLEVAKTAVNS 643 Query: 238 FLDSIDLLSHVKED--VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + +D + M L+ ++T + + V T+ Sbjct: 644 MAQHLLTSENQGKDSNIRMALVPFSTTAGNVSNFTDNAMDIVSAVNGLGAD---GGTN-- 698 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT----- 350 + +K+I+F++DG+ S+V T Sbjct: 699 ---------------WEAALKAANAKLTSGRKGVKKYIVFMSDGDPTYRTSSVRTGTDWL 743 Query: 351 -------------------------------IKICDKAKENFIKIVTISINASPNGQRLL 379 + + + + ++ +++ P ++ Sbjct: 744 GRPIYDADDGWGLPAGVHGSGLSDRYGANLSSAVAEANRRGDATLFSVGVSSDPT--KMR 801 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNI 405 + +Y+ + D L F +I Sbjct: 802 GFADQTKGSYYSATSTDELNKAFADI 827 >gi|145219382|ref|YP_001130091.1| hypothetical protein Cvib_0567 [Prosthecochloris vibrioformis DSM 265] gi|145205546|gb|ABP36589.1| conserved hypothetical protein [Chlorobium phaeovibrioides DSM 265] Length = 356 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 62/182 (34%), Gaps = 11/182 (6%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 + + + ++ +I+FAL + L + + H K +++A +AA L GA Sbjct: 4 THHSSRRLHRQRGGTAILFALVLPVLLGFAALAVDLARIHLVKVELQNAADAASLGGA-- 61 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE--SLSGYSAVFYNTEIQNIVNSS 125 +LS G + + S ++ A+ +++++ + + +QN Sbjct: 62 --HSLSDAGGQPYNWSAA-----VNAAQNVVQSNVANGAHIQDATIETGYWNLQNPSLGL 114 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 R + T D + + F +L + S I + Sbjct: 115 RPAGTGSVPAAGDVPAVRTTVAISANQNNGPLPLFFAPILGIAESSIQASAIAVIAPPSG 174 Query: 186 GE 187 G Sbjct: 175 GT 176 >gi|189501234|ref|YP_001960704.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] gi|189496675|gb|ACE05223.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] Length = 331 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 39/239 (16%), Positives = 76/239 (31%), Gaps = 58/239 (24%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + I +D+S SM + D E +++ A K L F+ Sbjct: 88 VTEASEKGIDIVFALDISESM---LEEDFEG------------SRLDAAKKIALRFIRE- 131 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR-DMDSLILKPTDSTPAMKQA 301 + GL+ + + + + V + +D++ K T A+ Sbjct: 132 ------RPQDRFGLVLFRGKSFTLCPLTLDHRLLGMLVRQVSVDAISDKGTAIGSAILVG 185 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 L + S+ ++ ++ LTDGE+N+ + T + A+ Sbjct: 186 TNRLRA------------------SVSKERVLLLLTDGEHNSGEVGPVTAS--EIAQSEG 225 Query: 362 IKIVTISIN------------ASPNG---QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I+I I + A G Q L + ++ + +SL F I Sbjct: 226 IRIYVIGVRNEEEAGSPESMDAEREGVDEQVLGTVAGMTGGRYFRASDENSLKDAFGEI 284 >gi|255566346|ref|XP_002524159.1| protein binding protein, putative [Ricinus communis] gi|223536577|gb|EEF38222.1| protein binding protein, putative [Ricinus communis] Length = 514 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 69/223 (30%), Gaps = 52/223 (23%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 +RP + V+D+SGSM K++ +K A+L + + Sbjct: 55 SSNDRPGLDLVAVLDVSGSMEGE--------------------KISKVKTAMLFMIKKLS 94 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAY 302 + + ++ ++ + +E ++ + ++ L T+ T +K Sbjct: 95 SID------RLSIVTFSGDARRLCPLRQISETSQRELENLINGLKAEGATNITAGLKTGL 148 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 +L + I+ ++DGE N + Sbjct: 149 NVLNDRRLSGGRVVG---------------IMLMSDGEQNAGGDAAQVPV-------GNV 186 Query: 363 KIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVFQ 403 + T + + +LK S +V N D+L F Sbjct: 187 PVHTFGFGINHEPR-VLKAIAQNSVGGTFSDVQNTDNLSKAFS 228 >gi|223939937|ref|ZP_03631805.1| von Willebrand factor type A [bacterium Ellin514] gi|223891428|gb|EEF57921.1| von Willebrand factor type A [bacterium Ellin514] Length = 342 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 65/193 (33%), Gaps = 50/193 (25%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + +GE + ++D S SM A D + ++++ K A+L Sbjct: 86 TTEVSKALGED----VMFLLDCSKSMLAA---------------DVQPSRLSRSKYAILD 126 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTP 296 F+ +GL+ + + ++ + R + + + TD Sbjct: 127 FVQQHGRG-------RVGLVAFAGQAFLQCPLTFDYDAFRDALLAIDEQTIPVGGTDIGR 179 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ +AY+ + + + K ++ +TDGE+ IK Sbjct: 180 ALDEAYRAMEKNDRH-------------------KILVLITDGEDLE----KAGIKTAQA 216 Query: 357 AKENFIKIVTISI 369 E I + TI + Sbjct: 217 LAEKGIVVYTIGV 229 >gi|114682165|ref|XP_001154021.1| PREDICTED: matrilin 4 isoform 4 [Pan troglodytes] gi|114682167|ref|XP_001154256.1| PREDICTED: matrilin 4 isoform 7 [Pan troglodytes] gi|114682175|ref|XP_514674.2| PREDICTED: matrilin 4 isoform 9 [Pan troglodytes] gi|114682177|ref|XP_001154315.1| PREDICTED: matrilin-4 isoform 8 [Pan troglodytes] Length = 581 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLPGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 44.1 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 372 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 429 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 430 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 474 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 475 EGIVVYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 524 >gi|114682171|ref|XP_001153893.1| PREDICTED: matrilin 4 isoform 2 [Pan troglodytes] gi|114682181|ref|XP_001153957.1| PREDICTED: matrilin-4 isoform 3 [Pan troglodytes] Length = 499 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLPGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 44.1 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 290 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 347 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 348 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 392 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 393 EGIVVYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 442 >gi|114682173|ref|XP_001153832.1| PREDICTED: matrilin 4 isoform 1 [Pan troglodytes] Length = 488 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 5 LMGLLPGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 63 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 64 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 108 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 109 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 154 Score = 44.1 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 279 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 336 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 337 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 381 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 382 EGIVVYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 431 >gi|114682169|ref|XP_001154082.1| PREDICTED: matrilin 4 isoform 5 [Pan troglodytes] gi|114682179|ref|XP_001154207.1| PREDICTED: matrilin-4 isoform 6 [Pan troglodytes] Length = 540 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LMGLLPGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 Score = 44.1 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 331 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 388 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 389 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 433 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 434 EGIVVYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 483 >gi|298291248|ref|YP_003693187.1| von Willebrand factor A [Starkeya novella DSM 506] gi|296927759|gb|ADH88568.1| von Willebrand factor type A [Starkeya novella DSM 506] Length = 313 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 59/177 (33%), Gaps = 37/177 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +DLSGSM V + K +++ A++ + + Sbjct: 87 IVLALDLSGSM----------VKEDFVLDGKPLSRLDAVRR-VASRFVAARRGD------ 129 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + R P++ V + + + T + + A + L S Sbjct: 130 RIGLVIFGDRAYVAQPPTFDVGSVAHAIEAAQIGISGRSTAISDGLGLATRRLLQSDATS 189 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 K ++ L+DG + + K A + I++ TI++ Sbjct: 190 ------------------KVVVLLSDGVDTSGKVQAGDAAR--LAASHGIRVHTIAL 226 >gi|192288907|ref|YP_001989512.1| hypothetical protein Rpal_0477 [Rhodopseudomonas palustris TIE-1] gi|192282656|gb|ACE99036.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 443 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 55/393 (13%), Positives = 118/393 (30%), Gaps = 39/393 (9%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 F + + N ++IFAL ++ + +G + + ++ +++A +AA + S Sbjct: 3 FSRSIARFSRDRRGNIAVIFALVLVPLISAVGCAVDYSRANALRSKLQAAADAASVGAVS 62 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIK--ESLSGYSAVFYNTEIQNIVNS 124 + G + + DDA R ++ + S+ T+ ++V S Sbjct: 63 RTSPAYVAAGAMS---GDGVISSGADDALRIFNGNLNGLTGYTLASSSATVTKASDVVTS 119 Query: 125 ---SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 ++ M + S T+ S + F L N + Sbjct: 120 QVTFSAQISTMFMKVVGMSAMTVGGTSTATASMPKYIDFYLLLDNSPSMGVGATPTDVSA 179 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 I D +N+ N A T++ L++A +D+ Sbjct: 180 MIAATANKSSD-----DHCAFACHDVNNKNNYYNLAKALGIT--TRIDVLRSATQQLMDT 232 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIE----PSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + M + + + + + +D + +K + Sbjct: 233 ASATATYSNQFRMAIYDFGASAQTAGLRNLFSLSASLSSAKTAASAIDLMTVKGQNDNND 292 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------------ 345 +Y + N S QK++ F++DG + + Sbjct: 293 QDTSYTAIFPAIN-----NEISSPGSGVSGSPQKYLFFVSDGVADEYNPSCLKPKTGNRC 347 Query: 346 SNVNTIKICDKAKENFIKI---VTISINASPNG 375 + +C K+ IKI T +N N Sbjct: 348 QSPINPALCKTLKDRGIKIAVLYTTYLNLPSND 380 >gi|19031201|gb|AAL17974.1| proximal thread matrix protein 1 [Mytilus galloprovincialis] Length = 453 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 75/219 (34%), Gaps = 34/219 (15%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VVD +G A D ++ +KN + +D + + Sbjct: 245 VVDCAGHADIAFVFD-----ASSSINANNPNNYQLMKNFMKDIVDRFNKTGP--DGTQFA 297 Query: 256 LIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 ++ + R K ++ + + S I+ T ++ A + ++ Sbjct: 298 VVTFADRATKQFGLKDYSSKADIKGAIDKVSPS-IIGQTAIGDGLENARLEVFPNRNGG- 355 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 QK +I LTDG+NN KS + + ++ + IV I + Sbjct: 356 -----------GREEVQKVVILLTDGQNNGHKSPEHESSL---LRKEGVVIVAIGV---- 397 Query: 374 NGQRLLK----TCVSSPEYHYNVVNADSLIHVFQNISQL 408 G LK SS EY + + D L + +++ +L Sbjct: 398 -GTGFLKSELINIASSEEYVFTTSSFDKLSKIMEDVVKL 435 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 57/161 (35%), Gaps = 26/161 (16%) Query: 224 KRTKMAALKNALLLFLDSIDLL-SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + +K+ + +DS + + + G++ ++ V+ K ++ + + Sbjct: 69 NKENYQMMKDFVKELVDSFTTVGVNGRNGSQFGVVQFSQGVKTAFPL--NKFKTKEDIKK 126 Query: 283 DMDSLI---LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + ++ T+ +K + S + + QK +I +TDG Sbjct: 127 GIQDMVPRNGGQTEIGTGLKHVRENSFSGAEGG------------GNPDKQKIVILMTDG 174 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 ++N + K K + ++ I I GQ +K Sbjct: 175 KSNAGAPPQHEAH---KLKAEGVTVIAIGI-----GQGFVK 207 >gi|327262385|ref|XP_003216005.1| PREDICTED: vitrin-like [Anolis carolinensis] Length = 748 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 51/373 (13%), Positives = 112/373 (30%), Gaps = 68/373 (18%) Query: 49 KKNSMESANNAAILAGASKMVSNLSR-LGDRFESISNHAKRALIDDAKRFIKNHIKESLS 107 KN++E L+ K +S +++ + A I + + ++E+ Sbjct: 426 LKNAIEKILQKGGLSNVGKALSFVNKNFFEDSNGNRGAAPNVAIVMVDGWPTDKVEEASR 485 Query: 108 -----GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 G + F E + + + + + +N +Y+++V + + Sbjct: 486 LARESGINIFFVTIEGPDENEKQNVVEANFVDKAVCRTNG--YYSINVPSWFSLHKVVQP 543 Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + + L ++G V+D S S+ Sbjct: 544 LVKRICDSDHLACSKTCLNSADIG--------FVIDGSSSVGT----------------- 578 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + L + D +G + YT + +Q V Sbjct: 579 ------GNFRTVLQFVANISKEFEISDTDTRIGAVQYTYEQRLEFGFE--KQSTKQDVLN 630 Query: 283 DMD--SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + + T + A+ A++ L K + +K +I +TDG Sbjct: 631 AIKRINYWSGGTSTGAAINYAFEQLFIKSKPNK----------------RKIMILITDGR 674 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL 398 + + T A +N + ++ I Q L+ + P+ + + V DSL Sbjct: 675 SYDDVQGPAT-----AAHQNGVITYSVGIA--WAAQDELEAIATDPDKEHSFFVDEFDSL 727 Query: 399 IHVFQNISQLMVH 411 I Q + Sbjct: 728 YRFVPRIIQNICT 740 >gi|257892717|ref|ZP_05672370.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] gi|257829096|gb|EEV55703.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] Length = 1347 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 71/223 (31%), Gaps = 30/223 (13%) Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 N + + E N+ + + + N+ + YN + + + Sbjct: 133 ANWRNYDYAADNGSTAGEEDVNV----PAQTVQLWGDERNFENSYLDYNGAYIKKWVEPV 188 Query: 159 --QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + L+ + + ++ E E I V+D S SM + Sbjct: 189 LSNNLTSDLHPEDATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASM------------N 236 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--------TTRVEKN-- 266 + ++K AAL A+ + +LLS D+ +G++ + + Sbjct: 237 EGTLEGGGQSKNAALIEAVNEISE--NLLSDPNMDIRIGMVNFYHNSTVINNQEQISSDI 294 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ ++ + T T +K Y+ L +D Sbjct: 295 FPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYADN 337 >gi|301617277|ref|XP_002938060.1| PREDICTED: matrilin-4-like [Xenopus (Silurana) tropicalis] Length = 721 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 64/161 (39%), Gaps = 23/161 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDST 295 +D I+ L +G++ Y+++V+ + + + + + + T + Sbjct: 55 MIDIINSLEVGLSTTRVGVVQYSSQVQTVFSLKTFSNKSDMEKAINEIIP--LAQGTMTG 112 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + ++++ G + S + I +TDG + V + + Sbjct: 113 LAIQYAMNVAFTEEE----------GARPLSKNIPRVAIIVTDG---RPQDRVTEVAV-- 157 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 +A+E I+I + + L+ S P ++ ++V + Sbjct: 158 QAREAGIEIYAVGVQ--RADVSSLRAMASHPLDDHVFHVES 196 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 53/159 (33%), Gaps = 23/159 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ +D + + ++GL+ Y++RV S T + K T + A Sbjct: 489 VINIVDSSAISAQGTHIGLVQYSSRVRTEFPLSQYTNGQDIKTAVKNIQYMEKGTMTGLA 548 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 +K + S+ + K + TDG + +++ + KA Sbjct: 549 LKHMVEQSFSEAE-------------GARKNVPKIGLVFTDG---RSQDDIS--EWAKKA 590 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEY---HYNVV 393 KE I + + + + + L S P Y Sbjct: 591 KEAGITMYAVGVGKAVEDE--LNEIASDPVNKHSFYTAD 627 >gi|194227185|ref|XP_001916970.1| PREDICTED: inter-alpha (globulin) inhibitor H5 [Equus caballus] Length = 905 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 49/222 (22%), Positives = 77/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + H Sbjct: 253 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQDHFN--- 289 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + VR TD A+++A ++L + Sbjct: 290 ---VIGFSNRIKVWKDHLVSVTPDSVRDGKVYIHHMSPTGGTDINGALQRAIRLLNNYVA 346 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---FIKIVTI 367 + + IIFLTDG+ NT+KI + KE I I TI Sbjct: 347 HNDIEDRSVS-----------LIIFLTDGK--PTVGETNTLKILNNTKEATRGQICIFTI 393 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 394 GIGDDVDFKLLEKLSLENCGLTRRVHDEDDAGSQLIGFYDEI 435 >gi|148694080|gb|EDL26027.1| integrin, alpha 11 [Mus musculus] Length = 1172 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V+ V T++ Sbjct: 162 LINILKKFYIGPGQIQVGIVQYGEDAVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 217 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 218 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIRQS 264 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 +++ + +++ +K S P+ + +NV + +L + + Sbjct: 265 EKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 324 Query: 408 LM 409 + Sbjct: 325 RI 326 >gi|73972306|ref|XP_860269.1| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 3 [Canis familiaris] Length = 549 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 79/229 (34%), Gaps = 45/229 (19%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A KN L F++ + + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGAGNFTR--------AKNCLRDFIEKVASYGVKPKY--- 309 Query: 255 GLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 GL+ Y T R + + + W T+ + Q D ++ T++ A++ Y ++ Sbjct: 310 GLVTYATNPKVWVRVRDKNSSDADWVTKILNQ--VSYEDHMLKSGTNTKKALEAVYSMMN 367 Query: 307 SDKKR-SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV----NTIKICDKAKEN- 360 + N R II +TDG N V N D +++ Sbjct: 368 WPGETPPADWNRTRH-----------VIILMTDGLYNMGGDPVSVIHNIRNFLDIGRDHK 416 Query: 361 -------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + ++ + V + ++L VF Sbjct: 417 NPREDYLDVYVFGVGPLVNQENINALASKKDKEQHVFKVKDMENLEDVF 465 >gi|73972310|ref|XP_849985.1| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 2 [Canis familiaris] Length = 1112 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 79/229 (34%), Gaps = 45/229 (19%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A KN L F++ + + Sbjct: 606 IVLDPSGSMNIYLVLDGSDSIGAGNFTR--------AKNCLRDFIEKVASYGVKPKY--- 654 Query: 255 GLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 GL+ Y T R + + + W T+ + Q D ++ T++ A++ Y ++ Sbjct: 655 GLVTYATNPKVWVRVRDKNSSDADWVTKILNQ--VSYEDHMLKSGTNTKKALEAVYSMMN 712 Query: 307 SDKKR-SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV----NTIKICDKAKEN- 360 + N R II +TDG N V N D +++ Sbjct: 713 WPGETPPADWNRTRH-----------VIILMTDGLYNMGGDPVSVIHNIRNFLDIGRDHK 761 Query: 361 -------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + ++ + V + ++L VF Sbjct: 762 NPREDYLDVYVFGVGPLVNQENINALASKKDKEQHVFKVKDMENLEDVF 810 >gi|73972308|ref|XP_532086.2| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 1 [Canis familiaris] Length = 767 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 79/229 (34%), Gaps = 45/229 (19%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A KN L F++ + + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGAGNFTR--------AKNCLRDFIEKVASYGVKPKY--- 309 Query: 255 GLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 GL+ Y T R + + + W T+ + Q D ++ T++ A++ Y ++ Sbjct: 310 GLVTYATNPKVWVRVRDKNSSDADWVTKILNQ--VSYEDHMLKSGTNTKKALEAVYSMMN 367 Query: 307 SDKKR-SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV----NTIKICDKAKEN- 360 + N R II +TDG N V N D +++ Sbjct: 368 WPGETPPADWNRTRH-----------VIILMTDGLYNMGGDPVSVIHNIRNFLDIGRDHK 416 Query: 361 -------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + L + ++ + V + ++L VF Sbjct: 417 NPREDYLDVYVFGVGPLVNQENINALASKKDKEQHVFKVKDMENLEDVF 465 >gi|149027999|gb|EDL83450.1| complement factor B, isoform CRA_c [Rattus norvegicus] Length = 739 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 84/239 (35%), Gaps = 45/239 (18%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 GE+ ++++D SGSM+ M D D A K L ++ + Sbjct: 227 SPGEQQK--RKIILDPSGSMNIYMVLDGSDSIGASNFTG--------AKRCLANLIEKVA 276 Query: 244 LLSHVKEDVYMGLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 GL+ Y T E++ + W TEK+ Q D + T++ Sbjct: 277 SYGVKPRY---GLVTYATVPKVLVRVSEERSSDADWVTEKLNQ--ISYEDHKLKSGTNTK 331 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK--- 352 A++ Y ++ ++ + II +TDG +N V I+ Sbjct: 332 KALQAVYSMM----------SWPGDAPPEGWNRTRHVIIIMTDGLHNMGGDPVTVIEDIR 381 Query: 353 -ICDKAKEN--------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + D ++ + + + P L + ++ ++ + V + + L +VF Sbjct: 382 DLLDIGRDRKNPREDYLDVYVFGVGPLVDPVNINALASKKNNEQHVFKVKDMEDLENVF 440 >gi|218156285|ref|NP_997631.2| complement factor B [Rattus norvegicus] gi|149027998|gb|EDL83449.1| complement factor B, isoform CRA_b [Rattus norvegicus] Length = 763 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 84/239 (35%), Gaps = 45/239 (18%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 GE+ ++++D SGSM+ M D D A K L ++ + Sbjct: 251 SPGEQQK--RKIILDPSGSMNIYMVLDGSDSIGASNFTG--------AKRCLANLIEKVA 300 Query: 244 LLSHVKEDVYMGLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 GL+ Y T E++ + W TEK+ Q D + T++ Sbjct: 301 SYGVKPRY---GLVTYATVPKVLVRVSEERSSDADWVTEKLNQ--ISYEDHKLKSGTNTK 355 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK--- 352 A++ Y ++ ++ + II +TDG +N V I+ Sbjct: 356 KALQAVYSMM----------SWPGDAPPEGWNRTRHVIIIMTDGLHNMGGDPVTVIEDIR 405 Query: 353 -ICDKAKEN--------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + D ++ + + + P L + ++ ++ + V + + L +VF Sbjct: 406 DLLDIGRDRKNPREDYLDVYVFGVGPLVDPVNINALASKKNNEQHVFKVKDMEDLENVF 464 >gi|148657647|ref|YP_001277852.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569757|gb|ABQ91902.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 966 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 39/241 (16%), Positives = 82/241 (34%), Gaps = 55/241 (22%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 ++ ++P + +V+D SGSM + +R K+ K A+ + Sbjct: 401 LDTKQQPDLALVMVIDRSGSMAEPVAGG-------------RRNKLDLAKEAVYQASLGL 447 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQA 301 + +GL+ + ++ V + R + S I T+ P ++QA Sbjct: 448 TPID------QVGLVVFDDTANWVLQLQPLPSMV--EIERALGSFGIGGGTNIRPGIEQA 499 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKE 359 L S + K ++ LTDG E+N + + + Sbjct: 500 ALALASTDAKI------------------KHVLLLTDGIAESNYSD-------LIAQMRA 534 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ-----NISQLMVHRKY 414 + I I T+++ N L+ + Y V + D + +F + ++ + Sbjct: 535 SGITISTVAVGLDAN-PNLVDVANAGGGRSYRVTSIDEVPRIFLQETIIAAGRDIIEQPI 593 Query: 415 S 415 Sbjct: 594 E 594 >gi|33086684|gb|AAP92654.1| Da1-24 [Rattus norvegicus] Length = 1116 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 84/239 (35%), Gaps = 45/239 (18%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 GE+ ++++D SGSM+ M D D A K L ++ + Sbjct: 604 SPGEQQK--RKIILDPSGSMNIYMVLDGSDSIGASNFTG--------AKRCLANLIEKVA 653 Query: 244 LLSHVKEDVYMGLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 GL+ Y T E++ + W TEK+ Q D + T++ Sbjct: 654 SYGVKPRY---GLVTYATVPKVLVRVSEERSSDADWVTEKLNQ--ISYEDHKLKSGTNTK 708 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK--- 352 A++ Y ++ ++ + II +TDG +N V I+ Sbjct: 709 KALQAVYSMM----------SWPGDAPPEGWNRTRHVIIIMTDGLHNMGGDPVTVIEDIR 758 Query: 353 -ICDKAKEN--------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + D ++ + + + P L + ++ ++ + V + + L +VF Sbjct: 759 DLLDIGRDRKNPREDYLDVYVFGVGPLVDPVNINALASKKNNEQHVFKVKDMEDLENVF 817 >gi|46237594|emb|CAE83972.1| B-factor, properdin [Rattus norvegicus] gi|56268879|gb|AAH87089.1| Complement factor B [Rattus norvegicus] Length = 761 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 84/239 (35%), Gaps = 45/239 (18%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 GE+ ++++D SGSM+ M D D A K L ++ + Sbjct: 249 SPGEQQK--RKIILDPSGSMNIYMVLDGSDSIGASNFTG--------AKRCLANLIEKVA 298 Query: 244 LLSHVKEDVYMGLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 GL+ Y T E++ + W TEK+ Q D + T++ Sbjct: 299 SYGVKPRY---GLVTYATVPKVLVRVSEERSSDADWVTEKLNQ--ISYEDHKLKSGTNTK 353 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK--- 352 A++ Y ++ ++ + II +TDG +N V I+ Sbjct: 354 KALQAVYSMM----------SWPGDAPPEGWNRTRHVIIIMTDGLHNMGGDPVTVIEDIR 403 Query: 353 -ICDKAKEN--------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + D ++ + + + P L + ++ ++ + V + + L +VF Sbjct: 404 DLLDIGRDRKNPREDYLDVYVFGVGPLVDPVNINALASKKNNEQHVFKVKDMEDLENVF 462 >gi|255531385|ref|YP_003091757.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344369|gb|ACU03695.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 332 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 39/239 (16%), Positives = 74/239 (30%), Gaps = 63/239 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I + D+SGSM D K ++ A KN ID + E Sbjct: 92 IDIVIATDISGSMLAE---------------DLKPNRLEAGKNIA------IDFIKGRPE 130 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 D +GL+ ++ + + + + + ++ T + A L + Sbjct: 131 D-RIGLVIFSGESFTQCPLTIDHDVLINLFSDISNGMVEDGTAIGMGLATAVNRLKDSEA 189 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 +S K +I LTDG N T + AK+ +++ TI + Sbjct: 190 KS------------------KVVILLTDGSNTTGSIPPLTAA--EIAKQMKVRVYTIGVG 229 Query: 370 ------------------NASPN-GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + +L + ++ N + L ++Q I +L Sbjct: 230 TKGYAPYPVKTPFGTQYQQVPVTIDEGVLSKIAGITGGKYFRATNNEKLKEIYQQIDKL 288 >gi|159036783|ref|YP_001536036.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157915618|gb|ABV97045.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 319 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 74/237 (31%), Gaps = 51/237 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + VD+S SM D + ++ A K A F + Sbjct: 83 RERATVMVAVDVSTSMLAG---------------DVEPDRLTAAKEAARRF------VDG 121 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP-AMKQAYQILT 306 + ++ +GL+ + + P R+ + +D L+ T A+ +A Sbjct: 122 LPDEFNVGLVAFAGSAAVLVPPD----TDREALDEGIDRLVEGATGVQGTAIGEAINTSL 177 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K + K P I+ L+DG N + + A + + T Sbjct: 178 GAVKALDG-----EAAKDPPPAR---IVLLSDGANTSGMDPMEAAT---DAVAMDVPVHT 226 Query: 367 ISI--------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I+ +GQ L + + + +A L V+ +I + Sbjct: 227 IAFGTASGYVDRGGRPIQVPVDGQTLDEVARETGGQFHEADSAKELRAVYDDIGSSV 283 >gi|220908581|ref|YP_002483892.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219865192|gb|ACL45531.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 421 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 69/205 (33%), Gaps = 50/205 (24%) Query: 168 RYNQKIVSFIPALLRIEMGERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 R +Q+ + A + GER + L++D SGSM Sbjct: 18 RVSQRQLEISVAAIAQASGERNAPLNLGLILDHSGSMAGQ-------------------- 57 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT--RVEKNIEPSWGTEKVRQYVTRDM 284 + +K A +D + + +I + +V +P +K++ ++ Sbjct: 58 PLETVKRAAQKLVDRLLPSD------RLAVIVFDHVAKVLIPNQPVTDRDKIKTRISHLA 111 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T ++ L + K + I LTDGEN + Sbjct: 112 A---MGGTAIDEGLQLGLTELIAAKAGAISQ-----------------IFLLTDGENEHG 151 Query: 345 KSNVNTIKICDKAKENFIKIVTISI 369 +N +++ ++A + I + T+ Sbjct: 152 -NNSRCLQLAEEAAKENITLNTLGF 175 >gi|304406204|ref|ZP_07387861.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304344788|gb|EFM10625.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 762 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 77/229 (33%), Gaps = 46/229 (20%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 A + LV+D+S SM K+ +K+A F Sbjct: 341 ASFSKLTEKPQAISTNLVIDVSDSMSED-------------------NKLTKVKDAATQF 381 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 L S+ +GL+ ++ NI S T ++ + T A+ Sbjct: 382 LSHASFASNDV----VGLMSFSD--ASNIRQSDFTTEIESIKSSIAGMQTSGCTALYEAL 435 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN--NFKSNVNTIKICDK 356 QA S+ K+++ TDG+N + + V+ + + Sbjct: 436 NQAVS-----------------NTAYNSVEGSKYVVVFTDGKNTICDGTNWVSPSTVINN 478 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 A + + I I + + Q++ + ++ +YH + L ++ +I Sbjct: 479 ALQWGVPIYAIGVEEDADLQQIAEQ--TNGQYHVLGNDFTDLNAIYSDI 525 >gi|134093078|gb|ABO52938.1| matrilin 4 isoform 1 precursor [Colobus guereza] Length = 581 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 62/160 (38%), Gaps = 21/160 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L+ +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LVGLVRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYAMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 A+ I+I + + + G L+ S P E+ + V + Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVES 198 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 372 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 429 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 430 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 474 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 475 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLQNLRGSI 524 >gi|224282379|ref|ZP_03645701.1| hypothetical protein BbifN4_00972 [Bifidobacterium bifidum NCIMB 41171] Length = 1153 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 32/266 (12%), Positives = 72/266 (27%), Gaps = 76/266 (28%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + P I +V D SGSM M +++ K A+ Sbjct: 597 TQSGSSQTTVSPA-DIVVVFDTSGSMSNPMG---------------HNSRLEVAKTAVNS 640 Query: 238 FLDSIDLLSHVKED--VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + +D + M L+ ++T V + + V T+ Sbjct: 641 MAQHLLTSENQGKDSNIRMALVPFSTTVGNVSNFTDNAMDIVSAVNGLRAD---GGTN-- 695 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT----- 350 + +K+I+F++DG+ S+V T Sbjct: 696 ---------------WEAALKAANAKLTSGRKGVKKYIVFMSDGDPTFRTSSVRTGTDWW 740 Query: 351 -------------------------------IKICDKAKENFIKIVTISINASPNGQRLL 379 + + + + ++ +++ P ++ Sbjct: 741 GRPTYDDDDRRGLPAGVHGSGSSDQYGANLSSAVAEANRRGDATLFSVGVSSDPT--KMR 798 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNI 405 + +Y+ + D L F +I Sbjct: 799 GFADQTKGSYYSATSTDELNKAFADI 824 >gi|52138687|ref|NP_001004392.1| collagen alpha-1(XX) chain [Gallus gallus] gi|14280020|gb|AAK58847.1| collagen type XX alpha 1 precursor [Gallus gallus] Length = 1472 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 45/341 (13%), Positives = 106/341 (31%), Gaps = 38/341 (11%) Query: 47 HYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL 106 + + + + + A+ A ++ +L + + ++ A + + ++SL Sbjct: 89 YTLQVYVLNGSQEALFAKRKFVIEDLKN-ASQTRNNRRNSGAAPGKNLTSVGSSTAEQSL 147 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS-NNTIFYNMDVMTSYDYRLQFIEHLL 165 +A + + +TH A +R + + T + Sbjct: 148 GMETAPPPLSTV----------LTHPAKDRAEKKRHKGTQPKGSGETMRSQPSVGVTQTP 197 Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + E+P + + C ++ + V R Sbjct: 198 PPTTKSSQRATANTEREPPGKEKPTRDS---LRRGSQLQCDTSAMTDIVLLVDGSWSIGR 254 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K L + ++ ++ + +GL Y++ + S R V + Sbjct: 255 SNFKLIKEFLSALISPFNIA---QDKIRVGLSQYSSDPRTEWDLS--AYATRDQVLEAVR 309 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 +L K ++ + LT +++ + L +K +I LTDG+ + Sbjct: 310 NLRYKGGNTFTGL-----ALTHVLEQNLKPDAGA------RLEAEKLVILLTDGK---SQ 355 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 + N K I+I I + + LK S P Sbjct: 356 DDANLAAQ--TLKNMGIEIFAIGVK--NADEAELKQVASEP 392 >gi|156382663|ref|XP_001632672.1| predicted protein [Nematostella vectensis] gi|156219731|gb|EDO40609.1| predicted protein [Nematostella vectensis] Length = 1235 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 75/226 (33%), Gaps = 52/226 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV-----K 249 +V+D S SM + K+RT++ K AL L ++ V Sbjct: 287 IVLDTSSSMASRVELGT-----------KRRTRLDVAKAALSTILSTLLPQDKVGVVLFN 335 Query: 250 EDVYMGLIG-----YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 V + Y+TR+ ++ ++ R S T A K A+ + Sbjct: 336 SKVTLAGSSGVDECYSTRLAPAGRF--NVNYLKDFINR---SRPGGGTQYQNAFKAAFTL 390 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN--------VNTIKICDK 356 L S K Q F++FLTDG K + K ++ Sbjct: 391 LKSAKSGDGGGE-------------QSFLLFLTDG---GPKDDALEVERLIAQNKKEMEE 434 Query: 357 AKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHV 401 ++E + I+TI + + + L + + + V N + Sbjct: 435 SRER-VTIMTIGLGKDEHMKDFLGRLSKNVGSKYSQVDNEAHMYSA 479 >gi|32394646|gb|AAM62130.1| a11 integrin [Mus musculus] Length = 1188 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGIVQYGEDAVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIRQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 +++ + +++ +K S P+ + +NV + +L + + Sbjct: 288 EKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|121582838|ref|YP_973280.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120596100|gb|ABM39538.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 240 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 43/249 (17%), Positives = 87/249 (34%), Gaps = 42/249 (16%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVV-DLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + L+ + L +V+ D+SGSM + K+ Sbjct: 7 DSAKTTPAATRLKAFAAPQARPLPVIVLADVSGSMS-------------------ENGKI 47 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL- 287 AL AL + S S ++ ++ +GLI + R P V V +++ Sbjct: 48 DALNVALKEMILSFGKESGLRAEIQVGLITFGGREAHEHLPL-----VAAKVIGGVEAFK 102 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKS 346 T A A + L DK+ +IPS ++ +I ++DG + +++ Sbjct: 103 ANGGTPMGSAFALA-RKLLEDKE------------QIPSRAYRPVLILVSDGAPTDAWEA 149 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + +K + ++I A + L + + A + F+ ++ Sbjct: 150 PLADLK--ASERGQKATRFAMAIGADADLDMLAQFPNDREAPVFKTHEARDIGRFFRAVT 207 Query: 407 QLMVHRKYS 415 +V R S Sbjct: 208 MSVVSRSTS 216 >gi|327274978|ref|XP_003222251.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2025 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 54/357 (15%), Positives = 118/357 (33%), Gaps = 57/357 (15%) Query: 54 ESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 ++ + + +K S S + E +S A RF+ + K + +V Sbjct: 671 HFSDTSKEVFSLNKNTSKKSDIIQAVEDMSLIGSTTNTGGALRFVSKYFKLAKQARPSV- 729 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ------ 167 N + I + L + I Y++ V + +L+ I + Sbjct: 730 -NKILVLITDGEASDEVTAPATELRNDG-IIIYSVGVFNANKTQLEEISGKPEKVFYVEN 787 Query: 168 -----RYNQKIVSFIPALLRIEMGERP-IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 I+ I + + E +R + + V+D SGS+ Sbjct: 788 FDILEDIKGDIIFGICSPYKPEDCKRANLLDVVFVIDSSGSIG----------------- 830 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 A +K+ + +D ++ V G + Y+ + + + + Sbjct: 831 ---PYNYAIMKDFV---IDIVNKSDVGNNRVQFGAVKYSAYPQILFNLNGNKADIIDKI- 883 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 D+L+ T + A++ A L ++ K S Q + + +TDG Sbjct: 884 -KGDTLLNDTTYTAEALRHAEN-LFTESKGSRKRRGVPQLLMV-----------ITDG-T 929 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 ++ K ++ + + + + I I I I + L+ + ++ + V D L Sbjct: 930 SHDKDKLDAVST--RIRNDGITIYAIGIK--DAKREELEIIAENKDHVFFVDTFDGL 982 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 81/232 (34%), Gaps = 47/232 (20%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + + E + I +VD SGS+ D KM + Sbjct: 612 IVQDICSEEACKEMKADIMFLVDSSGSIG-----------------DDNFKKMKTFIKEV 654 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDS 294 ++ + V +G++ ++ ++ + T K + D SLI T++ Sbjct: 655 ------VNRSYIGVDQVQVGVVHFSDTSKEVFSLNKNTSKKSDIIQAVEDMSLIGSTTNT 708 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ F + + + K K ++ +TDGE ++ + T Sbjct: 709 GGALR--------------FVSKYFKLAKQARPSVNKILVLITDGEASDEVTAPAT---- 750 Query: 355 DKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + I I ++ NA+ L+ PE + V N D L + +I Sbjct: 751 -ELRNDGIIIYSVGVFNANKTQ---LEEISGKPEKVFYVENFDILEDIKGDI 798 >gi|304437812|ref|ZP_07397761.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304369169|gb|EFM22845.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 255 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 66/199 (33%), Gaps = 24/199 (12%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR-------TKMAALKNALLLFLDS 241 P + L +D SGSM K +++ L+ + +F ++ Sbjct: 18 PRVPVCLCLDTSGSMAAVEAGSYVGTGETIRQDGKLWQIVEGGKSRIQELQKGIEMFFEA 77 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I + + ++ + + ++ + RQ V + + T + A Sbjct: 78 IRTDILAADSAEISIVTFDNEAKCLLDFA---NIERQTVPQLHAN---GLTAMGEGVNLA 131 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN--TIKICDKAKE 359 +L KK +Q +++ ++DGE N + + T ++ + Sbjct: 132 LDLLAQRKKEYQD---------KGVDYYQPWLVLMSDGEPNGDPTELRRATQRVTELVNA 182 Query: 360 NFIKIVTISINASPNGQRL 378 + + I I + P L Sbjct: 183 GKLTVFPIGIGSEPGMDAL 201 >gi|301606205|ref|XP_002932732.1| PREDICTED: collagen alpha-1(XXI) chain-like [Xenopus (Silurana) tropicalis] Length = 1058 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 51/158 (32%), Gaps = 20/158 (12%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP- 291 A L+ +G++ Y+ + I T +Q ++ L Sbjct: 78 TAKNWLLNITSSFDIGPSYTQVGVVQYSDFPQLEIPLGHNTS-YQQLLSALKSIKYLGGN 136 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T++ A+K A + + KR + K I +TDG KS N + Sbjct: 137 TNTGRAIKFATEEVFPTSKRLN-------------VSKNKIAIVITDG-----KSQDNVV 178 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 I A+ I + + + + L+ ++ Sbjct: 179 NISSSARAQGIILFAVGVGSEITKSELVAIANMPSTHY 216 >gi|313792199|gb|EFS40300.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA1] gi|314984000|gb|EFT28092.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA1] Length = 320 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 78/241 (32%), Gaps = 51/241 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + +D+S SM + D + ++++A K A FL + Sbjct: 86 RDRATVVVAIDVSRSM---------------VATDVEPSRLSAAKTAAKDFLGDL---PP 127 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 L+ + + + P+ V +T +L T + Y L Sbjct: 128 RFNV---SLVKFAASAQVVVPPTPDRAAVSTAITNLQ---VLPSTAIGEGI---YSSL-- 176 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + P I+ L+DG N + ++ K +A + + TI Sbjct: 177 -NALKLVPDDPKHPGQKPPAA----IVLLSDGATNVGRPSLEAAK---EAGRQHVPVYTI 228 Query: 368 SIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + N L +S ++ + L V+++I+Q + + K Sbjct: 229 AYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEK 288 Query: 414 Y 414 Sbjct: 289 V 289 >gi|225703035|ref|NP_795896.4| integrin alpha-11 precursor [Mus musculus] Length = 1188 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGIVQYGEDAVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIRQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 +++ + +++ +K S P+ + +NV + +L + + Sbjct: 288 EKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|330995094|ref|ZP_08319011.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576670|gb|EGG58173.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 332 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 50/305 (16%), Positives = 83/305 (27%), Gaps = 74/305 (24%) Query: 134 NNRLDSSNNTIFYNMDVMTSYD--YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 ++T Y + + FI L I++ + + Sbjct: 29 KEPTLLVSDTTAYRYAPKSYKNHLIHAPFILRLFTFVMVVIILARPQTSNSWKNTQVEGI 88 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VD+S SM D K ++ A K F+ Sbjct: 89 DIMLAVDVSTSMLAE---------------DLKPNRLEAAKQVASEFI----AGRPNDN- 128 Query: 252 VYMGLIGYTTRVEKNIEPS-------WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +GL + + ++ +I T + A Sbjct: 129 --IGLTIFAAEAFTQCPMTTDHAVLLNLFHGIKT--DMAQRGMIQDGTAVGMGIANAVSR 184 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L K +S K II LTDG NN + T + AK I++ Sbjct: 185 LKDSKAKS------------------KVIILLTDGTNNAGDISPLTAA--EIAKSFGIRV 224 Query: 365 VTISI---------NASPNGQRLL--------KTCVS----SPEYHYNVVNADSLIHVFQ 403 TI + G + L KT + + Y + L V++ Sbjct: 225 YTIGVGTNGLAPYPMPVAGGVQYLNVPVEIDTKTLAAIAGKTDGEFYRATDNKKLEDVYK 284 Query: 404 NISQL 408 +I +L Sbjct: 285 DIDKL 289 >gi|311105413|ref|YP_003978266.1| hemolysin-type calcium-binding repeat family protein 3 [Achromobacter xylosoxidans A8] gi|310760102|gb|ADP15551.1| hemolysin-type calcium-binding repeat family protein 3 [Achromobacter xylosoxidans A8] Length = 2061 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 53/165 (32%), Gaps = 27/165 (16%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + I LV+DLSGSM S + T++ K AL L+ + L+ Sbjct: 1441 ASYNIALVLDLSGSMDYYWGSG-----------SNQETRLETAKKALKSLLE--NQLATH 1487 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + LI + K + G V V + T A + Sbjct: 1488 DGTINVSLITFADSSSKLQKAISGLTPDNVDDMVNILLGLKAGGGTPYGAAFNETKSWFD 1547 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNT 350 N ++ FLTDGE ++ + N +T Sbjct: 1548 GQPTEDANGN-----------AYKNLTFFLTDGEPSSEYWYNRDT 1581 >gi|48428278|sp|P61622|ITA11_MOUSE RecName: Full=Integrin alpha-11; Flags: Precursor gi|35193068|gb|AAH58716.1| Integrin alpha 11 [Mus musculus] Length = 1188 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 59/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y V+ V T++ Sbjct: 185 LINILKKFYIGPGQIQVGIVQYGEDAVHEFHL-NDYRSVKDVVEAASHIEQRGGTET--- 240 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 241 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIRQS 287 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 +++ + +++ +K S P+ + +NV + +L + + Sbjct: 288 EKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 347 Query: 408 LM 409 + Sbjct: 348 RI 349 >gi|242074986|ref|XP_002447429.1| hypothetical protein SORBIDRAFT_06g000920 [Sorghum bicolor] gi|241938612|gb|EES11757.1| hypothetical protein SORBIDRAFT_06g000920 [Sorghum bicolor] Length = 519 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 28/224 (12%), Positives = 63/224 (28%), Gaps = 54/224 (24%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 +R + V+D+SGSM K+ +K A+ + + Sbjct: 55 STSDRSGLDLVAVLDVSGSMQGE--------------------KIDKMKTAMKFVVKKLS 94 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAY 302 + + ++ + + T+ + + +D+L T+ T ++ Sbjct: 95 SID------RLSIVTFMDTATRICPLRQVTDASQPELLGLIDALNPGGNTNITDGLQTGL 148 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++L S ++ ++DG+ N + K Sbjct: 149 KVLADRNLSSGRVVG---------------VMLMSDGQQNRGGNA-------ADVKIGNA 186 Query: 363 KIVTISINA---SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + T A + + S V + D L F Sbjct: 187 PVYTFGFGADYDPTVLNAVARN--SMGGTFSVVNDVDKLSMAFS 228 >gi|225352478|ref|ZP_03743501.1| hypothetical protein BIFPSEUDO_04100 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156985|gb|EEG70354.1| hypothetical protein BIFPSEUDO_04100 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 810 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 47/148 (31%), Gaps = 37/148 (25%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV+D SGSM N D N A L + + ++ Sbjct: 97 DIVLVLDKSGSMK-NSNRDTNAKN--------------AATALASKLLTAANAALPAEQQ 141 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V M ++ ++ R + + + V+ + T+ A+K A Sbjct: 142 VQMAVVTFSDRARTTSQFTTSPGAIGTAVSAWPN----GGTNWEDALKTA---------- 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + QK I+FL+DG Sbjct: 188 --------NDLSSGRSGVQKHIVFLSDG 207 >gi|289640775|ref|ZP_06472946.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] gi|289509351|gb|EFD30279.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] Length = 319 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 46/247 (18%), Positives = 84/247 (34%), Gaps = 54/247 (21%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 PA R ER I L +D+S SM D ++AA K+ Sbjct: 76 PAQARRVPRERA--TIILAIDVSNSMAA---------------TDVAPNRLAAAKDGADA 118 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTP 296 F+D + + +GL+ ++ + P+ RQ V + L L P T Sbjct: 119 FIDQL------PPRINLGLVSFSGSAALLVPPT----TDRQSVRSGIHGLQLGPSTAIGE 168 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 + Q +T+ + Q P I+ L+DGE + N + Sbjct: 169 GIFAGLQAITTAGE---------QLAADGGTPPPAAIVLLSDGETQRGRPNAQAAQ---A 216 Query: 357 AKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 A++ + + TI+ N L + ++ ++ + D L V+ Sbjct: 217 ARDAGVPVDTIAYGTADGSLDVGGQEIPVPVNEDALREIARATDGSYHRAASGDELRSVY 276 Query: 403 QNISQLM 409 +N+ + Sbjct: 277 ENLGSSI 283 >gi|299140484|ref|ZP_07033622.1| BatA protein [Prevotella oris C735] gi|298577450|gb|EFI49318.1| BatA protein [Prevotella oris C735] Length = 332 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 49/246 (19%), Positives = 76/246 (30%), Gaps = 72/246 (29%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D K ++ A KN F+ D + Sbjct: 88 IDIMLAMDVSTSMLAE---------------DLKPNRLEAAKNVASEFI--ADRPNDN-- 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVR-----QYVTRDMDS--LILKPTDSTPAMKQAYQ 303 +GL + + T+ V Q V D+ + LI T + A Sbjct: 129 ---IGLTIFAGEAFTQCPMT--TDHVSLINLLQSVRTDIAARGLISDGTAVGMGLANAVS 183 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K +S K +I LTDG NN + T A+ I+ Sbjct: 184 RLKDSKAKS------------------KVVILLTDGSNNMGDISPMTSAQ--IARSFGIR 223 Query: 364 IVTISI----NAS-----------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + TI I A + + L ++ Y N L ++ Sbjct: 224 VYTIGIGTNKVAPYPMPVAGGIQYVNIPVEIDSKTLKDIAATTEGNFYRATNNRQLKQIY 283 Query: 403 QNISQL 408 ++I QL Sbjct: 284 KDIDQL 289 >gi|326923641|ref|XP_003208043.1| PREDICTED: anthrax toxin receptor 1-like [Meleagris gallopavo] Length = 597 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 65/206 (31%), Gaps = 43/206 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 F + ++D SGS+ T++ + +L +S Sbjct: 84 HGAFDLYFILDKSGSV------------------RNHWTEIYSFVESLAE-----KFISP 120 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + M I +++R ++ + E +R+ + + T K+A Sbjct: 121 M---LRMSFIVFSSRGTTIMKLTENREAIRRGLEILQHEVPGGDTFMHEGFKRA------ 171 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 N + II LTDGE + + + ++A+ + + Sbjct: 172 --------NEQIYHETYGGVRTASVIIALTDGELQDVQF-YYAEQEANRARSFGAIVYCV 222 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVV 393 + Q L T S ++ + V Sbjct: 223 GVKDFNETQ--LSTIADSIDHVFPVT 246 >gi|301784737|ref|XP_002927782.1| PREDICTED: collagen alpha-5(VI) chain-like [Ailuropoda melanoleuca] Length = 2524 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 50/348 (14%), Positives = 114/348 (32%), Gaps = 56/348 (16%) Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNH----------IKESLSGYSAVFYNTEIQNIV 122 + + D + +S K L +A + + IK+ L + +++ Sbjct: 689 NEISDAIDRMSLIDKNTLTGNALISVDQYFTPAKGARIGIKKFLILITDGEAQDAVRDPA 748 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + R + + + +N T + + ++++ + L + K++ + AL Sbjct: 749 KALRDKGVVIFSVGVYGANRTQLEEISGDGNLVFQVESFDDL--KAIESKLIFRVCAL-- 804 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + I V+D SGS+ +++ M AL + + Sbjct: 805 HDCKNIKVLDIVFVLDHSGSI-----------------NTQEQENMMALT------IHLV 841 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + V +G + Y+ E G R V + D+ Y Sbjct: 842 KKADVGSDRVRIGALKYSDYPEILFHL--GKYSNRSSVIEHLRRRRSTGGDT-------Y 892 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-ICDKAKENF 361 D FT + ++ ++ +I +TDG + N + K + Sbjct: 893 TARALDHTNMMFTEEYGSRIQQN---VKQMLIVITDG----VSHDRNLLNETALKLRNKG 945 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I I + + Q L+ + ++V N + L ++ + + M Sbjct: 946 IDIYAVGVG--QADQLELEAMAGNKSKTFHVDNFNKLKDIYLPLQESM 991 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 47/342 (13%), Positives = 108/342 (31%), Gaps = 58/342 (16%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF--------YNTEIQNI 121 SN L + +I +A F + IKE ++ ++ Sbjct: 500 SNNVNLKNAILNIEQLQGNTYTGEALNFTLSIIKEDRKLRTSQVPCYLIVLTDGRSTDDV 559 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 + + I + + + R+ F ++ + R + V + + Sbjct: 560 LEPAERLRAEQVTIHAVGIGEAIKVQLQQIAGGEERVSFGQNFDSLRSIKNEV--VHRIC 617 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + E I +VD SGS+ + KM +L Sbjct: 618 TEKGCEDMKADIMFLVDSSGSIGL-----------------ENFGKMKTFMKSL------ 654 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMK 299 + + + ++G+I ++ + + + + ++ +D SLI K T + A+ Sbjct: 655 LAKIQIGPDRTHIGVIQFSDKTREEFQL--NKYFTQNEISDAIDRMSLIDKNTLTGNAL- 711 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + + K + +KF+I +TDGE + + ++ Sbjct: 712 -------------ISVDQYFTPAKGARIGIKKFLILITDGEAQDAVRDPA-----KALRD 753 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + I ++ + + Q L+ + V + D L + Sbjct: 754 KGVVIFSVGVYGANRTQ--LEEISGDGNLVFQVESFDDLKAI 793 Score = 36.8 bits (83), Expect = 7.3, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 74/219 (33%), Gaps = 30/219 (13%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 ++ +D +G + ++ + Q K ++ A+ + V Sbjct: 427 QMDLDRTGCIDTKEADIYFLIDGSTSIQGKHFEQIKEFMLAVTGMFSI------GPDKVQ 480 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +G + Y+ ++ + + V + T + A+ I+ D+K Sbjct: 481 VGAVQYSDKMRVEFYINDNSNNVNLKNAILNIEQLQGNTYTGEALNFTLSIIKEDRK--- 537 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 ++ +P ++I LTDG + + ++ + + I + I + Sbjct: 538 --------LRTSQVPC--YLIVLTDGRSTDDVLEPA-----ERLRAEQVTIHAVGIGEAI 582 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 Q L+ E N DSL ++I +VHR Sbjct: 583 KVQ--LQQIAGGEERVSFGQNFDSL----RSIKNEVVHR 615 >gi|281423276|ref|ZP_06254189.1| BatA protein [Prevotella oris F0302] gi|281402612|gb|EFB33443.1| BatA protein [Prevotella oris F0302] Length = 332 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 49/246 (19%), Positives = 76/246 (30%), Gaps = 72/246 (29%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D K ++ A KN F+ D + Sbjct: 88 IDIMLAMDVSTSMLAE---------------DLKPNRLEAAKNVASEFI--ADRPNDN-- 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVR-----QYVTRDMDS--LILKPTDSTPAMKQAYQ 303 +GL + + T+ V Q V D+ + LI T + A Sbjct: 129 ---IGLTIFAGEAFTQCPMT--TDHVSLINLLQSVRTDIAARGLISDGTAVGMGLANAVS 183 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K +S K +I LTDG NN + T A+ I+ Sbjct: 184 RLKDSKAKS------------------KVVILLTDGSNNMGDISPMTSAQ--IARSFGIR 223 Query: 364 IVTISI----NAS-----------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + TI I A + + L ++ Y N L ++ Sbjct: 224 VYTIGIGTNKVAPYPMPVAGGIQYVNIPVEIDSKTLKDIAATTEGNFYRATNNRQLKQIY 283 Query: 403 QNISQL 408 ++I QL Sbjct: 284 KDIDQL 289 >gi|156742542|ref|YP_001432671.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233870|gb|ABU58653.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 547 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 78/220 (35%), Gaps = 43/220 (19%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R I LVVD+SGSM K+ A K L FL I Sbjct: 369 RKRADILLVVDVSGSMEGD--------------------KLEAAKAGLGTFLSRILPED- 407 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +GL+ ++T + P+ + R D++ + A+ Y L Sbjct: 408 -----RVGLVTFSTESRLVVPPA----PLSDTRIRLDDAIAVMRAQGRTAL---YDALID 455 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ + I+ L+DG +N+ ++ + +++ +E+ I I + Sbjct: 456 GKEA------LDSLPSTGDDRI-RAIVLLSDGLDNSSRATLEQVRL--AFEESGISIFPV 506 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A L+ + +A + +F+N+S+ Sbjct: 507 AYGA-DADTDALQQIATFSRTILVQGDAGDIGQIFENLSR 545 >gi|118355471|ref|XP_001010995.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89292762|gb|EAR90750.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 787 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 77/218 (35%), Gaps = 45/218 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP + V+D SGSM K+ +KN + + L+ Sbjct: 239 RPNLDLICVIDNSGSMDGE--------------------KIENVKNTI------LQLIDM 272 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILT 306 + + + +I + + ++ + ++ + + S+ T+ T ++ A+ IL Sbjct: 273 LNDHDRLSIITFNSYAKQLCGLRKVNKDNKENLQKITKSIQADGGTNITSGLQTAFSILQ 332 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI--KICDKAKENFIKI 364 + K+R+ ++ + L+DG++NN S + + + +E I Sbjct: 333 NRKQRNSVSS----------------VFLLSDGQDNNSDSRIRNLLQTTYQQLQEECFTI 376 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + +G + + Y V D + F Sbjct: 377 HSFGFGNDHDGPLMQRIAQIKDGSFYYVERNDQVDEFF 414 >gi|313681552|ref|YP_004059290.1| von willebrand factor type a [Sulfuricurvum kujiense DSM 16994] gi|313154412|gb|ADR33090.1| von Willebrand factor type A [Sulfuricurvum kujiense DSM 16994] Length = 311 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 71/224 (31%), Gaps = 39/224 (17%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + R I L +D SGSM+ + S E + ++ ++ F+ + Sbjct: 83 DPRNRNGIDIVLSLDGSGSMNASGFSKEEP----------RLSRFEVVQKIASDFV--MK 130 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + +G++ + ++ E V + + + + T + + Sbjct: 131 RIEDN-----VGVVLFGDFAFIATPVTYEKEIVSEMIGYLSHGMAGQNTAIGEGIAMGVR 185 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K +S K II LTDGE+N+ + + I+ Sbjct: 186 ALRDSKAKS------------------KVIILLTDGEHNSGSISPKEAV--AMVGKEHIR 225 Query: 364 IVTISI-NASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNI 405 + TI I LLK + N L V+ I Sbjct: 226 LYTIGIGQKGEFDNALLKQLAHDGHGKFFAAANEKELQSVYDEI 269 >gi|156616286|ref|NP_001096077.1| collagen alpha-6(VI) chain isoform 1 [Mus musculus] gi|189082903|sp|Q8C6K9|CO6A6_MOUSE RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2265 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 52/352 (14%), Positives = 102/352 (28%), Gaps = 64/352 (18%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRF---IKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 SN LG E+I A F + K+ + S Sbjct: 493 SNKPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAKKERGSKVPCHLVVLTNGMSRDSV 552 Query: 127 ISMTHMANNR--------LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + H + +N T + Y + + L N R + Sbjct: 553 LGPAHKLREENIRVHAIGVKEANQTQLREIAGEEKRVYYVHEFDALRNIRN-----QVVQ 607 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + E I +VD SGS+ + +KM K + Sbjct: 608 EICAEEACRDMKADIMFLVDSSGSIG-----------------PENFSKM---KMFMKNL 647 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + V +G++ ++ ++ + T + + +D + Sbjct: 648 VSKSQI---GADRVQIGVVQFSHENKEEFQL--NTFMSQSDIANAIDRMTHIG------- 695 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 T F + + K +KF+I +TDGE + + + Sbjct: 696 -----ETTLTGSALTFVSQYFSPDKGARPNVRKFLILITDGEAQDIVRDPAI-----ALR 745 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + I ++ + S Q L+ PE + V N + Q+I +V Sbjct: 746 KEGVIIYSVGVFGSNVTQ--LEEISGKPEMVFYVEN----FDILQHIEDDLV 791 >gi|148680074|gb|EDL12021.1| mCG3350, isoform CRA_a [Mus musculus] Length = 513 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 57/341 (16%), Positives = 120/341 (35%), Gaps = 65/341 (19%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + +KN++E+ + + G + +V N R + +K L + FI + Sbjct: 170 VDQPFYMSRKNTIEATRCSTRITG-TNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPD 228 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQ 159 I+ + G S +F + + + A N + N +++ ++ Sbjct: 229 KIQTA--GASIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDV---------IK 277 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 N + + P + R + + LV+D SGSM Sbjct: 278 TSADFQNAPPMRGTEAPPPPTFSLLKSRRRV--VCLVLDKSGSMD--------------- 320 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQ 278 K ++ + A L+L I V+++ +GL+ + + +N + Q Sbjct: 321 ----KEDRLIRMNQAAELYLTQI-----VEKESMVGLVTFDSAAHIQNYLIKITSSSDYQ 371 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +T ++ T ++ +Q +TS + + + I+ LTD Sbjct: 372 KITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE----------------IVLLTD 415 Query: 339 GENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRL 378 GE+N +S C +A + I TI++ + L Sbjct: 416 GEDNGIRS-------CFEAVSRSGAIIHTIAL-GPSAAREL 448 >gi|170727657|ref|YP_001761683.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169813004|gb|ACA87588.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 640 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 76/236 (32%), Gaps = 52/236 (22%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 E + + ++D+SGSM K+ LKNAL + +D Sbjct: 245 EKADLGASQLVFLLDVSGSMSSQ-------------------DKLPLLKNALKMLSQQLD 285 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + VY G G V + T + Q + + T+ ++ AYQ Sbjct: 286 EGDRISIVVYAGASG----VVLDGVKGNDTLAISQALDKLKAG---GSTNGGAGIELAYQ 338 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-ICDKAKENFI 362 ++ + +I TDG+ N S+ ++ + ++ ++ I Sbjct: 339 -------------LAQKHFIAGGVNR---VILATDGDFNVGVSDQQALEDMIEEKRKQGI 382 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---------DSLIHVFQNISQLM 409 + T+ L++ HY ++ D + I++ + Sbjct: 383 ALTTLGFGQGNYNDHLMEQLADKGNGHYAYIDTLNEARKVLVDEISATLLTIAKDV 438 >gi|108799422|ref|YP_639619.1| hypothetical protein Mmcs_2455 [Mycobacterium sp. MCS] gi|119868535|ref|YP_938487.1| hypothetical protein Mkms_2500 [Mycobacterium sp. KMS] gi|126435076|ref|YP_001070767.1| hypothetical protein Mjls_2492 [Mycobacterium sp. JLS] gi|122976988|sp|Q1B971|Y2455_MYCSS RecName: Full=UPF0353 protein Mmcs_2455 gi|166987492|sp|A3PZE9|Y2492_MYCSJ RecName: Full=UPF0353 protein Mjls_2492 gi|166987495|sp|A1UFT9|Y2500_MYCSK RecName: Full=UPF0353 protein Mkms_2500 gi|108769841|gb|ABG08563.1| von Willebrand factor, type A [Mycobacterium sp. MCS] gi|119694624|gb|ABL91697.1| von Willebrand factor, type A [Mycobacterium sp. KMS] gi|126234876|gb|ABN98276.1| von Willebrand factor, type A [Mycobacterium sp. JLS] Length = 335 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 74/234 (31%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D +++AA + A S + + Sbjct: 99 VMLVIDVSQSM---------------RATDVSPSRLAAAQEA------SKQFADELTPGI 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ E + + + + T + + A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNREATKTAIDKLQLADR---TATGEGIFTALQAIATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI---KICDKAKENFIKIVTISI 369 + P I+ +DG+ + N AK+ + I TIS Sbjct: 194 ---------IGGGDEPPPARIVLFSDGKETVPSNPDNPKGAFTAARTAKDQGVPISTISF 244 Query: 370 NAS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P ++LK S + + + L V+ N+ Q + Sbjct: 245 GTPYGYVEINEQRQPVPVDDQMLKKIADLSEGEAFTASSLEQLREVYANLQQQI 298 >gi|16358975|gb|AAH10260.1| Clca1 protein [Mus musculus] Length = 513 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 57/341 (16%), Positives = 120/341 (35%), Gaps = 65/341 (19%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + +KN++E+ + + G + +V N R + +K L + FI + Sbjct: 170 VDQPFYMSRKNTIEATRCSTRITG-TNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPD 228 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQ 159 I+ + G S +F + + + A N + N +++ ++ Sbjct: 229 KIQTA--GASIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDV---------IK 277 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 N + + P + R + + LV+D SGSM Sbjct: 278 TSADFQNAPPMRGTEAPPPPTFSLLKSRRRV--VCLVLDKSGSMD--------------- 320 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQ 278 K ++ + A L+L I V+++ +GL+ + + +N + Q Sbjct: 321 ----KEDRLIRMNQAAELYLTQI-----VEKESMVGLVTFDSAAHIQNYLIKITSSSDYQ 371 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +T ++ T ++ +Q +TS + + + I+ LTD Sbjct: 372 KITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE----------------IVLLTD 415 Query: 339 GENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRL 378 GE+N +S C +A + I TI++ + L Sbjct: 416 GEDNGIRS-------CFEAVSRSGAIIHTIAL-GPSAAREL 448 >gi|91789735|ref|YP_550687.1| von Willebrand factor, type A [Polaromonas sp. JS666] gi|91698960|gb|ABE45789.1| von Willebrand factor, type A [Polaromonas sp. JS666] Length = 346 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 44/256 (17%), Positives = 87/256 (33%), Gaps = 61/256 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+SGSM D + +++ A + A FL + Sbjct: 87 TIMLAIDVSGSM---------------RATDVQPSRLVAAQTAAKAFLTEL------PRT 125 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK-- 309 V +GL+ + + P+ E + + R + + T + + L D Sbjct: 126 VKVGLVAFAGSAQVAQIPTVNREDLVSAIDRFQ---LQRGTAIGNGIVMSLATLFPDAGI 182 Query: 310 -----------KRSFFTNFFRQGVK-----IPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 +R F + ++ K P II LTDG+ +++ K+ Sbjct: 183 DLQSMQSGRERQRGFAIDQEKKEAKEFTPVAPGSYNSAAIILLTDGQRTTGVDSLDAAKL 242 Query: 354 CDKAKENFIKIVTISINASPN--------------GQRLLKTCV-SSPEYHYNVVNADSL 398 A + +++ T+ I + LK ++ ++ A L Sbjct: 243 ---AADRGVRVYTVGIGTVDGETIGFEGWSMRVRLDEETLKGIARATQAEYFYAGTATDL 299 Query: 399 IHVFQNI-SQLMVHRK 413 V++ + S+L V +K Sbjct: 300 KKVYETLSSRLTVEKK 315 >gi|120602151|ref|YP_966551.1| von Willebrand factor type A [Desulfovibrio vulgaris DP4] gi|120562380|gb|ABM28124.1| von Willebrand factor, type A [Desulfovibrio vulgaris DP4] Length = 420 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 19/79 (24%), Positives = 36/79 (45%), Gaps = 5/79 (6%) Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKTCVSSP----EYHY 390 +T N K N ++ K KE I++ I S + L+K+ SS +++Y Sbjct: 336 MTSHCENGGKLNAAMLEEARKVKEAGIEVFAIRFGDSDSVDVSLMKSIASSKAGTNDHYY 395 Query: 391 NVVNADSLIHVFQNISQLM 409 + +A + VF+ I + + Sbjct: 396 DAPSAYDIDDVFKKIGRQL 414 >gi|310817544|ref|YP_003949902.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309390616|gb|ADO68075.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 470 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 69/225 (30%), Gaps = 48/225 (21%) Query: 184 EMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ER + LV+D SGSM + K+ K A F+ Sbjct: 87 APQERRRVPVNVALVIDRSGSM--------------------RGQKLDDAKRAAREFITR 126 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQ 300 + + L+ Y T V + T + R+ + ++++ T+ + ++ Sbjct: 127 VSEED------RVALVHYGTDV-TVFPSTLATPETREQMLTFVNAIEDEGSTNISGGLEA 179 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKE 359 A Q L + + + II L+DG+ + + Sbjct: 180 AAQQLQKNADQFRVSR----------------IILLSDGQPTAGLTREEQLTALARNLRS 223 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + + + + L++ + +D L VF Sbjct: 224 QGMAVSALGV-GEDFNENLMQGIADQGGGFSGFLRSDQLAEVFTR 267 >gi|46199004|ref|YP_004671.1| hypothetical protein TTC0696 [Thermus thermophilus HB27] gi|46196628|gb|AAS81044.1| hypothetical membrane associated protein [Thermus thermophilus HB27] Length = 706 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 76/235 (32%), Gaps = 52/235 (22%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 +P L ++ R + LV+D+SGSM L A+ Sbjct: 290 LPEELPLKPLGREGAALVLVLDVSGSMAG-----------------------EKLSMAVA 326 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDST 295 L ++ + +G++ +++ P T + ++ + SL T Sbjct: 327 GALALVESAAPED---RLGVVVFSSGHRVLFPPRPMTAQAKKEAESLLLSLRAGGGTVLG 383 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A ++A ++L +K ++ LTDG + K I D Sbjct: 384 GAFREAVRLLHGVPGE------------------RKAVLVLTDGLIADAKE-----PILD 420 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLM 409 A+ + +++ +++ LK Y + L +F Q + Sbjct: 421 LAQTSGVEVSALAL-GPDADAPFLKELARRGGGRFYQAPSPRELPRLFLREGQEV 474 >gi|148680075|gb|EDL12022.1| mCG3350, isoform CRA_b [Mus musculus] Length = 527 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 57/341 (16%), Positives = 120/341 (35%), Gaps = 65/341 (19%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + +KN++E+ + + G + +V N R + +K L + FI + Sbjct: 184 VDQPFYMSRKNTIEATRCSTRITG-TNVVHNCERGNCVTRACRRDSKTRLYEPKCTFIPD 242 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQ 159 I+ + G S +F + + + A N + N +++ ++ Sbjct: 243 KIQTA--GASIMFMQNLNSVVEFCTEKNHNAEAPNLQNKMCNRRSTWDV---------IK 291 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 N + + P + R + + LV+D SGSM Sbjct: 292 TSADFQNAPPMRGTEAPPPPTFSLLKSRRRV--VCLVLDKSGSMD--------------- 334 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQ 278 K ++ + A L+L I V+++ +GL+ + + +N + Q Sbjct: 335 ----KEDRLIRMNQAAELYLTQI-----VEKESMVGLVTFDSAAHIQNYLIKITSSSDYQ 385 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +T ++ T ++ +Q +TS + + + I+ LTD Sbjct: 386 KITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE----------------IVLLTD 429 Query: 339 GENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRL 378 GE+N +S C +A + I TI++ + L Sbjct: 430 GEDNGIRS-------CFEAVSRSGAIIHTIAL-GPSAAREL 462 >gi|115373770|ref|ZP_01461063.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115369169|gb|EAU68111.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 420 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 69/225 (30%), Gaps = 48/225 (21%) Query: 184 EMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ER + LV+D SGSM + K+ K A F+ Sbjct: 37 APQERRRVPVNVALVIDRSGSM--------------------RGQKLDDAKRAAREFITR 76 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQ 300 + + L+ Y T V + T + R+ + ++++ T+ + ++ Sbjct: 77 VSEED------RVALVHYGTDV-TVFPSTLATPETREQMLTFVNAIEDEGSTNISGGLEA 129 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKE 359 A Q L + + + II L+DG+ + + Sbjct: 130 AAQQLQKNADQFRVSR----------------IILLSDGQPTAGLTREEQLTALARNLRS 173 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + + + + L++ + +D L VF Sbjct: 174 QGMAVSALGV-GEDFNENLMQGIADQGGGFSGFLRSDQLAEVFTR 217 >gi|301627723|ref|XP_002943019.1| PREDICTED: complement C2-like [Xenopus (Silurana) tropicalis] Length = 678 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 46/292 (15%), Positives = 97/292 (33%), Gaps = 50/292 (17%) Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 T + + R + + Y D + QF L ++ + ++I+ Sbjct: 119 TCLESRRWSGTEISCQYPYSFDLPEDVQEQFKASLSGILNIKERSASFGRTIKIK--RDG 176 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 I + ++D S S+ ++ + + +D + Sbjct: 177 ILNVYFLLDASRSVG---EANFDIYKECSVY-----------------LVDELASFDM-- 214 Query: 250 EDVYMGLIGYTTR-------VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + G+I Y T ++N + +V + + D T+ A+++ Y Sbjct: 215 -TIQFGIISYATVPKVIIPIYDENSDNDAHVFEVIENDLKYSDHKDKTGTNIKTALEEVY 273 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD------K 356 + S +K ++ II LTDG+ N +TIK + K Sbjct: 274 N-MMSFQKETYKNESVWN-------SIHHIIILLTDGKANIGGRPADTIKHIEEFLDIKK 325 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSS---PEYHYNVVNADSLIHVFQNI 405 +E+++ + T I L S + + + +A+ + VFQ I Sbjct: 326 KREDYLDVYTFGI-GPDVDMADLSEIASKKDGESHVFRMESANEMKTVFQKI 376 >gi|126306106|ref|XP_001362488.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 911 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 43/234 (18%), Positives = 80/234 (34%), Gaps = 57/234 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL I +++ + Sbjct: 309 LVLDKSGSMAGE-------------------DRLNRLNQASQLFLLQI-----IEKGSWT 344 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + T+ R + + + T ++ A+ ++ K+ F Sbjct: 345 GMVTFDSSATIQSALIQIETDAQRNSLISRLPTAAGGGTSICSGLRTAFTVI----KKKF 400 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE++ TI C D+ K++ I T+++ Sbjct: 401 STHGSE-------------IVLLTDGEDS-------TISSCFDEVKQSGAIIHTVAL-GP 439 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADS---LIHVFQNISQ---LMVHRKYSVILKG 420 L K + + + LI F +S + R + KG Sbjct: 440 SADPGLEKLAEMTGGMKTSATDNAQNNGLIDAFSALSSGNGAITQRSIQLESKG 493 >gi|298370193|ref|ZP_06981509.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] gi|298281653|gb|EFI23142.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] Length = 1071 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 20/95 (21%), Positives = 40/95 (42%), Gaps = 8/95 (8%) Query: 333 IIFLTDGENNNFKS------NVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKTCVS 384 I F T G + KS + T K + ++ ++ TI + S G+ L S Sbjct: 244 IDFKTSGLDKAGKSWQGDPKDPKTAKNPNGFRDQLVQTFTIGFGRDISSAGRNYLTNGAS 303 Query: 385 SPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 +Y+++ + D L F I+ + +V+++ Sbjct: 304 RDDYYFSAESEDDLYRAFDTITDSIKDESQNVVIE 338 >gi|225010242|ref|ZP_03700714.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] gi|225005721|gb|EEG43671.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] Length = 351 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 76/219 (34%), Gaps = 44/219 (20%) Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 +++ L+F LL + A ++E +R I VD+S SM Sbjct: 51 SAFKAPLKFSLQLLGVAAIVIALVNPKAGTKLETVKREGVDIVFAVDVSKSMLAE----- 105 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 D +M K + ++ + +G+I Y + + + Sbjct: 106 ----------DIAPNRMEKAKRLVSEIINELASD-------RIGIIAYAAQAYPQLPITT 148 Query: 272 GTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + ++ D L + T + A++ A + + N Q Sbjct: 149 DFGAAKMFLQGMNTDMLSSQGTAISDAIELA----------TTYYNDAAQ--------TN 190 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + ++DGE+++ VN + KA E IKI TI + Sbjct: 191 RVLFIVSDGEDHSEGGAVNAVS---KATEAGIKIFTIGV 226 >gi|116622501|ref|YP_824657.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225663|gb|ABJ84372.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 337 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 78/239 (32%), Gaps = 61/239 (25%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 E I +V D SGSM K+A + A+ FL S Sbjct: 104 ASEDVPVSIVIVFDCSGSMG---------------------PKLAKSRAAVAAFLSS--- 139 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + E L+ + R + + T++++ + + T A+ A Sbjct: 140 ANPEDEF---SLVLFNDRAQLVSGFNRQTDELQS---KLFYAQSKGRTALLDAIYLAMDQ 193 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + K +K ++ ++DG +N + ++ +K ++ KE +I Sbjct: 194 MKHAKHS------------------RKAVLVISDGGDNCSRYSMREVK--NRVKEGDAQI 233 Query: 365 VTISIN----------ASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 +I I G LL S S + + N + L V I + ++ Sbjct: 234 YSIGILEAMGFRGRSAEELAGPALLDDIASQSGGRLFEIDNLNELSDVASKIGMALRNQ 292 >gi|163846842|ref|YP_001634886.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222524662|ref|YP_002569133.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163668131|gb|ABY34497.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222448541|gb|ACM52807.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 545 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 73/220 (33%), Gaps = 43/220 (19%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R I LVVD SGSM KM +K + FL I Sbjct: 367 RKRADILLVVDTSGSMEGD--------------------KMTMVKAGIETFLMRILPED- 405 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +GLI + ++ + + +E + T A+ A Q L + Sbjct: 406 -----RLGLITFDSQARLVVPMAPLSENRIDLQIAVQEMRASGRTALFDALDLARQTLEA 460 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + I+ L+DG +N + + ++ + E+ I I + Sbjct: 461 LPPAEDDR--------------IRAIVLLSDGADNASRLTLEEVRR--QFDESGITIFPV 504 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + S +++L V ++ + +F+N+S+ Sbjct: 505 AY-GSDADRQVLDAIAEFSRTIVVVGDSGDIAQIFENLSR 543 >gi|320105085|ref|YP_004180676.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319752367|gb|ADV64127.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 323 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 76/238 (31%), Gaps = 44/238 (18%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P +R + IEL VD+SGSM T+ A+ Sbjct: 89 PVRFSEPRTKRVMTNIELCVDVSGSMMSPFGDG---------------TRYDMSMKAIDK 133 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR---QYVTRDMDSLILKPTDS 294 FLDS GL + + + ++ ++ ++ L + T+ Sbjct: 134 FLDS-RRGDA------FGLTFFGNNYLHWVPLTSDVSAIKCAPPFMKPEVAPLWMSGTEI 186 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ + L + + II ++DG + N ++ Sbjct: 187 GKALLGCRRTL------------------VERQEGDRAIILISDGA-SFDLGGGNDEEVA 227 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 K + I + I I+ + ++ C + + + +L +F+ I Q+ R Sbjct: 228 RLLKRDGIVVYAIHIDETEIPDPIVTICSITGGDAFAPDDPSALEAIFKRIDQMTPTR 285 >gi|308511201|ref|XP_003117783.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] gi|308238429|gb|EFO82381.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] Length = 566 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 64/223 (28%), Gaps = 46/223 (20%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + I ++ D SGS+ + + A + A++ F L Sbjct: 383 PVKDCRYDIGIIFDSSGSLEKNFQKQLKFATTLVEQMPISP---NATRVAIIQFAGKTKL 439 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 RV + +++ + R T + A+K Sbjct: 440 -----------------RVLADFAQKKSAAELKTIIGRSH--FFSGTTFTNGALKT---- 476 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q K + ++ TDG + S + K + + Sbjct: 477 ----------MADLFQKSKRADAKLK--VVLFTDGYSAEDTSEGA-----EALKSQGVVV 519 Query: 365 VTISINASPN-GQRL--LKTCVSSPEYHYNVVNADSLIHVFQN 404 T+ I+ + G + L +SP + +N + L F + Sbjct: 520 YTVGISTEKSTGLNMKELHGMATSPNHFFNASDFVELSKNFPS 562 >gi|301059316|ref|ZP_07200243.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300446545|gb|EFK10383.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 527 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 48/237 (20%), Positives = 78/237 (32%), Gaps = 47/237 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ERP I +V+D SGSM K+ K A + L + Sbjct: 147 ERPPVNIAIVLDRSGSMSGQ--------------------KLEKAKQAAIEALRRLGQKD 186 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +I Y V K I P+ V +R T + Q + Sbjct: 187 MFS------VIVYDHNV-KTIVPAQSARNVEWIESRIRGIGPGGNTALFGGVSQGASEV- 238 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-ENFIKIV 365 R +N + II L+DG N S+ + A + I + Sbjct: 239 ----RKNLSNKYVHR-----------IILLSDGLANVGPSSPEDLGRLGAALIKESISVT 283 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-QNISQL--MVHRKYSVILK 419 TI + N + + +S Y V ++ L +F + + +V +K SVI++ Sbjct: 284 TIGVGTDYNEDLMARLSQNSDGNTYFVESSRDLPKIFAAELGDVLNVVAKKVSVIIQ 340 >gi|300782091|ref|YP_003762382.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] gi|299791605|gb|ADJ41980.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] Length = 602 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 39/242 (16%), Positives = 78/242 (32%), Gaps = 47/242 (19%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + + LL R + LVVD+SGSM + ++K+ K A Sbjct: 393 TVLAKLLTTWTDLRKKANVLLVVDVSGSMGDEVKG-------------TGKSKIDLAKQA 439 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-------WGTEKVRQYVTRDMDSL 287 + L +GL + T ++ + + G+ +R Sbjct: 440 AIDSLGQFVPRD------QVGLWQFATHLDGDKDYQELLPVQPLGSNGKETLASRLSGLT 493 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + AY+ L + S ++ LTDG N + Sbjct: 494 PQSGTGLYDSSLAAYEYLKAHLDPSAINA----------------VVVLTDGRNEDP-GG 536 Query: 348 VNTIKICDKAKENF----IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 V+ + + + +++ TI+ + L + ++ Y+ DS+ VF Sbjct: 537 VDLDHLVPQLRPEGNAESVRLFTIAYGGDADQNVLKQIAEATAGSEYDSSKPDSINQVFT 596 Query: 404 NI 405 ++ Sbjct: 597 SV 598 >gi|255566338|ref|XP_002524155.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] gi|223536573|gb|EEF38218.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] Length = 514 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 73/223 (32%), Gaps = 52/223 (23%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 +RP + V+D+SGSM K+A +K A+L + + Sbjct: 55 STNDRPGLDLVAVLDVSGSMAGD--------------------KIAKVKTAMLFVIKKLS 94 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQAY 302 + + ++ ++ + TE ++ + ++ T+ T ++ Sbjct: 95 PID------RLSVVKFSADASRLCPLRQITEDSQKDLENLINGLNADGATNITAGLQTGL 148 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++L S II ++DGE N + + I + + Sbjct: 149 KVLNDRSLSSGRVVG---------------IILMSDGEQNAG-GDAAQVPIGN------V 186 Query: 363 KIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVFQ 403 + T + + +LK S +V N D+L F Sbjct: 187 PVYTFGFGINHEPR-VLKAIANNSMGGTFSDVQNTDNLSLAFS 228 >gi|331089974|ref|ZP_08338865.1| hypothetical protein HMPREF1025_02448 [Lachnospiraceae bacterium 3_1_46FAA] gi|330403112|gb|EGG82675.1| hypothetical protein HMPREF1025_02448 [Lachnospiraceae bacterium 3_1_46FAA] Length = 4107 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 35/236 (14%), Positives = 79/236 (33%), Gaps = 50/236 (21%) Query: 222 DKKRTKMAALKNALLLFLDSI----DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 TK+ AL+NA+ F++ D + +K + L+ + + +++ Sbjct: 222 GTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFAS--DESDNIGNDFINNN 279 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK--RSFFTNFFRQGVKIPSLP--FQKFI 333 ++ + L T + + L + F N ++ ++ QK + Sbjct: 280 YNRSQIVTELKSYTTKNISDLTSTVNSLIAAGATRADFGLNQAQRAFQLGGTREGAQKVV 339 Query: 334 IFLTDGE---NNNFKSNVNTIKICD--KAKENFIKIVTISI------------------- 369 +F TDG+ N+++ ++V I + + K+ I +I + Sbjct: 340 VFFTDGQPTSNSDWSNSVAAAAITNAKELKDANALIYSIGVFRDANPNDTNTSAGNFNGY 399 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 L + +Y+ +AD L ++F IS + Sbjct: 400 MHAVSSNYPDATATSSTRPNRYSCTLGKRTDNSDYYKAATDADELNNIFNEISSDL 455 >gi|328907235|gb|EGG27001.1| von Willebrand factor type A domain protein [Propionibacterium sp. P08] Length = 318 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 76/241 (31%), Gaps = 51/241 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + +D+S SM + D ++++A K A FL + Sbjct: 84 RDRATVVVAIDVSRSM---------------VATDVDPSRLSAAKTAAKDFLGDL---PP 125 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 L+ + + + P+ V + +L T + Y L Sbjct: 126 RFNV---SLVKFAASSQVVVPPTTDRAVVSTAIANLQ---VLPSTAIGEGI---YSSL-- 174 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + P I+ L+DG N + ++ K +A + + TI Sbjct: 175 -NALKLVPDDPKHPGQKPPAA----IVLLSDGATNVGRPSLEAAK---EAGRQHVPVYTI 226 Query: 368 SIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + N L +S ++ + L V+++I+Q + + K Sbjct: 227 AYGTAGGYVVEGGQRQPVPVNHYELAAVAKASGGEKFSAESLGQLSDVYKSIAQSVGYEK 286 Query: 414 Y 414 Sbjct: 287 V 287 >gi|317502373|ref|ZP_07960539.1| hypothetical protein HMPREF1026_02483 [Lachnospiraceae bacterium 8_1_57FAA] gi|316896246|gb|EFV18351.1| hypothetical protein HMPREF1026_02483 [Lachnospiraceae bacterium 8_1_57FAA] Length = 4107 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 35/236 (14%), Positives = 79/236 (33%), Gaps = 50/236 (21%) Query: 222 DKKRTKMAALKNALLLFLDSI----DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 TK+ AL+NA+ F++ D + +K + L+ + + +++ Sbjct: 222 GTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFAS--DESDNIGNDFINNN 279 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK--RSFFTNFFRQGVKIPSLP--FQKFI 333 ++ + L T + + L + F N ++ ++ QK + Sbjct: 280 YNRSQIVTELKSYTTKNISDLTSTVNSLIAAGATRADFGLNQAQRAFQLGGTREGAQKVV 339 Query: 334 IFLTDGE---NNNFKSNVNTIKICD--KAKENFIKIVTISI------------------- 369 +F TDG+ N+++ ++V I + + K+ I +I + Sbjct: 340 VFFTDGQPTSNSDWSNSVAAAAITNAKELKDANALIYSIGVFRDANPNDTNTSAGNFNGY 399 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 L + +Y+ +AD L ++F IS + Sbjct: 400 MHAVSSNYPDATATSSTRPNRYSCTLGKRTDNSDYYKAATDADELNNIFNEISSDL 455 >gi|313837214|gb|EFS74928.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314927768|gb|EFS91599.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] gi|314971985|gb|EFT16083.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] Length = 320 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 76/241 (31%), Gaps = 51/241 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + +D+S SM + D ++++A K A FL + Sbjct: 86 RDRATVVVAIDVSRSM---------------VATDVDPSRLSAAKTAAKDFLGDL---PP 127 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 L+ + + + P+ V + +L T + Y L Sbjct: 128 RFNV---SLVKFAASSQVVVPPTTDRAVVSTAIANLQ---VLPSTAIGEGI---YSSL-- 176 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + P I+ L+DG N + ++ K +A + + TI Sbjct: 177 -NALKLVPDDPKHPGQKPPAA----IVLLSDGATNVGRPSLEAAK---EAGRQHVPVYTI 228 Query: 368 SIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + N L +S ++ + L V+++I+Q + + K Sbjct: 229 AYGTAGGYVVEGGQRQPVPVNHYELAAVAKASGGEKFSAESLGQLSDVYKSIAQSVGYEK 288 Query: 414 Y 414 Sbjct: 289 V 289 >gi|113971308|ref|YP_735101.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113885992|gb|ABI40044.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 624 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 37/254 (14%), Positives = 76/254 (29%), Gaps = 54/254 (21%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 Y+ ++ E + ++D+SGSM Sbjct: 211 YNYDMMLLRIGLKGYEQSKAELGASNLVFLLDVSGSMA-------------------SPD 251 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ L+ AL + + V VY G G + Q + ++ Sbjct: 252 KLPLLQTALKMLTQQLGAQDKVSIVVYAGAAG--------VVLDGAAGNDSQTLNYALEQ 303 Query: 287 LILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 L T+ ++ AYQ ++ + + +IF TDG+ N Sbjct: 304 LSAGGSTNGAQGIQLAYQ-------------LAKKHLVEGGINR---VIFATDGDFNVGT 347 Query: 346 SNV-NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA--------- 395 +N+ I + K+ I + T+ L++ Y +++ Sbjct: 348 TNLDELIDLVSAQKQLGIGLTTLGFGMGDYNDHLMEQLADKGNGQYAYIDSLNEARKVLV 407 Query: 396 DSLIHVFQNISQLM 409 + L I++ + Sbjct: 408 EQLSATLLTIAKEV 421 >gi|323350757|ref|ZP_08086417.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis VMC66] gi|322123037|gb|EFX94736.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis VMC66] Length = 458 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 57/175 (32%), Gaps = 28/175 (16%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V D+SGSM A+N ++M L++ + ++ + + +V Sbjct: 198 IAVSFVFDISGSMKGALNG--------ANPTSNNPSRMDILRDKAEIMINELQSVGNVSV 249 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++ + + + +++ + T+ ++ L Sbjct: 250 NLTTFSTTGSYKQAAFSQLDREAGTIKESIKNLKSD--GGVTNPGDGLRYGMVSLQ---- 303 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 +Q ++ K+++ LTDG N + N + K I+ Sbjct: 304 --------KQHAQL------KYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQHF 344 >gi|282858824|ref|ZP_06267969.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] gi|282588393|gb|EFB93553.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] Length = 318 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 41/265 (15%), Positives = 80/265 (30%), Gaps = 54/265 (20%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 L + I++ + E I L +D+S SM Sbjct: 53 IHLPMFLRCVTYVLVIIILARPQSNSAWNNKETEGIDIMLAMDISASM------------ 100 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + D ++ KN F++ +GL + + Sbjct: 101 ---LTNDVTPNRLIVAKNVASDFIN----GRPNDN---IGLTIFAGEAFTQCPLTIDHAT 150 Query: 276 VRQYVTRDMDSLILKP-----TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + + L++K T + A L + +S Sbjct: 151 LINLLNNVRADLVVKGLIQDGTAIGMGLANAVGRLKASNAKS------------------ 192 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ-------RLLKTCV 383 K +I LTDG NN + T AK+ I++ TI + +G L + + Sbjct: 193 KIVILLTDGSNNVGSISPMTAAT--IAKKFNIRVYTIGLGTEQSGNYNDIDYTTLKQIAL 250 Query: 384 SSPEYHYNVVNADSLIHVFQNISQL 408 ++ Y + L+ ++ +I++L Sbjct: 251 TTNGEFYRAQSQTELLQIYNDINKL 275 >gi|153815168|ref|ZP_01967836.1| hypothetical protein RUMTOR_01400 [Ruminococcus torques ATCC 27756] gi|145847427|gb|EDK24345.1| hypothetical protein RUMTOR_01400 [Ruminococcus torques ATCC 27756] Length = 4109 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 35/236 (14%), Positives = 79/236 (33%), Gaps = 50/236 (21%) Query: 222 DKKRTKMAALKNALLLFLDSI----DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 TK+ AL+NA+ F++ D + +K + L+ + + +++ Sbjct: 224 GTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFAS--DESDNIGNDFINNN 281 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK--RSFFTNFFRQGVKIPSLP--FQKFI 333 ++ + L T + + L + F N ++ ++ QK + Sbjct: 282 YNRSQIVTELKSYTTKNISDLTSTVNSLIAAGATRADFGLNQAQRAFQLGGTREGAQKVV 341 Query: 334 IFLTDGE---NNNFKSNVNTIKICD--KAKENFIKIVTISI------------------- 369 +F TDG+ N+++ ++V I + + K+ I +I + Sbjct: 342 VFFTDGQPTSNSDWSNSVAAAAITNAKELKDANALIYSIGVFRDANPNDTNTSAGNFNGY 401 Query: 370 ----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 L + +Y+ +AD L ++F IS + Sbjct: 402 MHAVSSNYPDATATSSTRPNRYSCTLGKRTDNSDYYKAATDADELNNIFNEISSDL 457 >gi|118468162|ref|YP_887464.1| hypothetical protein MSMEG_3149 [Mycobacterium smegmatis str. MC2 155] gi|118169449|gb|ABK70345.1| conserved hypothetical protein [Mycobacterium smegmatis str. MC2 155] Length = 327 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 74/234 (31%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D +++ A + A F + + Sbjct: 91 VMLVIDVSQSM---------------RATDVAPSRLVAAQEAAKQF------ADQLTPGI 129 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y ++P+ E + + + + T + + A Q + + Sbjct: 130 NLGLIAYAGTATVLVQPTTNREATKNGLDKLQLADR---TATGEGIFTALQAIATVGAV- 185 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI---KICDKAKENFIKIVTISI 369 + P I+ ++DG+ + N AK+ + I T+S Sbjct: 186 ---------IGGGDEPPPARIVLMSDGKETVPSNPDNPKGAFTAARTAKDQGVPISTVSF 236 Query: 370 NAS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P +L+ S + + + L VF ++ Q + Sbjct: 237 GTPYGYVEINDQRQPVPVDDEMLEKIAQLSGGDAFTASSLEQLKAVFTSLQQQI 290 >gi|313838674|gb|EFS76388.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL086PA1] Length = 320 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 78/241 (32%), Gaps = 51/241 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + +D+S SM + D + ++++A K A FL + Sbjct: 86 RDRATVVVAIDVSRSM---------------VATDVEPSRLSAAKTAAKDFLGDL---PP 127 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 L+ + + + P+ V +T +L T + Y L Sbjct: 128 RFNV---SLVKFAASAQVVVAPTTDRAAVSTAITNLQ---VLPSTAIGEGI---YSSL-- 176 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + P I+ L+DG N + ++ K +A + + TI Sbjct: 177 -NALKLVPDDPKHPGQKPPAA----IVLLSDGATNVGRPSLEAAK---EAGRQHVPVYTI 228 Query: 368 SIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + N L +S ++ + L V+++I+Q + + K Sbjct: 229 AYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEK 288 Query: 414 Y 414 Sbjct: 289 V 289 >gi|225174955|ref|ZP_03728952.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] gi|225169595|gb|EEG78392.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] Length = 841 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 34/248 (13%), Positives = 84/248 (33%), Gaps = 60/248 (24%) Query: 172 KIVSFIPALLRIEMG---ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + A+++ G + LVVD+SGSM + P + T + Sbjct: 18 LLSTLAGAVVQARSGRSHTTEALSVILVVDISGSMD----------RNDPQYLRETATLI 67 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP---SWGTEKVRQYVTRDMD 285 + + Y+G++ + R+E+ + + ++ V + + Sbjct: 68 ---------------FMDLLGPKDYLGVLAFDDRIEELVPLQQVADNKGTFKEAV--EGN 110 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + TD A+++A++ L S + + ++ ++FLTDGE N Sbjct: 111 LVPRGFTDYVGALEEAFEQLHS----------------VETGDARQVVVFLTDGEPNPHL 154 Query: 346 SNVNTIKICD-----------KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 N + + + + + ++ + + L + + + + Sbjct: 155 DARNDDEFMEGYLGELWDLTGEYAAAGVPVYPVAFSDEVGPEVLEQIAGHTGADFVLMPD 214 Query: 395 ADSLIHVF 402 L+ F Sbjct: 215 PGDLVVTF 222 >gi|196250158|ref|ZP_03148852.1| von Willebrand factor type A [Geobacillus sp. G11MC16] gi|196210342|gb|EDY05107.1| von Willebrand factor type A [Geobacillus sp. G11MC16] Length = 668 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 39/184 (21%), Positives = 66/184 (35%), Gaps = 48/184 (26%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP + V+D+SGSM K+ + K+AL ++ H Sbjct: 194 RPPIDVVFVMDVSGSMTTM--------------------KLQSAKSALQAAVNYFKTNYH 233 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDS----LILKPTDSTPAMKQAY 302 + LI ++ V+ +G V + +D T+ + A+ A Sbjct: 234 PND--RFALIPFSDDVKATSVVPFGSKSNVISQLDAILDEGNRLTANGGTNYSAALSLA- 290 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN--NFKSNVNTIKICDKAKEN 360 Q +D +R +K+IIFLTDG N S++ +I K++ Sbjct: 291 QSYFNDPER------------------KKYIIFLTDGMPTVLNTTSSITHKEIKKGFKDD 332 Query: 361 FIKI 364 KI Sbjct: 333 GEKI 336 >gi|118349482|ref|XP_001008022.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289789|gb|EAR87777.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 632 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 44/266 (16%), Positives = 77/266 (28%), Gaps = 50/266 (18%) Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 T+ + D + + + Q I+ ++ RP + V+D SG Sbjct: 160 KTLNKHFQFNNEQDCNIPIMVSVKTQDSTNDILEEQKEQVKQAEQSRPSIDLVCVIDNSG 219 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 SM K+ +K LL LD ++ + LI + + Sbjct: 220 SMQGE--------------------KIQNVKTTLLQLLDMLNSND------RLSLILFNS 253 Query: 262 RVEKNIEP----SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 T +++ + TD M A+ IL +KR FF Sbjct: 254 YPTLLCNLRKVDDKNTPNIQKIINSITAEEY---TDINSGMLMAFNIL---QKRQFFNPV 307 Query: 318 FRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 I L+DG +N + I K I + + +G Sbjct: 308 SS-------------IFLLSDGQDNGADEKIKKYINSNQSLKNECFSIHSFGFGSDHDGP 354 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVF 402 + + C Y V + + F Sbjct: 355 LMNRICQLKDGNFYYVEKINQVDEFF 380 >gi|332879552|ref|ZP_08447247.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682518|gb|EGJ55420.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 332 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 50/305 (16%), Positives = 83/305 (27%), Gaps = 74/305 (24%) Query: 134 NNRLDSSNNTIFYNMDVMTSYD--YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 ++T Y + + FI L I++ + + Sbjct: 29 KEPTLLVSDTTAYRYAPKSYKNHLIHAPFILRLFTFVMVIIILARPQTSNSWKNTQVEGI 88 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VD+S SM D K ++ A K F+ Sbjct: 89 DIMLAVDVSTSMLAE---------------DLKPNRLEAAKQVASEFI----AGRPNDN- 128 Query: 252 VYMGLIGYTTRVEKNIEPS-------WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +GL + + ++ +I T + A Sbjct: 129 --IGLTIFAAEAFTQCPMTTDHAVLLNLFHGIKT--DMAQRGMIQDGTAVGMGIANAVSR 184 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L K +S K II LTDG NN + T + AK I++ Sbjct: 185 LKDSKAKS------------------KVIILLTDGTNNAGDISPLTAA--EIAKSFGIRV 224 Query: 365 VTISI---------NASPNGQRLL--------KTCVS----SPEYHYNVVNADSLIHVFQ 403 TI + G + L KT + + Y + L V++ Sbjct: 225 YTIGVGTNGLAPYPMPVAGGVQYLNVPVEIDTKTLAAIAGKTDGEFYRATDNKKLEDVYK 284 Query: 404 NISQL 408 +I +L Sbjct: 285 DIDKL 289 >gi|311063719|ref|YP_003970444.1| cell surface protein [Bifidobacterium bifidum PRL2010] gi|310866038|gb|ADP35407.1| Cell surface protein with gram positive anchor domain [Bifidobacterium bifidum PRL2010] Length = 1176 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 31/266 (11%), Positives = 71/266 (26%), Gaps = 76/266 (28%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + P I +V D SGSM M +++ K A+ Sbjct: 620 TQSGSSQTTVSPA-DIVVVFDTSGSMSNPMG---------------HNSRLEVAKTAVNS 663 Query: 238 FLDSIDLLSHVKED--VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + +D + M L+ ++T + + V T+ Sbjct: 664 MAQHLLTSENQGKDSNIRMALVPFSTTAGNVSNFTDNAMDIVSAVNGLGAD---GGTN-- 718 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT----- 350 + +K+I+F++DG+ S+V T Sbjct: 719 ---------------WEAALKAANAKLTSGRKGVKKYIVFMSDGDPTFRTSSVRTGTDWW 763 Query: 351 -------------------------------IKICDKAKENFIKIVTISINASPNGQRLL 379 + + + + ++ +++ P ++ Sbjct: 764 GRPTYDDDDRRGLPAGVHGSGSSDQYGANLSSAVAEANRRGDATLFSVGVSSDPT--KMR 821 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNI 405 + +Y+ + D L F +I Sbjct: 822 GFADQTKGSYYSATSTDELNKAFADI 847 >gi|301788518|ref|XP_002929675.1| PREDICTED: complement factor B-like [Ailuropoda melanoleuca] gi|281345621|gb|EFB21205.1| hypothetical protein PANDA_019913 [Ailuropoda melanoleuca] Length = 768 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 81/239 (33%), Gaps = 45/239 (18%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 GE+ +V+D SGSM+ + D D KN L F++ + Sbjct: 252 NPGEQQK--RRIVLDPSGSMNIYLVLDGSDSIGIGNFTR--------AKNCLRDFIEKVA 301 Query: 244 LLSHVKEDVYMGLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + GL+ Y T R + + + W T + Q D + T++ Sbjct: 302 SYGVKPKY---GLVTYATIPKVWVKLRDDNSSDADWVTRILNQ--ISYEDHKLKAGTNTK 356 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI---- 351 A+++ Y ++ ++ + + II +TDG +N V+ I Sbjct: 357 KALQEVYNMM----------SWPGNALLEGWNHTRHVIILMTDGLHNMGGDPVSVIHEIR 406 Query: 352 ---KICDKAKEN-----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I K + + + + L + ++ + V + ++L VF Sbjct: 407 DFLYIGRDRKNPREDYLDVYVFGVGPLVNQENINALASKKDKEQHVFRVKDMENLEDVF 465 >gi|225010241|ref|ZP_03700713.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] gi|225005720|gb|EEG43670.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] Length = 330 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 75/276 (27%), Gaps = 69/276 (25%) Query: 157 RLQFIEHLLNQRYNQKIV----SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 RL+ + +L ++ + + I + +D+S SM Sbjct: 51 RLKPMLFILRLLSISALIIGLARPQNSSENTKTETTKGIDIVMAIDVSSSMLAK------ 104 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D ++ ALK F+ + + +GL+ Y + Sbjct: 105 ---------DLSPNRLVALKEVASNFIK-------DRPNDRIGLVVYAGESYTKTPITSD 148 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V + + + T + + L K K Sbjct: 149 KRLVLEALKEIKYGALTDGTAIGMGLATSVNRLKDSKA------------------LSKV 190 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI---------------------NA 371 II LTDG NN T + A E IK TI + A Sbjct: 191 IILLTDGVNNAGFIEPATAA--ELAVEYDIKTYTIGLGTNGNALSPIAYNPDGSFRYGMA 248 Query: 372 S-PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 + LL+ + + ++ + L ++ I Sbjct: 249 EVQIDEALLEQIATLTGGKYFRATDNKKLEAIYNEI 284 >gi|326382237|ref|ZP_08203929.1| hypothetical protein SCNU_04806 [Gordonia neofelifaecis NRRL B-59395] gi|326198967|gb|EGD56149.1| hypothetical protein SCNU_04806 [Gordonia neofelifaecis NRRL B-59395] Length = 330 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 39/248 (15%), Positives = 72/248 (29%), Gaps = 56/248 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + LVVD+S SM D +++ A + A F Sbjct: 85 RNRATVMLVVDVSNSM---------------KSTDVAPSRLKAAQAAGKRF------ADD 123 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + + +GLI + T + T + + A Q + Sbjct: 124 LTDGINLGLISF---AGTASTLVSPTPDHSATKNALDRLKLADKTATGEGIFAALQQI-- 178 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNTIKICDKAKENFIKI 364 T G + P + I+ L+DG+ + KAKE + + Sbjct: 179 ------DTLNAVLGGPSGAPPAR--IVLLSDGKQTVPESPDDPRGGFTAARKAKEKGVPV 230 Query: 365 VTISIN-------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 TIS + + L K S + + D L V+ + Sbjct: 231 STISFGTLTGTVDLETPGGGVERVPVPVDDESLRKIANLSGGDFFTASSLDELNKVYSTL 290 Query: 406 SQLMVHRK 413 + + + + Sbjct: 291 QKQIGYER 298 >gi|325686522|gb|EGD28550.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK72] Length = 458 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 45/327 (13%), Positives = 100/327 (30%), Gaps = 44/327 (13%) Query: 49 KKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAK----RALIDDAKRFIKNHIKE 104 + SM+ + + S + + S+ + ++ A ++ N K+ Sbjct: 52 MRTSMQ-YVDRTVGKATSIFILDDSKFKGSKQGLTREWSYIGLSADGKKVMNYVWNKQKQ 110 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMAN-NRLDSSNNTIFYNMDVMTSYDYRLQFIEH 163 + + N+ NRL S N T Y T+ + Sbjct: 111 DW--DVSGLGTKSLYNMKLDLEFKTEGAYQDNRLISYNLTGKYP---DTNNKLGIDTAIS 165 Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPI-----FLIELVVDLSGSMHCAMNSDPEDVNSAP 218 LN + V+ + I PI + V D+SGSM A+N Sbjct: 166 ALNTKQVFSKVAKGKKGIAIAYRTDPIQGQMNIAVSFVFDISGSMKGALNG--------A 217 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 ++M L++ + ++ + + +V ++ + + + +++ Sbjct: 218 NPTSNNPSRMDILRDKAEIMINELQSVGNVSVNLTTFSTTGSYKQAAFSQLDREAGTIKE 277 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ ++ L +Q ++ K+++ LTD Sbjct: 278 SIKNLKSD--GGVTNPGDGLRYGMVSLQ------------KQHAQL------KYVVLLTD 317 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIV 365 G N + N + K I+ Sbjct: 318 GVPNAYLVNQQGQAGGLEMKREGIQHF 344 >gi|301609308|ref|XP_002934204.1| PREDICTED: epithelial chloride channel protein [Xenopus (Silurana) tropicalis] Length = 906 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 83/217 (38%), Gaps = 51/217 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+SGSM ++ L A +++ I V++ Sbjct: 309 VSLVLDISGSM-------------------TNANRITRLYQASEVYIMQI-----VEQGA 344 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKR 311 Y+G++ ++ E + T+ ++ + + ++ T+ ++Q Q+ + + Sbjct: 345 YVGIVTFSNVAEIKSQLVKITDTFQRESLKLKLPTVATGGTNICAGVQQGLQVNRNLDQS 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISI- 369 + T I+ LTDGE++ S C ++ I TI++ Sbjct: 405 THGTE----------------IVLLTDGEDSGISS-------CFPDITKSGAIIHTIALG 441 Query: 370 -NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 NA P ++L Y + V+A+ LI F I Sbjct: 442 NNADPGLEKLADLTGGLKLYASDKVDANGLIDSFSGI 478 >gi|324996174|gb|EGC28084.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK678] Length = 458 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 57/175 (32%), Gaps = 28/175 (16%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V D+SGSM A+N ++M L++ + ++ + + +V Sbjct: 198 IAVSFVFDISGSMKGALNG--------ANPTSNNPSRMDILRDKAEIMINELQSVGNVSV 249 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++ + + + +++ + T+ ++ L Sbjct: 250 NLTTFSTTGSYKQAAFSQLDREAGTIKESIKNLKSD--GGVTNPGDGLRYGMVSLQ---- 303 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 +Q ++ K+++ LTDG N + N + K I+ Sbjct: 304 --------KQHAQL------KYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQHF 344 >gi|300120207|emb|CBK19761.2| unnamed protein product [Blastocystis hominis] Length = 474 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 34/185 (18%), Positives = 69/185 (37%), Gaps = 37/185 (20%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + + +D + ++ ++T V E ++ VR+ + D + Sbjct: 133 RKEIDFVKSILDYYYLHPNYTLVSILEFSTDVRVLQELTYDACDVRKAIDSDR---MSGL 189 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN--------NN 343 T+ A+++A++IL K I+ +TDG N+ Sbjct: 190 TNIAKAIEEAHRIL-----------------KNSRSDIPDQIVLITDGFQTVHSSINCND 232 Query: 344 FKSNVNTIKI--CDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIH 400 + N I AK + I+I TI + A+ + L+ SSP + ++ SL+ Sbjct: 233 HPHDCNAYAIEKARAAKADDIQIYTIGVGAASYYEDDLRQIASSPSDQYF------SLVD 286 Query: 401 VFQNI 405 + +I Sbjct: 287 DYSSI 291 >gi|290970562|ref|XP_002668176.1| predicted protein [Naegleria gruberi] gi|284081406|gb|EFC35432.1| predicted protein [Naegleria gruberi] Length = 518 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 39/236 (16%), Positives = 81/236 (34%), Gaps = 35/236 (14%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 + +++ P E +F V D SGSM ++ P ++ A Sbjct: 312 RHLINNAPPPGPQISSE--VFHFIFVNDKSGSMG--------GSDARPTSSKYSNDRLGA 361 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 L + FL+ D S + + Y + S + YV Sbjct: 362 LFESCEKFLEVRDGSSDLVSCIMYDHSAYNCFTTNPLSTSL-VSTMSSYV-------AGG 413 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T T AM+ +++S ++ ++F++DGE++ ++ T Sbjct: 414 GTSFTNAMQSVSSLISS--------------TYPNHQSYKIVVLFMSDGEDSADEAVSIT 459 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 ++ + I + TI + S + L + + ++ SL ++Q I+ Sbjct: 460 GQL---VSSHDIILHTIQLGGSSDNTGLRQMAATGRGQFKRANDSASLAGIYQEIA 512 >gi|291613312|ref|YP_003523469.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] gi|291583424|gb|ADE11082.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] Length = 321 Score = 46.0 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 74/226 (32%), Gaps = 47/226 (20%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 L VD+SGSM + ++ ++ F+ + + Sbjct: 92 LAVDVSGSMSTQDMAG-------------NANRLQVVQKVAGDFIRR-------RHGDQV 131 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 GLI + T+ + V Q++ M + T A+ A + L Sbjct: 132 GLILFGTQPYLQAPLTTDLNTVGQFLDEAMIGVAGTQTAIGDAIGLAIKRLRDA------ 185 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----- 369 TN + + +I LTDG N+ + A ++I TI + Sbjct: 186 TNVSGRKGET-------VLILLTDGSNDAGAMPPDEAAK--MAAAAGLRIYTIGVGSDQT 236 Query: 370 ------NASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 A+ + LK ++ ++ + ++L V+ I +L Sbjct: 237 DPFGMGGANDLDEDTLKLIAKTTGGEYFRATDVENLQQVYTRIDRL 282 >gi|256821839|ref|YP_003145802.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256795378|gb|ACV26034.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 958 Score = 46.0 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 73/225 (32%), Gaps = 44/225 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV+D SGSM + ++K+ K++ LF+ ++ + Sbjct: 375 DIMLVIDRSGSMSGDAGTG--------------QSKIDEAKDSASLFVQLVEASAGH--- 417 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 MGL+ ++T + G + + + + + + Sbjct: 418 -RMGLVSFST----SASIDEGIGNLNPGKKNQLIGPAPYSGGAVGGLIPDGWTSI----- 467 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + + + P K I+ LTDG +N T I D +I I + Sbjct: 468 -GDGIDKAQSELTGGANP--KTILLLTDGLQNTPPMIETATNDIGDT------RIHAIGL 518 Query: 370 --NASPNGQRLLKTCVSSPEYHYNVVNADSLIH----VFQNISQL 408 A+ NG L S+ + + L F +I + Sbjct: 519 GTEANLNGGLLSDLTQSTGGAYTRAGDGLELKKFFALAFGDIFED 563 >gi|307107982|gb|EFN56223.1| hypothetical protein CHLNCDRAFT_35166 [Chlorella variabilis] Length = 329 Score = 46.0 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 39/280 (13%), Positives = 87/280 (31%), Gaps = 34/280 (12%) Query: 139 SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE--RPIFLIELV 196 SSN T Y L + LL + + + ++ + R + +V Sbjct: 65 SSNATKPCTKLFCPLYSVGLSP-DPLLGTPASSEFYMAVGLDSGMKAADFARKQLNLVVV 123 Query: 197 VDLSGSMHCAMNSDPEDVNSAPIC----QDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 +D+SGSM +S D P + + + K+ K L + + + Sbjct: 124 LDVSGSMGSPFDSYYYDQTVQPTAGVPDEGETKKKIDVAKEVLAGIVGLLRPDDSLS--- 180 Query: 253 YMGLIGYTTRVEKNIEPS----WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ ++ +K+++ ++ D + + T+ + LT Sbjct: 181 ---VVLFSDAACVPKPLGPVRCADVDKLKEQISA--DVVEMGGTNFQAGIDAGGAQLTGC 235 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENFIKIVTI 367 ++ + + ++FLTD + N S + + I I Sbjct: 236 AA----------CMEANASLVENRVVFLTDAQPNAGDDSEQGLLARIKALSADGIYTTII 285 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADS----LIHVFQ 403 + N Q + +++V L+ F Sbjct: 286 GVGLDFNTQLVESIGKVRGSNYFSVHTPGEFRRRLVDEFD 325 >gi|303248312|ref|ZP_07334574.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] gi|302490337|gb|EFL50249.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] Length = 452 Score = 46.0 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 67/450 (14%), Positives = 140/450 (31%), Gaps = 65/450 (14%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA--------------SKMV 69 ++ A +++ + +G + + + N +++A +AA LAG+ S+ V Sbjct: 1 MVVAATLVGLMAAVGVAVDLGRVYVAHNKLQNAVDAAALAGSLQLPDDPDVDNGKVSQAV 60 Query: 70 SNLSRLGDRFESISN-----------HAKRALIDDAKRFIKNHIKESLSGYSAVFYNT-- 116 + D ++ A +D + +++ + YN Sbjct: 61 TTNLAANDPEAKATDISSGGATRSVCVTAEADVDMTLSKVVGLDATTVTAEACAGYNDIE 120 Query: 117 --EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 + + S R + + + I + T L + + N + Sbjct: 121 LVMVLDATGSMRGTPIANVKEAAANLVDLIMPDSGANTRSKIGLVPFQGKVRIDGNDPVT 180 Query: 175 SF-----IPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + + A R G E S + V++ + Sbjct: 181 AERDPDGVGAGCRNADGTLNDGKLKTEYSDTRSRN-SIFYGYTISGVSTYYDRTCSGMSP 239 Query: 228 MAAL---KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + AL K A+L + +I+ + + I + +V P + + V + M Sbjct: 240 IRALSSDKEAILDNIGAINAGAVTSGTLISEGIKWGHKVLSPKAP-YTEGNTDKKVRKIM 298 Query: 285 DSLILKPTD---------STPAMKQAY--QILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 L T+ S AY R + + S + I Sbjct: 299 IVLTDGDTEDGRCGGRYASASRTVNAYWTNAYFGQGLRPNSASSPYDTLSTASATLAQ-I 357 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTISI-NASPNGQRLLKTCVSS---- 385 TDG K N + D AK + ++I I ++ L+K SS Sbjct: 358 PDCTDG----GKLNQYVLDEADDAKNDADYPVEIFAIRFGDSDATDISLMKRIASSKSGT 413 Query: 386 PEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +++Y+ ++ + +F+ I Q + R S Sbjct: 414 DDHYYDAPDSSDIKDMFKKIGQQLGQRLMS 443 >gi|111221591|ref|YP_712385.1| hypothetical protein FRAAL2157 [Frankia alni ACN14a] gi|111149123|emb|CAJ60806.1| conserved hypothetical protein; putative membrane protein [Frankia alni ACN14a] Length = 319 Score = 46.0 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 73/237 (30%), Gaps = 52/237 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R I L +D+S SM D ++ A K F+D + Sbjct: 84 RERATIILAIDVSNSMAA---------------TDIAPNRLEAAKQGAEAFVDQL----- 123 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILT 306 + +GL+ + + R+ V + L L P T + + Q + Sbjct: 124 -PPRINLGLVSFAGSATVLVP----ASTDRESVRAGIRGLQLGPATAIGEGIFASLQAIN 178 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + KR P I+ L+DGE + N D A++ + + T Sbjct: 179 TAGKRFSDA---------GQSPPPAAIVLLSDGETTRGRPNTQAT---DAARQAHVPVDT 226 Query: 367 ISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I+ N Q L + + ++ D L V++ + + Sbjct: 227 IAYGTSDGTLDVGGQEVPVPVNEQALNEIADQTEGSYHRAATGDELRSVYKGLGSSI 283 >gi|313139523|ref|ZP_07801716.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] gi|313132033|gb|EFR49650.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] Length = 835 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 32/266 (12%), Positives = 72/266 (27%), Gaps = 76/266 (28%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + P I +V D SGSM M +++ K A+ Sbjct: 279 TQSGSSQTTVSPA-DIVVVFDTSGSMSNPMG---------------HNSRLEVAKTAVNS 322 Query: 238 FLDSIDLLSHVKED--VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + +D + M L+ ++T V + + V T+ Sbjct: 323 MAQHLLTSENQGKDSNIRMALVPFSTTVGNVSNFTDNAMDIVSAVNGLRAD---GGTN-- 377 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT----- 350 + +K+I+F++DG+ S+V T Sbjct: 378 ---------------WEAALKAANAKLTSGRKGVKKYIVFMSDGDPTFRTSSVRTGTDWW 422 Query: 351 -------------------------------IKICDKAKENFIKIVTISINASPNGQRLL 379 + + + + ++ +++ P ++ Sbjct: 423 GRPTYDDDDRRGLPAGVHGSGSSDQYGANLSSAVAEANRRGDATLFSVGVSSDPT--KMR 480 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNI 405 + +Y+ + D L F +I Sbjct: 481 GFADQTKGSYYSATSTDELNKAFADI 506 >gi|301792481|ref|XP_002931207.1| PREDICTED: epithelial chloride channel protein-like [Ailuropoda melanoleuca] Length = 904 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 69/211 (32%), Gaps = 51/211 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM + + +M A L + I+ S V + Sbjct: 312 LVLDKSGSMSS----------------EDRLLRMNQA--AELYLIQIIEKGSLVGMVTFE 353 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 L + I + + +T ++ T ++ +Q + + + Sbjct: 354 SLATILNYLTDIIG-----DNAYEKITANLPREASGGTSICSGLRAGFQAIIHSNQSTSG 408 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASP 373 + II LTDGE++ S C ++ K++ I TI++ Sbjct: 409 SE----------------IILLTDGEDDQISS-------CFEEVKQSGAVIHTIAL-GPS 444 Query: 374 NGQRL--LKTCVSSPEYHYNVVNADSLIHVF 402 + L L ++ Y + + L F Sbjct: 445 AARELETLSN-MTGGYRFYANKDINGLTDAF 474 >gi|138896202|ref|YP_001126655.1| hypothetical protein GTNG_2565 [Geobacillus thermodenitrificans NG80-2] gi|134267715|gb|ABO67910.1| Conserved hypothetical protein [Geobacillus thermodenitrificans NG80-2] Length = 668 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 39/184 (21%), Positives = 66/184 (35%), Gaps = 48/184 (26%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP + V+D+SGSM K+ + K+AL ++ H Sbjct: 194 RPPIDVVFVMDVSGSMTTM--------------------KLQSAKSALQAAVNYFKTNYH 233 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDS----LILKPTDSTPAMKQAY 302 + LI ++ V+ +G V + +D T+ + A+ A Sbjct: 234 PND--RFALIPFSDDVKATSVVPFGSKSNVISQLDAILDEGNRLTANGGTNYSAALSLA- 290 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN--NFKSNVNTIKICDKAKEN 360 Q +D +R +K+IIFLTDG N S++ +I K++ Sbjct: 291 QSYFNDPER------------------KKYIIFLTDGMPTVLNTTSSITHKEIKKGFKDD 332 Query: 361 FIKI 364 KI Sbjct: 333 GEKI 336 >gi|157823041|ref|NP_001101626.1| integrin alpha-11 [Rattus norvegicus] gi|149041917|gb|EDL95758.1| integrin, alpha 11 (predicted) [Rattus norvegicus] Length = 1171 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 60/182 (32%), Gaps = 27/182 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + +G++ Y + V+ V T++ Sbjct: 161 LIEILTKFYIGPGQIQVGIVQYGEDAVHEFHL-NDYKSVKDVVEAASHIEQRGGTET--- 216 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A+ I + + K +K +I +TDGE + + K+ ++ Sbjct: 217 -RTAFGIEFARSEA---------FQKGGRKGAKKVMIVITDGE---SHDSPDLEKVIRQS 263 Query: 358 KENFIKIVTISINA--------SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 +++ + +++ +K S P+ + +NV + +L + + Sbjct: 264 EKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 323 Query: 408 LM 409 + Sbjct: 324 RI 325 >gi|21675084|ref|NP_663149.1| hypothetical protein CT2278 [Chlorobium tepidum TLS] gi|21648324|gb|AAM73491.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 332 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 38/240 (15%), Positives = 76/240 (31%), Gaps = 56/240 (23%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + I + +D+S SM+ +++ A + A + F+D+ Sbjct: 91 SSRDTVGIDIMIALDVSDSMNTPDFGG--------------KSRFAGARTAAMRFIDN-R 135 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAY 302 +GL+ ++ + E + + T A+ A Sbjct: 136 PAD------RIGLVVFSGGSFTRCPLTLDHEVLGRLAETVAPGFFDEPGTAIGTAILTAT 189 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 L S +K ++ +TDGENN + T A I Sbjct: 190 NRL------------------KASSSKEKALVLITDGENNAGEVTPETAAR--LAANYGI 229 Query: 363 KIVTI-------SINASPN------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 +I T+ + + N G+ L+T S ++ + L+ F++I +L Sbjct: 230 RIYTVFAGKEARAFENTSNTALNRKGRSELETVARISGGRMFSAGDVFGLMKSFRDIDRL 289 >gi|158316887|ref|YP_001509395.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158112292|gb|ABW14489.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 319 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 78/253 (30%), Gaps = 54/253 (21%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 I PA ER I L +D+S SM D T++ A Sbjct: 70 VISMARPARAERVPRERA--TIILAIDVSNSMAA---------------TDITPTRLEAA 112 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K F+D + + +GL+ + + R+ V + L L P Sbjct: 113 KQGAQAFVDQL------PPRINLGLVSFAGSAAVLVP----ASTDRESVRSGIRGLQLGP 162 Query: 292 -TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + + Q + + +R P I+ L+DGE + N Sbjct: 163 ATAVGEGIYASLQAIATAGQRLSDE---------GQPPPPAAIVLLSDGETTRGRPNTQA 213 Query: 351 IKICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNAD 396 A++ I + TI+ N + L + + ++ + D Sbjct: 214 AT---AARDAEIPVDTIAYGTSDGTLDVGGQQIPVPVNEEALRELADQTGGSYHRATSGD 270 Query: 397 SLIHVFQNISQLM 409 L V++ + + Sbjct: 271 ELQSVYRGLGSSI 283 >gi|256393600|ref|YP_003115164.1| Vault protein inter-alpha-trypsin domain-containing protein [Catenulispora acidiphila DSM 44928] gi|256359826|gb|ACU73323.1| Vault protein inter-alpha-trypsin domain protein [Catenulispora acidiphila DSM 44928] Length = 1033 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 70/225 (31%), Gaps = 52/225 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L++D SGSM KM A + A +D++ Sbjct: 318 DVALILDRSGSMGGW--------------------KMTAARRAAARIVDTLTAED----- 352 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVR-----QYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ + ++E G + + V T+ P +++A +L+ Sbjct: 353 -RFAVLTFDDQMETPDGLPTGLSEATDRHRFRAVQHLATVDARGGTEMEPPLRRAATLLS 411 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 D + +I +TDG+ N + T+ K I++ T Sbjct: 412 DDNPDR-----------------DRVLILITDGQVGNEDRLLTTL----SPKLTHIRVHT 450 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + I+ + N L + + V + D L I + Sbjct: 451 VGIDTAVNAAFLQRLSTLGGGHCELVESEDRLDDAMDAIHHRIAT 495 >gi|193788521|dbj|BAG53415.1| unnamed protein product [Homo sapiens] Length = 328 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 42/234 (17%), Positives = 83/234 (35%), Gaps = 57/234 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL + + ++ Sbjct: 22 LVLDKSGSMA-------------------TGNRLNRLNQAGQLFL--LQTVELGS---WV 57 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E + R + + + + T ++ A+ ++ ++ + Sbjct: 58 GMVTFDSAAHVQSELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI----RKKY 113 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T+ I+ LTDGE+N TI C ++ K++ I T+++ Sbjct: 114 PTDGSE-------------IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVAL-GP 152 Query: 373 PNGQRL--LKTCVSSPEYH-YNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 Q L L + + + V + LI F +S + R + KG Sbjct: 153 SAAQELEELSKMTGGLQTYASDQVQNNGLIDAFGALSSGNGAVSQRSIQLESKG 206 >gi|118092778|ref|XP_421647.2| PREDICTED: similar to tumor endothelial marker 8 [Gallus gallus] Length = 555 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 65/206 (31%), Gaps = 43/206 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 F + ++D SGS+ T++ + +L +S Sbjct: 43 HGAFDLYFILDKSGSV------------------RNHWTEIYSFVESLAE-----KFISP 79 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + M I +++R ++ + E +R+ + + T K+A Sbjct: 80 M---LRMSFIVFSSRGTTIMKLTENREAIRRGLEILQYEVPGGDTFMHEGFKRA------ 130 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 N + II LTDGE + + + ++A+ + + Sbjct: 131 --------NEQIYHETYGGVRTASVIIALTDGELQDVQF-YYAEQEANRARSFGAIVYCV 181 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVV 393 + Q L T S ++ + V Sbjct: 182 GVKDFNETQ--LSTIADSIDHVFPVT 205 >gi|229596191|ref|XP_001012539.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|225565545|gb|EAR92294.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 703 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 76/218 (34%), Gaps = 44/218 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP + V+D SGSM +K+ +KN +L L+ ++ Sbjct: 208 RPNLDLICVIDNSGSM-------------------NDFSKIENVKNTILQLLEMLNEND- 247 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILT 306 + LI + T+ ++ + ++ + S+ TD ++ A+QIL Sbjct: 248 -----RLSLITFNTKAKQLCGLKNVNNQNKKSLQTITKSIKADGGTDIIRGIEIAFQILQ 302 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN--TIKICDKAKENFIKI 364 S K+++ ++ I L+DG++N + + + +E I Sbjct: 303 SRKQKNSVSS----------------IFLLSDGQDNLADAGIKNLLKTTYKQLQEESFTI 346 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + +G + K Y V D + F Sbjct: 347 HSFGFGNDHDGPLMQKIAQIKDGSFYFVEKNDQVDEFF 384 >gi|86134840|ref|ZP_01053422.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821703|gb|EAQ42850.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 349 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 34/246 (13%), Positives = 80/246 (32%), Gaps = 50/246 (20%) Query: 134 NNRLDSSNNTIFYNMDVMTS-YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + + SN + + +S + L+ I LL + + +++ +R Sbjct: 32 STQRKFSNAELLQKIAPNSSVFKSTLKLIMLLLGISFLILSLVNPKMGSKLKTVKREGVD 91 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + +D+S SM D ++ K + +D + Sbjct: 92 VVFALDVSKSMLAE---------------DIAPNRLEKAKQIISKIIDKLGSD------- 129 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYV---TRDMDSLILKPTDSTPAMKQAYQILTSDK 309 +G+I Y + + T+ + + D + + T A++ A +D+ Sbjct: 130 RVGVIIYAGNSYPLLPIT--TDHAAANMFLQNANPDMVSSQGTAINEALELAKTYYNNDE 187 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + +F++ ++DGE T ++ + +KI TI + Sbjct: 188 QTN------------------RFLVIISDGE----DHQEETKQVAQNLSNDGVKIYTIGV 225 Query: 370 NASPNG 375 G Sbjct: 226 GTEKGG 231 >gi|304382530|ref|ZP_07365025.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] gi|304336361|gb|EFM02602.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] Length = 332 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 45/246 (18%), Positives = 72/246 (29%), Gaps = 72/246 (29%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D K +M A K+ F+ Sbjct: 88 IDIMLAMDVSTSMLAE---------------DLKPNRMEAAKDVAAEFI----SGRPNDN 128 Query: 251 DVYMGLIGYTTRVEKNIEPS-------WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 +GL + + + VR ++ LI T + A Sbjct: 129 ---IGLTIFAGEAFTQCPMTTDHQSLLNLLQNVRTDLSAR--GLIEDGTAVGMGLANAVS 183 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K +S K +I LTDG NN + T + AK I+ Sbjct: 184 RLKDSKAKS------------------KVVILLTDGSNNRGDLSPMTSA--NIAKSLGIR 223 Query: 364 IVTISI---------NASPNGQRL-----------LKTCVS-SPEYHYNVVNADSLIHVF 402 + TI + G + L S + Y N + L ++ Sbjct: 224 VYTIGVGTNKVAPYPMPVAGGIQYVNIPVEIDTKTLSGIASVTHGNFYRATNNNELKQIY 283 Query: 403 QNISQL 408 ++I +L Sbjct: 284 KDIDKL 289 >gi|288800165|ref|ZP_06405624.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] gi|288333413|gb|EFC71892.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] Length = 323 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 47/246 (19%), Positives = 70/246 (28%), Gaps = 72/246 (29%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D K +M A K F+ D + Sbjct: 79 IDIMLAMDVSTSMLAE---------------DLKPNRMEAAKKVAAEFIS--DRANDN-- 119 Query: 251 DVYMGLIGYTTRVEKNIEPS-------WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 +GL + + + VR + LI T + A Sbjct: 120 ---IGLTIFAGEAFTQCPMTTDHASLLNLLQGVRTDIASR--GLIADGTAVGMGLANAVS 174 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K +S K II LTDG NN + T AK I+ Sbjct: 175 RLKESKAKS------------------KVIILLTDGSNNMGDISPLTAAQ--IAKSLGIR 214 Query: 364 IVTISIN---------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + TI + A + + L S+ Y N L ++ Sbjct: 215 VYTIGVGTNTVAPYPVTVGGTTQYVNVPAEIDTKTLKDIAQSTDGGFYRATNNAELKEIY 274 Query: 403 QNISQL 408 +I +L Sbjct: 275 NDIDRL 280 >gi|238755460|ref|ZP_04616800.1| hypothetical protein yruck0001_3370 [Yersinia ruckeri ATCC 29473] gi|238706301|gb|EEP98678.1| hypothetical protein yruck0001_3370 [Yersinia ruckeri ATCC 29473] Length = 465 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 43/274 (15%), Positives = 80/274 (29%), Gaps = 59/274 (21%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 LL+ + + ++K I F +S+ F+ +I L K + A L Sbjct: 18 LLNTILMF----LENKKGGIIIPFFISLPFFIAIIMLLFDFTQLINNKIKLSDALEQGAL 73 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A + + + D + I +I L T+ NI Sbjct: 74 A----LTAENNAKNDTRNN--------------ELISAYINFYLGHRH---QLTQYNNIT 112 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + + + + + +L Y++D F L++ FI Sbjct: 113 VNYQQNPDRLYHTQLSQ------YHIDANIEQPTLFPFTSLLIDHDN------FIIGGSA 160 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + P + V D SGSM ++ + + +K+ LK D I Sbjct: 161 AAIKDVPAMDVVFVTDFSGSMEGDFHNPDD---------PEVLSKLDELKRIFFKIADDI 211 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + SWGT+ Sbjct: 212 YTANKDS-------------TISFSPFSWGTKSA 232 >gi|81897704|sp|Q8BVM2|ANTRL_MOUSE RecName: Full=Anthrax toxin receptor-like; Flags: Precursor gi|26346064|dbj|BAC36683.1| unnamed protein product [Mus musculus] Length = 641 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 68/207 (32%), Gaps = 43/207 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + IF + LV+D SGS+ + S KK T Sbjct: 72 QGIFDLYLVLDKSGSVADNW----IHIYSFAEGLVKKFT--------------------- 106 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ + +I Y+T E + + ++++ + + + T +++A Sbjct: 107 -NPNLRISIITYSTEAEVILPLTSDSKEINKSLLVLKSIVPQGLTHMQKGLRKA------ 159 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 N + + II LTDG K ++T++ KA+ + T+ Sbjct: 160 --------NEQIRKSTLGGRIVNSVIIALTDGL-LLLKPYLDTMEEAKKARRMGAIVYTV 210 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVN 394 + Q L P+ + V Sbjct: 211 GVFMYSKQQ--LVNIAGDPDRCFGVDE 235 >gi|329849363|ref|ZP_08264209.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328841274|gb|EGF90844.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 590 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 55/317 (17%), Positives = 96/317 (30%), Gaps = 66/317 (20%) Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT------- 152 N K + S+V TE S + AN R ++ T V T Sbjct: 126 NTEKYNGESVSSVMRVTETPVSTFSVDVDTGAYANVRRMLNDGTTPTEAAVRTEELLNYF 185 Query: 153 SYDYRL-----------QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 YDY L + +++ + ERP + +VD+SG Sbjct: 186 RYDYPLPQDRSKPFSITTDVAQTPW-NAQTRLMRVGLRAYDVPRSERPAANLVFLVDVSG 244 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 SM K+ +K AL + D++ V VY G G Sbjct: 245 SM-------------------NDPDKLPLVKTALSMLSDNLRPDDKVSIVVYAGAAG--- 282 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 +YV + ++ L T M AY + NF + Sbjct: 283 -------MVLAPTHEGKYVKQALECLSAGGSTAGGQGMALAY--------ATAEANFIKG 327 Query: 321 GVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 G+ +I TDG+ N S + + +E+ + + + + L+ Sbjct: 328 GINR--------VILATDGDFNVGISSIGEVEALVKQNRESGVTLTALGFGTGNYNEALM 379 Query: 380 KTCVSSPEYHYNVVNAD 396 + +Y +++ Sbjct: 380 EKMADVGNGNYAYIDSA 396 >gi|224370037|ref|YP_002604201.1| hypothetical protein HRM2_29500 [Desulfobacterium autotrophicum HRM2] gi|223692754|gb|ACN16037.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 332 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 76/238 (31%), Gaps = 61/238 (25%) Query: 191 FLIELVVDLSGSMHC-AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 I L +DLS SM D VN ++ A+KN + F+ + Sbjct: 87 INIILALDLSKSMAALDFKLDGAIVN-----------RLDAVKNVVKDFIMK-------R 128 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 +G++ + + + + + + ++R T AM + + L K Sbjct: 129 SGDRIGMVVFGSEAFTQMPLTRDYDTIAFVLSRLKIGAAGPSTAIGDAMGISLKRLEDVK 188 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +S +I LTDG++N+ + D A+E +K+ TI + Sbjct: 189 SKSN------------------IVILLTDGKSNSGEITPGAAA--DIARERGVKVYTIGV 228 Query: 370 ----NAS-----------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 A LK + + + DSL ++ I Sbjct: 229 GQRGKAPFLVNDPLFGQRYVYQMVDMDHEALKEIADKTGGAFFAAADTDSLKKIYDMI 286 >gi|310641811|ref|YP_003946569.1| von willebrand factor type a [Paenibacillus polymyxa SC2] gi|309246761|gb|ADO56328.1| von Willebrand factor type A [Paenibacillus polymyxa SC2] Length = 429 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 43/226 (19%), Positives = 81/226 (35%), Gaps = 52/226 (23%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G + + + LV+D SGSM +DP++ A + +D Sbjct: 109 GRKSVQDLVLVIDNSGSMQ---QTDPDNERL----------------TAAKSLIGQMD-- 147 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDSL---ILKPTDSTPAMKQA 301 D + ++ + + + + GT+ +Q V +DS+ + T+ A+ + Sbjct: 148 ----GDKRVAIVSFESTAQLVQPFTPIGTDAEKQAVYSKIDSMQTIMSGGTEIGLALDET 203 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + + + +I L+DG + + T A++ Sbjct: 204 IKEIETQGNAEKGS----------------LVIMLSDG---FSELDTQTALAPYIARQ-- 242 Query: 362 IKIVTISIN-ASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 I I TI + A G LL+ S + + NV NA L F I Sbjct: 243 IPINTIGLKLAESEGIALLQNIASLTGGTYSNVANAQGLTQAFGKI 288 >gi|117921591|ref|YP_870783.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117613923|gb|ABK49377.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 613 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 71/237 (29%), Gaps = 54/237 (22%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 E + ++D+SGSM K+ L+ AL + +D Sbjct: 216 SKAELGASNLVFLLDVSGSMA-------------------SPDKLPLLQTALKMLTQQLD 256 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAY 302 V VY G G + Q + ++ L T+ ++ AY Sbjct: 257 AQDKVSIVVYAGAAG--------VVLDGAAGNDTQTLNYALEQLSAGGSTNGAQGIQLAY 308 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-NTIKICDKAKENF 361 Q ++ + +I TDG+ N +N+ I + K+ Sbjct: 309 Q-------------LAQKHFVEGGINR---VILATDGDFNVGTTNLDELIDLVSARKQQG 352 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---------DSLIHVFQNISQLM 409 I + T+ L++ Y +++ + L I++ + Sbjct: 353 IGLTTLGFGMGDYNDHLMEQLADKGNGQYAYIDSINEARKVLVEHLSATLLTIAKEV 409 >gi|301768024|ref|XP_002919431.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Ailuropoda melanoleuca] Length = 922 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 76/234 (32%), Gaps = 56/234 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ + A FL + + + ++ Sbjct: 308 LVLDKSGSMSGF-------------------NRLNRMNQAAKHFL--LQTIENGS---WV 343 Query: 255 GLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + K R + + T M+ A+Q++ + Sbjct: 344 GMVHFDSTANIKSNLIQIISSKERNNLLESLPKAANGGTSICAGMRSAFQVIREVYPQID 403 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 + I+ LTDGE+N+ K C D+ ++ I I++ Sbjct: 404 GSE----------------IVLLTDGEDNSAKD-------CIDEVTQSGAIIHLIAL-GP 439 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADS---LIHVFQNIS---QLMVHRKYSVILKG 420 Q +++ + H+ + LI F ++ + + + KG Sbjct: 440 SADQAVIEMSAMTGGNHFFASDEAQNNGLIDAFGALASGNTDLSQQPLQLESKG 493 >gi|281352223|gb|EFB27807.1| hypothetical protein PANDA_008059 [Ailuropoda melanoleuca] Length = 907 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 76/234 (32%), Gaps = 56/234 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ + A FL + + + ++ Sbjct: 309 LVLDKSGSMSGF-------------------NRLNRMNQAAKHFL--LQTIENGS---WV 344 Query: 255 GLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + K R + + T M+ A+Q++ + Sbjct: 345 GMVHFDSTANIKSNLIQIISSKERNNLLESLPKAANGGTSICAGMRSAFQVIREVYPQID 404 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 + I+ LTDGE+N+ K C D+ ++ I I++ Sbjct: 405 GSE----------------IVLLTDGEDNSAKD-------CIDEVTQSGAIIHLIAL-GP 440 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADS---LIHVFQNIS---QLMVHRKYSVILKG 420 Q +++ + H+ + LI F ++ + + + KG Sbjct: 441 SADQAVIEMSAMTGGNHFFASDEAQNNGLIDAFGALASGNTDLSQQPLQLESKG 494 >gi|218778178|ref|YP_002429496.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218759562|gb|ACL02028.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 480 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 34/235 (14%), Positives = 69/235 (29%), Gaps = 43/235 (18%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 + +L E + + +V+D SGSM K+ Sbjct: 74 VLTMTADEVLAPEQTKTKPVDMVIVLDRSGSMGGQ--------------------KVRDA 113 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K A+ ++ + L+ Y+ V + T R + +DS+ Sbjct: 114 KAAVKGLVEGLRSQD------RFSLVTYSNSVNGGDGLHYLTADKRNSLNWMVDSIP--- 164 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 A R + +I ++DG+ N ++ N + Sbjct: 165 ---------AGGGTNLGGGLEKGVGVLRAYGAPDRMGK---VILISDGQANQGVTDPNQL 212 Query: 352 KICDKAKENFI--KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +++ + + T+ I N Q + +Y + N + VFQ Sbjct: 213 AAMAALRDDGLVYSVTTVGIGQDFNEQLMATVADGGRGRYYYLENPGDFLAVFQE 267 >gi|260578579|ref|ZP_05846489.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] gi|258603294|gb|EEW16561.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] Length = 646 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 48/231 (20%), Positives = 81/231 (35%), Gaps = 65/231 (28%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 L++D SGSM +T+MAA K+A LF V D Sbjct: 42 MLILDASGSM--------------KTPDAGGQTRMAAAKDAAQLF------SVAVPSDAE 81 Query: 254 MGLIGYTTRV-EKNIEPSWGTEKVRQ-------YVTRDMDSL----ILKPTDSTPAMKQA 301 +G + Y T V E G + V+ VT+ + T PA++QA Sbjct: 82 LGFMVYGTEVGNSPEERDAGCKDVKTLLPVEKGNVTKIPAEVGKVEASGHTPMGPALRQA 141 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA---K 358 + L D +RS I+ ++DGE+ +C+ A K Sbjct: 142 AEELPKDGERS--------------------IVLVSDGEDTCAPP-----PVCEVAKDLK 176 Query: 359 ENFI--KIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + I I T+ ++ L+ C++ + + + SL + + Sbjct: 177 KEGIDLTINTVGFLVDSKARKELE-CIAEAGGGEYMDAKDTVSLADSMKRL 226 >gi|225850253|ref|YP_002730487.1| putative von Willebrand factor type A domain protein [Persephonella marina EX-H1] gi|225645927|gb|ACO04113.1| putative von Willebrand factor type A domain protein [Persephonella marina EX-H1] Length = 304 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 87/238 (36%), Gaps = 53/238 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 E+ ++ I + +D+S SM K++ K+ K L FL D Sbjct: 79 SEKRVYNIIIALDVSNSM-------------------KEKNKLKISKEILRDFLLKRDEE 119 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQI 304 +G++ + + + + + + ++ +++ + T + +A + Sbjct: 120 D------RIGILVFDNLPFRLMPLTSDRGALLRVISIIRPAMVDVGGTAMYDGLVEALNM 173 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 D++ K II LTDG + N K + + + ++ KI Sbjct: 174 FMKDRR-------------------NKIIILLTDGGDINSKYTLEDVVRFN--QDIGAKI 212 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVN--ADSLIHVFQNISQL----MVHRKYSV 416 TI +++ N L + ++ + V +L VF I++L + ++ V Sbjct: 213 YTIGVSSGMNFYVLERLSEATGGKAFFVTKDYQKALRSVFDEINRLEPSYIQEYRFDV 270 >gi|224046761|ref|XP_002188607.1| PREDICTED: collagen, type XXII, alpha 1 [Taeniopygia guttata] Length = 1598 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 59/161 (36%), Gaps = 23/161 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMK 299 ++ + +G++ Y+ R + G K R+ + + T++ A++ Sbjct: 67 VETFEIGPDKTRVGVVRYSDRPSTEFDL--GKYKTREEIKEAARKIQYYGGNTNTGDALR 124 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 S + G ++ +K I LTDG + +F + T A + Sbjct: 125 YITTYSFSKEA----------GGRLSDRTVKKVAILLTDGRSQDFVLDPAT-----AAHQ 169 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 I+I + + + L S P + ++V + +++ Sbjct: 170 AGIRIFAVGVG--EALKEELDEIASEPKSAHVFHVSDYNAI 208 >gi|157738379|ref|YP_001491063.1| hypothetical protein Abu_2179 [Arcobacter butzleri RM4018] gi|157700233|gb|ABV68393.1| hypothetical protein Abu_2179 [Arcobacter butzleri RM4018] Length = 1866 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 48/313 (15%), Positives = 98/313 (31%), Gaps = 25/313 (7%) Query: 37 IGFLIYVLDWHYKKNSMESAN-NAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAK 95 + S+ +A + S ++ N+ S +K + Sbjct: 1226 GSVFDDTTTVYKYDISLSAALTDTDGSETLSVIIKNVPASATLESSKYEVSKNSDGSYTV 1285 Query: 96 RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + + S + + +NI T +N + T + T Sbjct: 1286 KVPQGETSIS-DKLTMKVPQEDAKNINLQIEAKATEARDNEDGQNFKTATDSTTDKTPTL 1344 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + ++N + I+ ++ + + I LVVD SGSM ++ + Sbjct: 1345 VVGSNKDSVINGGAGKDILIGDTGGTQLNVQAGKNYNIALVVDTSGSM--------KEAS 1396 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + +++ LK+AL DS+ + + +I + T ++ I + T K Sbjct: 1397 GSKTAWGTTISRIDLLKDALKNLADSLKGHDG---KINVSIIDFDTNAKEPITFNDLTSK 1453 Query: 276 VRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + +D+L T+ Y+ F T G ++ Sbjct: 1454 NISDLITKIDALKAEGGTN--------YEDAFLKTTSWFDTQSVTYGKAQG---YENLTY 1502 Query: 335 FLTDGENNNFKSN 347 FLTDG+ N Sbjct: 1503 FLTDGDPTFSNRN 1515 >gi|292490950|ref|YP_003526389.1| von Willebrand factor A [Nitrosococcus halophilus Nc4] gi|291579545|gb|ADE14002.1| Forkhead-associated protein [Nitrosococcus halophilus Nc4] Length = 510 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 64/178 (35%), Gaps = 25/178 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT--DSTPAMK 299 ++ ++ ++ D + +I + + ++K R + ++D L +S M+ Sbjct: 70 LNFINGLRGDSQIAIILFDGGTSVAMPLVPVSQKTRDALVTNLDKLTYTGRFRNSAAGME 129 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKI---- 353 +A L +K II LT+G E + K + + + Sbjct: 130 RALYEL----------------KNFGRPEAEKAIILLTNGPIETGDEKRDRDFSRWMQEY 173 Query: 354 -CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 +A E IK+ I+ + + L ++ +Y A L F I +++ Sbjct: 174 LAHEAAEAGIKVFGIAFTEAADFHLLQILAHTTGGTYYRAPQAVDLQSAFNRIRKVIT 231 >gi|325678986|ref|ZP_08158584.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324109490|gb|EGC03708.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 782 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 89/235 (37%), Gaps = 37/235 (15%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + E + + ++D SGSM P+++ D ++ ++ Sbjct: 277 SVNEPAVTRVAFLIDNSGSM------YPKELCPTSSENDVDFKRLDFTQS---------- 320 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAY 302 L+ +D +G+ +T K + + ++R+ + R + I T + A+K+ Sbjct: 321 LIDKFDDDFRIGISKFTGTYTKMCDFTDDRTELRKVLNRIRTEDEIFDGTYNQTALKKCI 380 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++ + I+ L+DGE++ + + + A E + Sbjct: 381 NEFSAA----------------GDGKYVNIIVMLSDGESDEVDAETI-ESLSNLANEKSV 423 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 ++T+ + + L + S+ +Y+ +A SL V++ I + Y ++ Sbjct: 424 IVLTVGLGREIDRAWLQEVAYSTGGKYYSASDATSLDDVYKQI---VTTLNYDIV 475 >gi|198422181|ref|XP_002120553.1| PREDICTED: similar to predicted protein [Ciona intestinalis] Length = 1038 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 80/220 (36%), Gaps = 26/220 (11%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I +V+D SGSM + ++ + + + + A + A + Sbjct: 193 IVIVIDKSGSMGVTNMNLAKEAAKSVVNTLNPQDRFAVM--AFSSIFVPFQSTVASDQCF 250 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + +N + KV +V T+ PA+++A+ + S Sbjct: 251 ---ATTFADASPQNKK------KVEDFVDTISSG---GGTNYAPALQKAFSFFQQEPSVS 298 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISINA 371 F KI + I+F++DG N+ S + + +I ++ N + I+T + Sbjct: 299 DF-----NIKKIDPSEIDRVILFMSDGIPNDPGSTILSAQIRANEQLNNSVIILTYGLGN 353 Query: 372 SPNGQRLLKTCVSSPEYHY----NVVNADSLIHVFQNISQ 407 + G +L+ ++ Y N + + +F +I Sbjct: 354 ADFG--VLRNMATNKGDVYGILKNPNIPEPRVGLFTHIPH 391 >gi|68535223|ref|YP_249928.1| hypothetical protein jk0158 [Corynebacterium jeikeium K411] gi|68262822|emb|CAI36310.1| hypothetical protein jk0158 [Corynebacterium jeikeium K411] Length = 646 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 48/231 (20%), Positives = 81/231 (35%), Gaps = 65/231 (28%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 L++D SGSM +T+MAA K+A LF V D Sbjct: 42 MLILDASGSM--------------KTPDAGGQTRMAAAKDAAQLF------SVAVPSDAE 81 Query: 254 MGLIGYTTRV-EKNIEPSWGTEKVRQ-------YVTRDMDSL----ILKPTDSTPAMKQA 301 +G + Y T V E G + V+ VT+ + T PA++QA Sbjct: 82 LGFMVYGTEVGNSPEERDAGCKDVKTLLPVEKGNVTKIPAEVGKVEASGHTPMGPALRQA 141 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA---K 358 + L D +RS I+ ++DGE+ +C+ A K Sbjct: 142 AEELPKDGERS--------------------IVLVSDGEDTCAPP-----PVCEVAKDLK 176 Query: 359 ENFI--KIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + I I T+ ++ L+ C++ + + + SL + + Sbjct: 177 KEGIDLTINTVGFLVDSKARKELE-CIAEAGGGEYMDAKDTVSLADSMKRL 226 >gi|39933553|ref|NP_945829.1| hypothetical protein RPA0476 [Rhodopseudomonas palustris CGA009] gi|39647399|emb|CAE25920.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 443 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 53/393 (13%), Positives = 122/393 (31%), Gaps = 39/393 (9%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 F + + N ++IFAL ++ + +G + + ++ +++A +AA + S Sbjct: 3 FSRSIARFSRDRRGNIAVIFALVLVPLISAVGCAVDYSRANALRSKLQAAADAASVGAVS 62 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + G + + DDA R ++ G + + + +S Sbjct: 63 RTSPAYVAAGAMS---GDGVISSGADDALRIFNGNLN----GLTGYTLASSSATVTKASD 115 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + + + + S+ M M + ++ P++ Sbjct: 116 VVTSQVTFSAQISTMFMKVVGMSAMAVGGTS-TATASMPKYIDFYLLLDNSPSMGVGATP 174 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR-----TKMAALKNALLLFLDS 241 +I + S HCA DVN+ + + T++ L++A +D+ Sbjct: 175 TDVSAMIAATANKSSDDHCAFACH--DVNNKNNYYNLAKALGITTRIDVLRSATQQLMDT 232 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIE----PSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + M + + + + + +D + +K + Sbjct: 233 ATATATYSNQFRMAIYDFGASAQTAGLRNLFSLSASLSSAKTAASAIDLMTVKGQNDNND 292 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------------ 345 +Y + N S QK++ F++DG + + Sbjct: 293 QDTSYTAIFPAIN-----NEISSPGSGVSGSPQKYLFFVSDGVADEYNPSCLKPKTGNRC 347 Query: 346 SNVNTIKICDKAKENFIKI---VTISINASPNG 375 + +C K+ IKI T +N N Sbjct: 348 QSPINPALCKTLKDRGIKIAVLYTTYLNLPSND 380 >gi|312621140|ref|YP_004022753.1| yd repeat protein [Caldicellulosiruptor kronotskyensis 2002] gi|312201607|gb|ADQ44934.1| YD repeat protein [Caldicellulosiruptor kronotskyensis 2002] Length = 2994 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 53/350 (15%), Positives = 109/350 (31%), Gaps = 56/350 (16%) Query: 50 KNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY 109 KN + SA A+ V LS I+ + + +K + +++ Sbjct: 632 KNPLTSAEKYAVSEDGKVFVKALSDANIL---IAPLQVKRSDNVFINSLKGIVGKAIEIT 688 Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 + F + +VN + + N L + + + Sbjct: 689 AGGFDIKRAEIVVNYDEAELNGVDENNLMLYYVNYDKKILEPLEDVVVDTVYNRVSGKT- 747 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + +F+ + + I V+D SGSM D ++ Sbjct: 748 -EHFSTFLLGD-KNMPVDLSKVDIVFVLDNSGSMSS---------------NDPNYYRIE 790 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 A K F+ +ID L++ +GL+ + + V + K+ Q + Sbjct: 791 ATKK----FIQNIDELNN-----RVGLVDFDSSVYVRSNLTSDKNKLLQALNAMR--WTG 839 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T+ +K A L ++ +K I+ L+DG +N + Sbjct: 840 GSTNIGGGLKAAL-ELFDQEQS------------------KKIIVLLSDGYHNTGIHPND 880 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSL 398 + + + I + TI++ + LL + ++ V N L Sbjct: 881 VLP---ELIKQEIVVNTIAL-GKDCDRELLHDIADKTKGDYFYVDNTGGL 926 >gi|283778313|ref|YP_003369068.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283436766|gb|ADB15208.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 591 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 68/186 (36%), Gaps = 17/186 (9%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 + N + A+ ++ L +I F + V + + ++ + +AA LAGA +V Sbjct: 22 RRGNIIVFTAVLMVVMLGMIAFAVDVGYMYTMQTQLQRSVDAAALAGAGSLVEGTDIAQA 81 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 + + + R + + F+ E + A F + + A+ R Sbjct: 82 KA---TEYLVRNPVGSSMTFVNE---EEVPAKIAQFVAEHGDDFEVEAG---EWNASTRS 132 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 + NT+ + V Y F +L + S + +P I +V+ Sbjct: 133 FETTNTLPSTLSVSMEYPTMPTFFGKILGKDSFSIRASSV-------AMYQPR-DIMVVL 184 Query: 198 DLSGSM 203 D SGSM Sbjct: 185 DFSGSM 190 >gi|332809378|ref|XP_003308230.1| PREDICTED: LOW QUALITY PROTEIN: epithelial chloride channel protein-like [Pan troglodytes] Length = 901 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 43/273 (15%), Positives = 91/273 (33%), Gaps = 63/273 (23%) Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 N + ++ +++ + + L + + R +LL+ + + Sbjct: 260 PNLQNKMCDHRSTWDVIMSSEDFQHLSPMTEINLPR-------PTFSLLKSKQR-----V 307 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D SGSM+ + + A L + I+ S V Sbjct: 308 VCLVLDKSGSMNAEDHLFRMNQA------------------AELYLIQIIEKGSLV---- 345 Query: 253 YMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 GL+ + + + + + Q +T ++ T +K +Q ++ + Sbjct: 346 --GLVTFDSFAKIQSKLIKIIDDNTYQKITANLPQEADGGTSICRGLKAGFQAISQSNQS 403 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 +F + II LTDGE+ +C + K++ I TI++ Sbjct: 404 TFGSE----------------IILLTDGEDYQIS-------LCFGEVKQSGTVIHTIAL- 439 Query: 371 ASPNGQRLLKTCVSSPEY-HYNVVNADSLIHVF 402 + L + + Y N + LI F Sbjct: 440 GPSADEELETLSNMTGGHRFYAHKNINGLIDAF 472 >gi|313675311|ref|YP_004053307.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312942009|gb|ADR21199.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 322 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 72/208 (34%), Gaps = 42/208 (20%) Query: 172 KIVSFIPALLRIEMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 IV+ + + E I + VDLS SM D +++ Sbjct: 57 LIVALLGPSFGDDKKEIKAVGKDIMISVDLSASMDA---------------NDVAPSRLE 101 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 +K L +D+ + +GLI +++ ++ + ++ L+ Sbjct: 102 KIKYELKNIVDAFNSD-------RIGLIIFSSEAFVQCPLTYDQNALNLFIETLNTGLVP 154 Query: 290 KP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD A+ A++ LTS+ S S K II ++DGE+ + Sbjct: 155 GSSTDFGSALNMAHEKLTSEAAPS-------------SQQKSKIIILISDGEDFGDDTEG 201 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQ 376 KI D I++ ++ + + Sbjct: 202 AVSKINDS----GIRLFSLGVGTEQGSK 225 >gi|298713908|emb|CBJ33776.1| conserved unknown protein [Ectocarpus siliculosus] Length = 977 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 24/204 (11%), Positives = 52/204 (25%), Gaps = 7/204 (3%) Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 S C S+ + + +A + ++ D KE + Sbjct: 43 SCPCGAGSELTFQIEGETSVEANKVNVAVIIDSSGSIFDIDGAFLLEKEFAKNVAASFAA 102 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + GT + D + D + R+ Sbjct: 103 KNLFTNG---GTASYASFSDAASDGGTFGSEAEFNEFVDNASWIEGDTNIEAGLSKGREL 159 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 + + F+I +TDG+ N D A++ + + + + L + Sbjct: 160 LANGTSTRTSFLILITDGDWNRGGDPQIEA---DAARDEGTIVYAVGV-GPDVSEATLLS 215 Query: 382 CVSSPEYHYNVVNADSLIHVFQNI 405 ++ N L + I Sbjct: 216 IGGDLTNVFDASNFTELDNTLDEI 239 >gi|55981030|ref|YP_144327.1| hypothetical protein TTHA1061 [Thermus thermophilus HB8] gi|55772443|dbj|BAD70884.1| conserved hypothetical protein [Thermus thermophilus HB8] Length = 706 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 76/235 (32%), Gaps = 52/235 (22%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 +P L ++ R + LV+D+SGSM L A+ Sbjct: 290 LPEELPLKPLGREGAALVLVLDVSGSMAG-----------------------EKLSMAVA 326 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDST 295 L ++ + +G++ +++ P T + ++ + SL T Sbjct: 327 GALALVESAAPED---RLGVVVFSSGHRVLFPPRPMTAQAKKEAESLLLSLRAGGGTVLG 383 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A ++A ++L +K ++ LTDG + K I D Sbjct: 384 GAFREAVRLLQGVPGE------------------RKAVLVLTDGLIADAKE-----PILD 420 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLM 409 A+ + +++ +++ LK Y + L +F Q + Sbjct: 421 LAQTSGVEVSALAL-GPDADAPFLKELARRGGGRFYQAPSPRELPRLFLREGQEV 474 >gi|241620324|ref|XP_002408644.1| calcium activated chlorine channel, putative [Ixodes scapularis] gi|215503005|gb|EEC12499.1| calcium activated chlorine channel, putative [Ixodes scapularis] Length = 704 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 71/214 (33%), Gaps = 41/214 (19%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + I V+D+SGSM K+ L+ A L+ + Sbjct: 60 QGHIRIVFVLDVSGSMGLE-------------------NKINMLRQAASRSLE-----DN 95 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V + +G+I ++ + RQ + + S+ T A+ + Q Sbjct: 96 VPDGSDVGIITFSDNATVVAGMRTLSAATRQAIKNAVPSIARGSTAIGKALMTSVQK--- 152 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + ++ + ++ +TDGE N + + + +++ ++ Sbjct: 153 --------HGPQELERNGETAENAALLLMTDGEENEPPYINDVLP---TLLQKRLRVFSV 201 Query: 368 SI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + +G R+L + E Y + N L Sbjct: 202 PVGKEADDGLRVLS--ERTGENVYPITNTTKLAD 233 >gi|73974730|ref|XP_539177.2| PREDICTED: similar to collagen, type XXII, alpha 1 [Canis familiaris] Length = 1628 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 58/174 (33%), Gaps = 26/174 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +D + +G++ Y+ R E + + T++ A++ Sbjct: 73 VDTFEVGPDRTRVGVVRYSDRPTTAFELGLFGSREAVKAAARHLAYHGGNTNTGDALRFI 132 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + S + G + F++ I LTDG + + + A Sbjct: 133 TRHSFSRQA----------GGRPGDRAFKQVAILLTDGRSQDLVLDAA-----ATAHRAG 177 Query: 362 IKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRK 413 I+I + + A+ + L+ S P + ++V + F I ++ + Sbjct: 178 IRIFAVGVGAALREE--LEEIASEPKSAHVFHVSD-------FDAIDKIRGKLR 222 >gi|293396639|ref|ZP_06640915.1| aerotolerance protein BatA [Serratia odorifera DSM 4582] gi|291420903|gb|EFE94156.1| aerotolerance protein BatA [Serratia odorifera DSM 4582] Length = 325 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 71/183 (38%), Gaps = 38/183 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ++P+ + L++D+SGSM T++ A+KN++ F+ + Sbjct: 92 QKPMRDLVLILDVSGSMA-------------KNDVPGGITRLQAVKNSVSKFVAA----- 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GL+ + + S + ++ +T+ ++ + T A+ A ++L Sbjct: 134 --RQSDRIGLVIFANQAWPFAPVSEDKQALQTRITQLSPGMVGEQTAIGDALGVAVKLLD 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + K I LTDG N+ S + A + +++ T Sbjct: 192 SSANQDA----------------SKLAILLTDG--NDTASQLAPPLAAQLAAAHHVQVHT 233 Query: 367 ISI 369 I+ Sbjct: 234 IAF 236 >gi|115486675|ref|NP_001068481.1| Os11g0687100 [Oryza sativa Japonica Group] gi|77552567|gb|ABA95364.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113645703|dbj|BAF28844.1| Os11g0687100 [Oryza sativa Japonica Group] Length = 633 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 73/218 (33%), Gaps = 39/218 (17%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 ++ + V+D+SGSM + V +A + + +++ LK ++ + + Sbjct: 62 ADLNSHVPLDVVAVLDVSGSM-------NDPVAAASPKSNLQGSRLDVLKASMKFVIRKL 114 Query: 243 DLLSHVKEDVYMGLIGYTT---RVEKNIEPSW---GTEKVRQYVTRDMDSLILKPTDSTP 296 + ++ + + + G + + R T P Sbjct: 115 ADGD------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQA---RGGTALMP 165 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+++A +IL RQG + FI+ LTDG++ Sbjct: 166 ALEEAVKIL-----------DERQGSSRNHVG---FILLLTDGDDTTGFRWTRDAIHGAV 211 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 K + T + AS + + LL S + V + Sbjct: 212 FK---YPVHTFGLGASHDPEALLHIAQGSRGTYSFVDD 246 >gi|260782980|ref|XP_002586557.1| hypothetical protein BRAFLDRAFT_106340 [Branchiostoma floridae] gi|229271674|gb|EEN42568.1| hypothetical protein BRAFLDRAFT_106340 [Branchiostoma floridae] Length = 534 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 59/175 (33%), Gaps = 25/175 (14%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 L +G++ Y+T V T K + T++ A+K Sbjct: 176 RLDISATTTRVGMVQYSTNVTPEFMLKEHTTKKSVEKAIGDVKRLGGGTNTGKALK---- 231 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 F R + P ++ I +TDG+ + +V T ++ + Sbjct: 232 -------------FVRTEMDWRDPPTKRVAIVVTDGK---SQDDVGTPAT--ALRQAGVV 273 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 + + + + LK P Y + + D L + Q+IS +V +V L Sbjct: 274 LYAVGVGLPTDE---LKEITGDPTKVYALNSYDELQDIIQDISNSVVEGSTNVHL 325 >gi|21539497|gb|AAM53301.1| putative protein [Arabidopsis thaliana] gi|23198328|gb|AAN15691.1| putative protein [Arabidopsis thaliana] Length = 704 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 87/233 (37%), Gaps = 53/233 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + V+D+SGSM TK+A LK A+ + ++ Sbjct: 248 RAPVDLVTVLDVSGSMAG--------------------TKLALLKRAMGFVIQNLGPFD- 286 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILT 306 + +I +++ +N TE +Q + ++SL+ T+ +K+ ++L Sbjct: 287 -----RLSVISFSSTARRNFPLRLMTETGKQEALQAVNSLVSNGGTNIAEGLKKGARVLI 341 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD-------KAKE 359 + ++ ++ I+ L+DG++ ++ N + D + Sbjct: 342 DRRFKNPVSS----------------IVLLSDGQDTYTMTSPNGSRGTDYKALLPKEING 385 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-HVFQN-ISQLMV 410 N I + A + L+ + + ++ + ++++I F I L+ Sbjct: 386 NRIPVHAFGFGA-DHDASLMHSIAENSGGTFSFIESETVIQDAFAQCIGGLLS 437 >gi|15239414|ref|NP_200879.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|9759335|dbj|BAB09844.1| retroelement pol polyprotein-like [Arabidopsis thaliana] gi|332009986|gb|AED97369.1| C3H4 type zinc finger protein [Arabidopsis thaliana] Length = 704 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 87/233 (37%), Gaps = 53/233 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + V+D+SGSM TK+A LK A+ + ++ Sbjct: 248 RAPVDLVTVLDVSGSMAG--------------------TKLALLKRAMGFVIQNLGPFD- 286 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILT 306 + +I +++ +N TE +Q + ++SL+ T+ +K+ ++L Sbjct: 287 -----RLSVISFSSTARRNFPLRLMTETGKQEALQAVNSLVSNGGTNIAEGLKKGARVLI 341 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD-------KAKE 359 + ++ ++ I+ L+DG++ ++ N + D + Sbjct: 342 DRRFKNPVSS----------------IVLLSDGQDTYTMTSPNGSRGTDYKALLPKEING 385 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-HVFQN-ISQLMV 410 N I + A + L+ + + ++ + ++++I F I L+ Sbjct: 386 NRIPVHAFGFGA-DHDASLMHSIAENSGGTFSFIESETVIQDAFAQCIGGLLS 437 >gi|237808477|ref|YP_002892917.1| von Willebrand factor type A [Tolumonas auensis DSM 9187] gi|237500738|gb|ACQ93331.1| von Willebrand factor type A [Tolumonas auensis DSM 9187] Length = 316 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 40/231 (17%), Positives = 77/231 (33%), Gaps = 54/231 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P + L VD+S SM + + +++ +K+ L F+ Sbjct: 85 PSRDLLLAVDISQSMQIKDMTINGEAVD----------RLSMVKSYLQSFIKQ------- 127 Query: 249 KEDVYMGLIGY--TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +G+I + + W + + L K T A+ A + Sbjct: 128 RQGDRIGIILFADHAYLMVPFTQDWQAAGLL--LDEVNIGLAGKFTAIGEAITLAVKKTL 185 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA---KENFIK 363 + K K +I L+DG K ++NTI+ D A K + +K Sbjct: 186 HEPKPIQN----------------KTLILLSDG-----KDSINTIQPTDAAALAKASGLK 224 Query: 364 IVTISINASPNGQRL--------LKTCVS-SPEYHYNVVNADSLIHVFQNI 405 I TI I + L+ + + ++ + L ++Q I Sbjct: 225 IYTIGIGSDSTDAEAESDLDETTLEEIANMTGGQYFRARSEQDLSEIYQQI 275 >gi|260828797|ref|XP_002609349.1| hypothetical protein BRAFLDRAFT_99028 [Branchiostoma floridae] gi|229294705|gb|EEN65359.1| hypothetical protein BRAFLDRAFT_99028 [Branchiostoma floridae] Length = 421 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 30/240 (12%), Positives = 67/240 (27%), Gaps = 41/240 (17%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P+ + G+ I V+D SGS+ +A Sbjct: 19 PSFVLAASGDAG--DIMFVLDGSGSISA-----------------------DDFVSAKSF 53 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 +D + +G++ +++ + ++K T Sbjct: 54 ISRVVDAFDIAADFTRVGVVQFSSFFTEEFPLDRYSDKASLKQAIGNIPQRGGGTLLGQV 113 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + ++ K G + S + + +TDG + + D Sbjct: 114 INYLVNTSFTEAK----------GARPLSDGIPRIAVLMTDG-SAHDNPTTVLAPAIDAL 162 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + + I +I + + L+ + + V V +I L+V R +I Sbjct: 163 RASGIIAFSIGV-GPSVNRDQLEAVAGDTDRVFLVGA----YSVIDDIRDLLVERVREII 217 >gi|86740090|ref|YP_480490.1| von Willebrand factor, type A [Frankia sp. CcI3] gi|86566952|gb|ABD10761.1| von Willebrand factor, type A [Frankia sp. CcI3] Length = 319 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 43/237 (18%), Positives = 76/237 (32%), Gaps = 52/237 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R I L +D+S SM D T++AA K F+D + Sbjct: 84 RERATIILAIDVSNSMAA---------------TDIAPTRLAAAKQGASAFVDQL----- 123 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILT 306 + +GL+ + + R+ V + L L P T + + Q +T Sbjct: 124 -PPRINLGLVSFAGSATVLVP----ASADRESVRAGIRGLQLGPATAVGEGIFASLQAIT 178 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + KR I+ L+DGE + N I + A++ I + T Sbjct: 179 TAGKRFSD---------TGQSAPPAAIVLLSDGETTRGRPNNQAI---EAARQARIPVDT 226 Query: 367 ISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I+ N Q L + ++ + D L V++ + + Sbjct: 227 IAYGTADGTLDVGGQEVPVPVNEQALRDIAEQTGGSYHRATSGDELRSVYRGLGSSI 283 >gi|55251329|emb|CAH69127.1| novel protein similar to vertebrate matrilin 3 (MATN3) [Danio rerio] Length = 454 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 65/175 (37%), Gaps = 28/175 (16%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDST 295 D +D L + + ++ Y + V+ K ++Q +TR + T + Sbjct: 87 LADMVDTLDVGPDATRVAVVNYASTVKIEFLLKSHLTKDTIKQAITRIEP--LAAGTMTG 144 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+K+A ++K G + S K I +TDG + V + Sbjct: 145 MAIKKAMDEAFTEK----------SGARPKSKNISKVAIIVTDG---RPQDQVEEVS--A 189 Query: 356 KAKENFIKIVTISI-NASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 A+ + I+I + + A +L+ S+P ++ + V + L F+ Sbjct: 190 AARASGIEIYAVGVDRADMRSLKLM---ASNPLEDHVFYVETYGVIEKLTSKFRE 241 >gi|296394903|ref|YP_003659787.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] gi|296182050|gb|ADG98956.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] Length = 343 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 80/255 (31%), Gaps = 56/255 (21%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P + + + LV+D+S SM D K +++ A + A Sbjct: 86 PTAVSQVAKNQA--TVILVLDISKSMAA---------------TDVKPSRVDAARAAA-- 126 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTP 296 I + + V +G++ + + + PS E ++ + + + + L K T + Sbjct: 127 ----IKFVDGMAPTVQLGVVTFAGSAQPLVRPSTDHETAKKVIDQMIRADKLEKQTATGE 182 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI---KI 353 + A Q + + N I+ ++DG+ Sbjct: 183 GIYTALQQIETIAGALGGKNHTPPAR----------IVLVSDGKETVPDDLNAPRGAYAA 232 Query: 354 CDKAKENFIKIVTISI--------------NASPNGQRL-----LKTCVSSPEYHYNVVN 394 AKE I + T++ + L L + + + Sbjct: 233 ARTAKEKHIPVCTVAFGTKSGKITIDNQVDEVPVDLDSLKKISDLSNSPGNSCRFFPAES 292 Query: 395 ADSLIHVFQNISQLM 409 L ++Q++++ + Sbjct: 293 QAELAQIYQSLNEDI 307 >gi|295398785|ref|ZP_06808791.1| von Willebrand factor type A domain protein [Aerococcus viridans ATCC 11563] gi|294972971|gb|EFG48792.1| von Willebrand factor type A domain protein [Aerococcus viridans ATCC 11563] Length = 516 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 42/246 (17%), Positives = 85/246 (34%), Gaps = 39/246 (15%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ++ + LV+D SGSM+ + T++ K A + F + + L Sbjct: 76 QQAPVDVVLVLDRSGSMNF-------------VETPNSPTRLDYGKLAAINFAERV-LGP 121 Query: 247 HVKEDVYMGLIGYT-----TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + ++ ++ T V N + +G + + D+ S + TDS + A Sbjct: 122 NGIPGSRVSVVSFSGPAYATGVRNNPQRHYGQQN-QATTDLDLSSDLRAVTDSINRI-TA 179 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV------------- 348 + ++ + + K +I LTDG N Sbjct: 180 FGGTNTEAGFEQGRSVIEGTTSNQNPNSNKVVIMLTDGLPTASNGNPYAETTDINHVHIQ 239 Query: 349 NTIKICDKAKENFIK-IVTISINA--SPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQ 403 I +N I + TI + + + L ++ + +Y +A L +F+ Sbjct: 240 RAINAGKNIYQNDIADVFTIGLTTGMNATEKALADNILTQAQNKGYYPAPSATDLDAIFE 299 Query: 404 NISQLM 409 ISQ + Sbjct: 300 EISQRL 305 >gi|294055226|ref|YP_003548884.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293614559|gb|ADE54714.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 330 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 67/194 (34%), Gaps = 37/194 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ + L VDLSGSM D ++ +++A K L F+ Sbjct: 90 EKTARDLMLAVDLSGSM---------DAADFVDASGEQIDRLSAAKGVLNEFVA------ 134 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E +GLI + + E + + ++ T ++ A Sbjct: 135 -GREGDRLGLIVFGNAAYLQAPFTDDHETWLALLDESIVNMAGPSTALGDSIGLA----- 188 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 FRQ S + +I LTDG N+ S V + + AK + I T Sbjct: 189 --------IAHFRQ-----SKTENRVLIVLTDG--NDTGSRVPPLDAAEVAKVEGVTIYT 233 Query: 367 ISINASPN-GQRLL 379 +++ G+ L Sbjct: 234 VAVGDPTTVGEEAL 247 >gi|221108467|ref|XP_002170770.1| PREDICTED: similar to collagen, type XXIX, alpha 1, partial [Hydra magnipapillata] Length = 592 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 52/402 (12%), Positives = 126/402 (31%), Gaps = 49/402 (12%) Query: 4 LSRFRFYFKKGIASEKANFSIIF--ALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 L ++ K + S + F + F + + L L+ + NS + A + + Sbjct: 22 LRKYYQNEKDFLKSVISAFGVSFNGTRAAVITFSYHAQLSIKLNKYSNLNSFKEAVDNIV 81 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 L G++ + RL + + R + + + + G + Sbjct: 82 LMGSTTRIDKALRLAQKEVFELENGARPGVAKILFLLTDGSQTQERGSEN-----PVAIA 136 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 +T + ++ N + ++ Y + E L + + I + + Sbjct: 137 NELRSAGVTIIVIGITNAVNVSELSDIAGGEENAYATESFEKLKDVNFLDVIKTKMCETA 196 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++E I I ++D SGS+ ++ + ++ +A + Sbjct: 197 KVEPTCEAIVDIVFLLDSSGSL--------------RKYYQNEKDFLKSVISAFGVSF-- 240 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 +I ++ + +I+ + ++ V + L+ T A++ Sbjct: 241 --------NGTRAAVITFSYHAQLSIKLNKYSNLNSFKEAVDNIV--LMGSTTRIDKALR 290 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + + + K + LTDG + + N + I ++ + Sbjct: 291 LAQKEVFELEN-------------GARPGVAKILFLLTDGSQTQERGSENPVAIANELRS 337 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + I+ I I + L E Y + + L V Sbjct: 338 AGVTIIVIGIT-NAVNVSELSDIAGGEENAYAAESFEKLKDV 378 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 54/402 (13%), Positives = 130/402 (32%), Gaps = 49/402 (12%) Query: 4 LSRFRFYFKKGIASEKANFSIIF--ALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 L ++ K + S + F + F + + L L+ + NS + A + + Sbjct: 218 LRKYYQNEKDFLKSVISAFGVSFNGTRAAVITFSYHAQLSIKLNKYSNLNSFKEAVDNIV 277 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 L G++ + RL + + R + + + + G + Sbjct: 278 LMGSTTRIDKALRLAQKEVFELENGARPGVAKILFLLTDGSQTQERGSEN-----PVAIA 332 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 +T + ++ N + ++ Y + E L + + I + + Sbjct: 333 NELRSAGVTIIVIGITNAVNVSELSDIAGGEENAYAAESFEKLKDVNFLDVIKTKMCETA 392 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + I ++D SGS+ Q++K LK+A+ F S Sbjct: 393 TFQPTCEAVVDIVFLLDSSGSL-------------RKYYQNEK----DFLKSAISAFGVS 435 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 ++ +I ++ + +I+ + ++ V + L+ T A++ Sbjct: 436 VN-------GTRAAVITFSYHAQLSIKLNKYSNLNSFKEAVDNIV--LMGSTTRIDKALR 486 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + + + K + LTDG + + N + I ++ + Sbjct: 487 LAQKEVFELEN-------------GARPGVAKILFLLTDGSQTQERGSENPVAIANELRS 533 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + I+ I I + + L ++ + + L V Sbjct: 534 AGVTIIVIGITNAVDVSELFDIAGGEENAYF-ADSFEKLKDV 574 >gi|189485266|ref|YP_001956207.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287225|dbj|BAG13746.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 333 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 58/184 (31%), Gaps = 42/184 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 I + +D S SM + +M A K + F+ Sbjct: 86 SDQGIDIIVALDTSTSMRSL--------------DFRSLNRMEAAKKVIRDFMKE----- 126 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQIL 305 ++ +GL+ ++ + + + +++ + L T A+ + L Sbjct: 127 --RKYDRIGLVIFSGLAFTQCPLTTDKDSLAEFINNINIGDTGLDGTAIGSAIMTSVNRL 184 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + +S + II +TDG NN + + T A+ IKI Sbjct: 185 KDSRAKS------------------RIIILVTDGNNNMGEIDPLTAS--KIARSYDIKIY 224 Query: 366 TISI 369 + + Sbjct: 225 AVGV 228 >gi|256784255|ref|ZP_05522686.1| secreted protein [Streptomyces lividans TK24] gi|289768140|ref|ZP_06527518.1| secreted protein [Streptomyces lividans TK24] gi|289698339|gb|EFD65768.1| secreted protein [Streptomyces lividans TK24] Length = 421 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 75/227 (33%), Gaps = 33/227 (14%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 A GE ++LV+D+SGSM T+MAA K A Sbjct: 25 PAGASAGEPTGEDAP-KVDLVLDVSGSM--------------RTRDIDGGTRMAAAKQAF 69 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L E+V +G+ + + + + Y + P D T Sbjct: 70 NEVL------DATPEEVQLGIRTLGADYPGDDRKTGCKDTAQLY--------PVGPLDRT 115 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A K A L+ + + K I+ ++DGE+ + + Sbjct: 116 EA-KTAVATLSPTGWTPIGPALLKAADDLDGGDGSKRIVLISDGEDTCAPLDPCEVAREI 174 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 AK + I T+ + + +R L C++ + +V + D L Sbjct: 175 AAKGIGLTIDTLGLVPNTKMRRQLS-CIAEATGGTYTSVEHTDELTD 220 >gi|254167839|ref|ZP_04874688.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] gi|197623130|gb|EDY35696.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] Length = 1953 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 37/246 (15%), Positives = 77/246 (31%), Gaps = 24/246 (9%) Query: 120 NIVNSSRISMTHMANNRLDSSNNTIF---YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + +N +S + + S +T Y D T Y + + ++ + Sbjct: 1177 DFINRRDVSTNVVGDWDWTLSGDTDSDNVYTQDGNTFYTNDTKVLNFTGGAYHSASSAFW 1236 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 +P R+ +R I V+D SGSM+ + P + T++ A + Sbjct: 1237 MPEAPRLN--KRKPIDIIFVIDTSGSMNSVV---PGATVGDVNGDGRSNTRIDVAIQAAI 1291 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDST 295 + + V + G + E+ + ++ T ++ ++ D T Sbjct: 1292 DAVKELGPQDRVAVFTFNG----NSHPEEYMGFTYVTADNLQTIISDLKDIQAGGGTPLY 1347 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + A + + +G I+ LTDG +N + D Sbjct: 1348 DTLSWAVYYMDKYSTDNPDREDATRG-----------ILVLTDGLSNYDTYGTSNGARYD 1396 Query: 356 KAKENF 361 A Sbjct: 1397 YAPGTG 1402 >gi|297624820|ref|YP_003706254.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] gi|297166000|gb|ADI15711.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] Length = 802 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 83/244 (34%), Gaps = 49/244 (20%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 R + V + LR E+ E P+ + +V+D+S SM Sbjct: 344 WYRTPVEAVLPVTTDLRTEV-EVPLVALVIVMDVSQSM-----------------TAGNP 385 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDM 284 +++ K + +D +G I ++ R E P T + R+ + Sbjct: 386 SRLELAKEGAVGVVDLAYERDM------LGFITFSDRPEWVFRPRQATLQGKREMTAAIL 439 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T PA ++A +L + + K +I LTDG+ + Sbjct: 440 NVAPQGGTIFEPAYREALDVLMAQEAAV------------------KHVIVLTDGKFADG 481 Query: 345 KS------NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + ++ + + I TI+I + Q+L + +Y ++ +L Sbjct: 482 TGPFSRGPAPDFGRLAALGRRSGITTSTIAIGDGADPQQLTTIARAGGGRYYEALDVSTL 541 Query: 399 IHVF 402 +F Sbjct: 542 PRIF 545 >gi|74002027|ref|XP_544943.2| PREDICTED: similar to anthrax toxin receptor 2 [Canis familiaris] Length = 646 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 33/244 (13%), Positives = 68/244 (27%), Gaps = 52/244 (21%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P + + R F + V+D SGS+ V + Sbjct: 188 PVSAQEQPSCRGAFDLYFVLDKSGSVANNWIEIYNFVQQLTERFVSPQ------------ 235 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + I ++++ + + K+ + + + + T Sbjct: 236 --------------MRLSFIVFSSQATIILPLTGDRSKISKGLEDLKNVSPVGETYIHEG 281 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICD 355 +K A Q K L II LTDG + KI Sbjct: 282 LKLA----------------NEQIQKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKI-- 323 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++ ++ + + Q L+ S E + V + I ++ + + Sbjct: 324 -SRSFGARVYCVGVL--DFEQAQLERIADSKEQVFPVKGGFQ---ALKGIINSILAQSCT 377 Query: 416 VILK 419 IL+ Sbjct: 378 EILE 381 >gi|291228410|ref|XP_002734180.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 945 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 78/206 (37%), Gaps = 27/206 (13%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL----KNALLLF 238 +E + +V+D SGSM + + I D T + + K ++ F Sbjct: 177 VEAATPEPKNVVIVIDTSGSMA----NLHSGKSLINIAIDAAITVLDTMNPNDKVGVIAF 232 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 D + L + + Y + + ++Q+V + T A Sbjct: 233 SDELKLPPKIGD-----ASCYANELALATTI--NIQNLKQFVLSLVA---RGGTHYGKAF 282 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-NTIKICDKA 357 A+ +L K S+ + + KI + IIFLTDGE + K+++ I+ ++ Sbjct: 283 DAAFNLL----KESYTLDADNERGKIER---DQVIIFLTDGEPLDDKTSIMRKIRSNNEE 335 Query: 358 KENFIKIVTISINASPNGQRLLKTCV 383 EN + I+T + +G L+ Sbjct: 336 MENKVTILTFGLGL-DSGINFLEDIA 360 >gi|255570578|ref|XP_002526246.1| protein binding protein, putative [Ricinus communis] gi|223534440|gb|EEF36143.1| protein binding protein, putative [Ricinus communis] Length = 513 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 55/164 (33%), Gaps = 42/164 (25%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 +RP + V+D+SGSM K+A LK A+L + + Sbjct: 55 SSNDRPGLDLVAVLDVSGSMAGE--------------------KIAKLKMAMLFMIKKLS 94 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAY 302 + + ++ ++T + TE ++ ++ L T+ T ++ A Sbjct: 95 PID------RLSIVTFSTDSTRLCPLRQITENSQKEFENLINRLKADGWTNITAGLETAL 148 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 ++L I+ ++DGE+N Sbjct: 149 KVLNDRSFNGGRVVG---------------IMLMSDGEHNTDGD 177 >gi|329940639|ref|ZP_08289920.1| putative secreted protein [Streptomyces griseoaurantiacus M045] gi|329300700|gb|EGG44597.1| putative secreted protein [Streptomyces griseoaurantiacus M045] Length = 421 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 71/225 (31%), Gaps = 32/225 (14%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 E ++ ++LV+D+SGSM ++MAA K A Sbjct: 26 SGAAADETADQDAPKVDLVLDVSGSMRAKDIDGG--------------SRMAAAKQAFNE 71 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 L E+V +G+ + + + Y + P D T A Sbjct: 72 VL------DATPEEVRLGIRTLGADYPGDNRKEGCKDTAQLY--------PVGPLDRTEA 117 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 K A L+ + + K I+ ++DGE+ + + A Sbjct: 118 -KTAVATLSPTGWTPIGPALLKAADDLDGGNGSKRIVLISDGEDTCAPLDPCEVAREIAA 176 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 K + I T+ + +C++ + +V + D L Sbjct: 177 KGIGLTIDTLGL-VPDVKLNRQLSCIAEATGGTYTSVEHRDQLTD 220 >gi|290999945|ref|XP_002682540.1| predicted protein [Naegleria gruberi] gi|284096167|gb|EFC49796.1| predicted protein [Naegleria gruberi] Length = 502 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 39/230 (16%), Positives = 77/230 (33%), Gaps = 48/230 (20%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D+SGSM P+ K +K+ A K+A+ + + + Sbjct: 77 LVLDISGSMD------------EPLKNRSKGSKLTACKSAIRELVTNFLTYKDT-----I 119 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 LI Y+ + K + T+ A+ A +L + Sbjct: 120 HLITYSDSPKTVFT---EKNKESVNLNDIDKISTEGSTNIASALHSAVDLLHNSN----- 171 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN--------TIKICDKAKENFIKIVT 366 P K I F +DG+ N ++N+N +K + K++ I I + Sbjct: 172 ------------APGTKLIAFFSDGQCNVGETNLNIFGSGLLKKLKDYSEGKDDQIHISS 219 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + + + L + +Y + + F+ + + KY + Sbjct: 220 YGVGSDYDELWLQAIARTGKGEYYYLEDETYAKDAFE---RSLKKYKYQI 266 >gi|110626529|gb|ABG79013.1| TadG [Yersinia ruckeri] Length = 478 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 43/274 (15%), Positives = 80/274 (29%), Gaps = 59/274 (21%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 LL+ + + ++K I F +S+ F+ +I L K + A L Sbjct: 31 LLNTILMF----LENKKGGIIIPFFISLPFFIAIIMLLFDFTQLINNKIKLSDALEQGAL 86 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A + + + D + I +I L T+ NI Sbjct: 87 A----LTAENNAKNDTRNN--------------ELISAYINFYLGHRH---QLTQYNNIT 125 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + + + + + +L Y++D F L++ FI Sbjct: 126 VNYQQNPDRLYHTQLSQ------YHIDANIEQPTLFPFTSLLIDHDN------FIIGGSA 173 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + P + V D SGSM ++ + + +K+ LK D I Sbjct: 174 AAIKDVPAMDVVFVTDFSGSMEGDFHNPDD---------PEVLSKLDELKRIFFKIADDI 224 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + SWGT+ Sbjct: 225 YTANKDS-------------TISFSPFSWGTKSA 245 >gi|289596620|ref|YP_003483316.1| von Willebrand factor type A [Aciduliprofundum boonei T469] gi|289534407|gb|ADD08754.1| von Willebrand factor type A [Aciduliprofundum boonei T469] Length = 2166 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 76/234 (32%), Gaps = 24/234 (10%) Query: 120 NIVNSSRISMTHMANNRLDSSNNTIF---YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + +N +S + + S +T Y D T Y + + ++ + Sbjct: 1179 DFINRRDVSTNVVGDWDWTLSGDTDSDGVYTQDGNTFYTNDTKVLNFTGGAYHSASSAFW 1238 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 +P R+ +R I V+D SGSM+ + P + T++ A + Sbjct: 1239 MPEAPRLN--KRKPIDIIFVIDTSGSMNSVV---PGATVGDVNGDGRSNTRIDVAIQAAI 1293 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDST 295 + + V + G + E+ + ++ T + ++ D T Sbjct: 1294 DAVKELGPQDRVAVFTFDG----NSHPEEYMGFTYVTADNLPTIISDLKDIQADGGTPLY 1349 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 + A + + + +G I+ LTDG +N+ N Sbjct: 1350 DTLSWAVYYMDTQSADNPDREDATRG-----------ILVLTDGLSNSDNYGPN 1392 >gi|254167891|ref|ZP_04874740.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] gi|197623182|gb|EDY35748.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] Length = 2164 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 76/234 (32%), Gaps = 24/234 (10%) Query: 120 NIVNSSRISMTHMANNRLDSSNNTIF---YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + +N +S + + S +T Y D T Y + + ++ + Sbjct: 1177 DFINRRDVSTNVVGDWDWTLSGDTDSDGVYTQDGNTFYTNDTKVLNFTGGAYHSASSAFW 1236 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 +P R+ +R I V+D SGSM+ + P + T++ A + Sbjct: 1237 MPEAPRLN--KRKPIDIIFVIDTSGSMNSVV---PGATVGDVNGDGRSNTRIDVAIQAAI 1291 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDST 295 + + V + G + E+ + ++ T + ++ D T Sbjct: 1292 DAVKELGPQDRVAVFTFDG----NSHPEEYMGFTYVTADNLPTIISDLKDIQADGGTPLY 1347 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 + A + + + +G I+ LTDG +N+ N Sbjct: 1348 DTLSWAVYYMDTQSADNPDREDATRG-----------ILVLTDGLSNSDNYGPN 1390 >gi|283779589|ref|YP_003370344.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283438042|gb|ADB16484.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 1740 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 42/253 (16%), Positives = 76/253 (30%), Gaps = 36/253 (14%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P + I R + V+D S +M + D Q +M K AL Sbjct: 1165 PPTVTIHGFRRRQASVVFVLDCSHTMKALDEVESPDGGVRATSQ-----RMELAKGALRS 1219 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL---------- 287 L + + D +G+ + RV + + + +Y L Sbjct: 1220 MLTQL----AERGDARVGVRFFGHRVGWSTVEANKLLRQNRYAGEIPPELQPYADVENIL 1275 Query: 288 ILKPTDS------TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 L DS AMK + + +K I+ +TDG+N Sbjct: 1276 PLGRFDSVIAGRVFEAMKTV--EPWGESPIYLSITEALRDFAQEDGESEKSIVVITDGKN 1333 Query: 342 ---NNFKSNVNTIKICDKAK-ENFIKIVTISINASPN-----GQRLLKTCVSSPEYHYNV 392 N T A+ IK+ + N P+ + + ++ Sbjct: 1334 YQFNAPSQLARTKDDVLAARGSRDIKVHIVGFNIEPSESEVASREFREIAEATGGEFLPA 1393 Query: 393 VNADSLIHVFQNI 405 +A SL+ +++ Sbjct: 1394 TSAGSLVRSLESV 1406 >gi|118087212|ref|XP_424219.2| PREDICTED: similar to matrilin 2 [Gallus gallus] Length = 1799 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 37/174 (21%), Positives = 64/174 (36%), Gaps = 26/174 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 L + L + +GLI Y + V+ T + +Q + R + ++ T + Sbjct: 55 ILTILQFLDVSPDATRVGLIQYGSTVKHEFSLK--TFRRKQEIERAVRRMMHLATGTMTG 112 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A I S+ + + N + I+ +TDG + I Sbjct: 113 LAIQYAVNIAFSESEGARPLNQ----------NVPRIIMIVTDGRPQDPVGE-----IAA 157 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADS---LIHVFQN 404 KA+ + I I I + LK+ S P E+ + V N L VFQ Sbjct: 158 KARNSGILIFAIGVG--RVDMNTLKSIGSEPHEEHIFLVANFSQIETLTSVFQT 209 >gi|332808244|ref|XP_001147912.2| PREDICTED: cartilage matrix protein [Pan troglodytes] Length = 717 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 48/145 (33%), Gaps = 17/145 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I+ L +G++ Y + V++ K + T + A++ A Sbjct: 68 IESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 D + S K +I +TDG ++ +V+ +A+ + Sbjct: 128 ITKAFGDAEGGHSR----------SPDISKVVIVVTDGRPHDSVQDVS-----ARARASG 172 Query: 362 IKIVTISINASPNGQRLLKTCVSSP 386 +++ I + L+ S P Sbjct: 173 VELFAIGFG--RVDKATLRQIASEP 195 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 83/223 (37%), Gaps = 40/223 (17%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKED 251 +NSD + N T + L + + F++ I D L + Sbjct: 252 GFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKL 311 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDK 309 +GL+ Y++ V + G ++ + + S + K T + A+K D Sbjct: 312 AQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----LIDN 364 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + K+ I+F TDG + ++ ++ KAK+ K+ + + Sbjct: 365 SFTVSSGARPGAQKVG-------IVF-TDGRSQDYINDAA-----KKAKDLGFKMFAVGV 411 Query: 370 -NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 NA + L+ S P E+++ + ++ + + + + + Sbjct: 412 GNAVEDE---LREIASEPVAEHYFYTADFKTINQIGKKLQKKI 451 >gi|255557532|ref|XP_002519796.1| protein binding protein, putative [Ricinus communis] gi|223541035|gb|EEF42592.1| protein binding protein, putative [Ricinus communis] Length = 477 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 34/274 (12%), Positives = 80/274 (29%), Gaps = 54/274 (19%) Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 ++ + + + +L+ + K+ + +RP + Sbjct: 5 DDEMIVTPQDSGSTRTPIVPGRVQLKCMNSTTAPLEESKLKVMLELTGGDSSNDRPGLDL 64 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 +V+DLSGSM K+ LK A+L + + + Sbjct: 65 VVVLDLSGSMEGE--------------------KIEKLKAAILFMIKKLSSID------R 98 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRS 312 + ++ ++ + TE ++ + ++ L + T ++ ++L + Sbjct: 99 LSIVTFSRDARRLCPLRQITENSQKDLENLINGLHAYGAANITAGLQTGLKVLNDRRFTG 158 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA- 371 I+ ++ E NN D+ + + T A Sbjct: 159 GRVAT---------------IMLVSSSEQNNGDDA-------DQILVGNVPVHTFGFGAY 196 Query: 372 --SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + + S +V N D+L F Sbjct: 197 HEPGVLKAIAHN--SIGGTFSDVQNMDNLNKAFS 228 >gi|149920127|ref|ZP_01908600.1| hypothetical protein PPSIR1_33369 [Plesiocystis pacifica SIR-1] gi|149819070|gb|EDM78507.1| hypothetical protein PPSIR1_33369 [Plesiocystis pacifica SIR-1] Length = 400 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 32/225 (14%), Positives = 73/225 (32%), Gaps = 36/225 (16%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +++D SGSM + + + +++ AL AL + L + Sbjct: 122 SVLMLLDRSGSM----------MQTGFDADEPDKSRWQALHEALGDVM----LADDMDHF 167 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVT--RDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G ++T+ WG V + +D+ L + + + Sbjct: 168 VEFGAKTFSTQ-------GWGECGVSPQIDVPMLLDNSELLLELIPGPL----EDVNGGT 216 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK----AKE-NFIKI 364 + ++ P K ++ +TDG T++ A+E + I Sbjct: 217 PTLAALDAGLGMMRDYEAPGAKAVVLITDGSIGCTDDQAATLEQITTELTLAREVDGIAT 276 Query: 365 VTIS----INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + N++ + + ++ NA+SL + + Sbjct: 277 YVVGISPSYNSAKAQLGAMAEAGGGADDYFEAANAESLHAALEQV 321 >gi|192359934|ref|YP_001981670.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686099|gb|ACE83777.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 674 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 43/324 (13%), Positives = 93/324 (28%), Gaps = 59/324 (18%) Query: 106 LSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 ++ F I + + + +L + M Y Y L Sbjct: 218 IATRDNPFSTFSIDVDTAAYSFTRRLLNQGQLPPKDAVRIEEMVNYFDYSYPLPSSAQTP 277 Query: 166 NQRYNQKIVSFIP----------ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + S ++ G P + ++D+SGSM Sbjct: 278 FTTNITVLDSPWKPGNKLLHIGIQGYQLPAGHIPQSNLVFLLDVSGSMD----------- 326 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + +K+ +K ++ L L ++ V VY G G V + + + Sbjct: 327 --------EPSKLPLVKQSMELLLSTLKPEDTVAIVVYAGAAG---TVLEPTK----VRE 371 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + + + T + AYQ + +F + II Sbjct: 372 KSKILAALHNLQAGGSTAGGEGLALAYQ----LAEANFNPKGVNR------------IIL 415 Query: 336 LTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 TDG+ N ++ ++ ++ + I + + L++T N Sbjct: 416 ATDGDFNVGQTGDEPLQDFVERKRAKGIYLSVLGFGQGNYQDALMQTLAQ------NGNG 469 Query: 395 ADSLIHVFQNISQLMVHRKYSVIL 418 + I +++V+ S + Sbjct: 470 TAAYIDTLSEAQKVLVNEATSTLF 493 >gi|6469599|gb|AAF13350.1|AF121336_1 unknown [Eufolliculina uhligi] Length = 494 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 46/286 (16%), Positives = 95/286 (33%), Gaps = 60/286 (20%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + +S +D+ + +++ + + + + R I Sbjct: 34 QEESKGEPTVGAVDIAAYGVFAFNYLQLSPEKAQEIPCTINLESPAQTSEASRSGVDIVC 93 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D+SGSM K+ ++ L ++ + + Sbjct: 94 VIDVSGSMQGE--------------------KIQLVQTTLNFMVERLSPAD------RIC 127 Query: 256 LIGYTTRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 LI ++ K G ++++ + R + S T+ ++ Q L Sbjct: 128 LISFSNDATKISRLVQMSPKGKKQLKSMIPRLVAS---GGTNIVGGLEYGLQAL------ 178 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI---VTIS 368 RQ I L II L+DG++N N T+ KA + I I ++ Sbjct: 179 -------RQRRTINQLSS---IILLSDGQDN----NGTTVLQRAKATMDSIVIRDDYSVH 224 Query: 369 INASPNGQR-LLKTCVSSPEY--HYNVVNADSLIHVFQN-ISQLMV 410 +G L ++ P+ Y V + +++ F N + +LM Sbjct: 225 TFGYGHGHDSTLLNALAEPKNGAFYYVKDEETIATAFANCLGELMS 270 >gi|296206125|ref|XP_002750075.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Callithrix jacchus] Length = 940 Score = 44.9 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 46/222 (20%), Positives = 74/222 (33%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 295 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 328 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + VR TD A++ A ++L Sbjct: 329 RFSIIGFSNRIKVWKDHLISVTPDSVRDGKVYIHHMSPTGGTDINEALQTAIRLL----- 383 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + IIFLTDG+ +T+KI + +E + I TI Sbjct: 384 -----NKYVAHSDSGDRSVS-LIIFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 435 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 436 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 477 >gi|170079352|ref|YP_001735990.1| von Willebrand factor type A domain-containing protein [Synechococcus sp. PCC 7002] gi|169887021|gb|ACB00735.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7002] Length = 414 Score = 44.9 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 35/241 (14%), Positives = 80/241 (33%), Gaps = 54/241 (22%) Query: 170 NQKIVSFIPALLRIEMGERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 +Q+ +SF + + + + + LV+D SGSM + Sbjct: 21 SQRQLSFSIGAIASDFQDATLPLNLCLVLDHSGSMAGQ--------------------PL 60 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR---VEKNIEPSWGTEKVRQYVTRDMD 285 +K A + +D + + +I + + + N + + ++ + R Sbjct: 61 RTVKEAAIQLVDQLREGD------RLSVIAFDHKAKVIVPNQDVT-DKAHIKAQIDRLEA 113 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL-TDGENNNF 344 + T +K Q L S + +F+ TDGEN + Sbjct: 114 A---GGTCIDDGIKLGLQELASSPGKRAAQ------------------VFMLTDGENEHG 152 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 N ++I A E+ + + ++ + N L K ++ + + + F+ Sbjct: 153 -DNGRCLEIAAVAAEHGVTLNSLGFGENWNQDVLEKIADAANGSLAYIETPNQALTEFER 211 Query: 405 I 405 + Sbjct: 212 L 212 >gi|291541799|emb|CBL14909.1| TadE-like protein [Ruminococcus bromii L2-63] Length = 221 Score = 44.9 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 73/194 (37%), Gaps = 19/194 (9%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS---N 71 + SE I +++ FLIY+ + Y K +ES + GA+ Sbjct: 11 LNSENGTVIIEATFVFPIMFIILFFLIYMGNAFYMKAQVESVVEQKAIQGAAYCADPILE 70 Query: 72 LSRLGDRFESISNHAKRAL------IDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + +F S+S + ++D + I N ++E+L+G ++ F+ I Sbjct: 71 TMKETGKFPSLSALETKPYRYIFGGMNDVESKISNEVEEALTGKTSTFFTNMYPKIT--- 127 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 A + + NN + Y+ Y ++F L + +V A ++ Sbjct: 128 ------TAKSDIAKYNNYVVYS-TFSVEVKYTIKFPISFLGESTPPMLVITSRAEAPVDD 180 Query: 186 GERPIFLIELVVDL 199 I ++V+D+ Sbjct: 181 TAEFIRNTDMVIDV 194 >gi|86143680|ref|ZP_01062056.1| aerotolerance-related membrane protein [Leeuwenhoekiella blandensis MED217] gi|85829723|gb|EAQ48185.1| aerotolerance-related membrane protein [Leeuwenhoekiella blandensis MED217] Length = 349 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 64/196 (32%), Gaps = 45/196 (22%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++E +R + VD+S SM D +++ K + +++ Sbjct: 82 KLETVKREGVDVVFAVDVSKSMDAE---------------DIAPSRIDKAKQLVTQIINN 126 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQ 300 + +G+I Y + + + ++ D L + T A++ Sbjct: 127 LGSD-------RVGIIAYAGSAYPQLPITTDYSSAKMFLNAMNTDMLSSQGTAIRDAIEL 179 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A +++ + + ++ ++DGE I + A E Sbjct: 180 AKTYYNDEEQTN------------------RVLVIISDGE----DHAGEVASIAESATEQ 217 Query: 361 FIKIVTISINASPNGQ 376 I+I TI + + + Sbjct: 218 GIRIFTIGVGSEAGDR 233 >gi|332524448|ref|ZP_08400660.1| von Willebrand factor type A [Rubrivivax benzoatilyticus JA2] gi|332107769|gb|EGJ08993.1| von Willebrand factor type A [Rubrivivax benzoatilyticus JA2] Length = 343 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 45/271 (16%), Positives = 88/271 (32%), Gaps = 70/271 (25%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 A L + M ER I L +D+SGSM +D K ++ A + A Sbjct: 76 TATLTLPMAER---TIILAMDVSGSM---------------RAEDVKPNRLVAAQEAARA 117 Query: 238 FLDSIDLLSHVKEDVYMGLIGY--TTRVEKN--------------IEPSWGTEKVRQYVT 281 F++S+ +V +G++ + T V + + GT V Sbjct: 118 FVESL------PREVRVGVVSFAGTAAVVQAPTTSRDDVFAAIERFQLQRGTAIGSGIVL 171 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF----IIFLT 337 D Q +T + + + +P + +I LT Sbjct: 172 SLATIFPDAGIDI--------QQITGQRTMPRMLGDPEKKAEFTPVPPGSYASAAMILLT 223 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN--------------GQRLLKTCV 383 DG+ ++ K+ A + I++ T+ + + + L+ Sbjct: 224 DGQRTTGPDPIDAAKM---AADRGIRVYTVGVGTTQGEIIGFEGWSMRVRLDEDTLRQIA 280 Query: 384 S-SPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + ++ A+ L V+Q + MV + Sbjct: 281 QMTTGEYFYAGTAEDLKKVYQRLGSRMVVER 311 >gi|332308254|ref|YP_004436105.1| LPXTG-motif cell wall anchor domain protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332175583|gb|AEE24837.1| LPXTG-motif cell wall anchor domain protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 777 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 44/317 (13%), Positives = 92/317 (29%), Gaps = 74/317 (23%) Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM--------TSYDYR--LQ 159 ++ F + + I +T ++ T D Sbjct: 294 NSGFSISPEDIVSEHHPIDITPKDQQAGHYQIELTKEHIANQDFSLRWKPTVGDIPSAAH 353 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL---IELVVDLSGSMHCAMNSDPEDVNS 216 F E + N RY +++ + + + ++D SGSM Sbjct: 354 FSETIGNYRYAMVMLTPPRQDDADDKSTKTPVSAREVVFLLDTSGSMAGE---------- 403 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK-----NIEPSW 271 + K A+ L + V +I + + + Sbjct: 404 ----------SIVQAKRAVDFALTQLHPEDSVN------VIEFNDAPQALWNLAMPATAN 447 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 ++ R +V + T+ PA+ A TN +Q + S + Sbjct: 448 NIQRARNWVASLSAN---GGTEMAPALSMALHK----------TNLEQQNINEGSPVQLR 494 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 ++F+TDG +N + ++ I + + ++ TI I S+P ++ Sbjct: 495 QVVFITDGSVSNEDALMSLI----ENQLADSRLFTIGI-------------GSAPNSYFM 537 Query: 392 VVNADSLIHVFQNISQL 408 A + F I + Sbjct: 538 TQAAQAGRGTFTYIGDI 554 >gi|56797855|emb|CAG27023.1| matrilin-3a [Danio rerio] Length = 460 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 65/175 (37%), Gaps = 28/175 (16%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDST 295 D +D L + + ++ Y + V+ K ++Q +TR + T + Sbjct: 87 LADMVDTLDVGPDATRVAVVNYASTVKIESLLKSHLTKDTIKQAITRIEP--LAAGTMTG 144 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+K+A ++K G + S K I +TDG + V + Sbjct: 145 MAIKKAMDEAFTEK----------SGARPKSKNISKVAIIVTDG---RPQDQVEEVS--A 189 Query: 356 KAKENFIKIVTISI-NASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 A+ + I+I + + A +L+ S+P ++ + V + L F+ Sbjct: 190 AARASGIEIYAVGVDRADMRSLKLM---ASNPLEDHVFYVETYGVIEKLTSKFRE 241 >gi|325116955|emb|CBZ52508.1| unnamed protein product [Neospora caninum Liverpool] Length = 765 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 40/244 (16%), Positives = 81/244 (33%), Gaps = 45/244 (18%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + + G I +VD SGS+ A +K Sbjct: 59 LMKSGGTIGAAEGCTSQLDICFLVDSSGSIGEA--------------------HYEEVKQ 98 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KP 291 L FL + ++V L+ ++T V + ++ +D+ ++ Sbjct: 99 FLHAFLSKL---PIGNDEVNTSLVIFSTTVHPHWSLRANNASDKETAMQDVLTIPYHGGT 155 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T++ ++ Q+L + K +I +TDGE+++ VN Sbjct: 156 TNTAAGLQTCNQMLFDYPREE-------------RQTVPKLVIAMTDGESDSDFHTVNEA 202 Query: 352 KICDKAKENF--IKIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVFQNISQ 407 K+ +E I ++++ + + N R + C SSP Y L+ I + Sbjct: 203 KV---IRERGGIITVLSVGMYVNHNECRSMCGCRNDSSPCPLYLQTEWSQLLPSISPILK 259 Query: 408 LMVH 411 + Sbjct: 260 EVCK 263 >gi|6017001|gb|AAF01565.1|AF061273_1 thrombospondin-related adhesive protein homolog [Neospora caninum] Length = 765 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 40/244 (16%), Positives = 81/244 (33%), Gaps = 45/244 (18%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + + G I +VD SGS+ A +K Sbjct: 59 LMKSGGTIGAAEGCTSQLDICFLVDSSGSIGEA--------------------HYEEVKQ 98 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KP 291 L FL + ++V L+ ++T V + ++ +D+ ++ Sbjct: 99 FLHAFLSKL---PIGNDEVNTSLVIFSTTVHPHWSLRANNASDKETAMQDVLTIPYHGGT 155 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T++ ++ Q+L + K +I +TDGE+++ VN Sbjct: 156 TNTAAGLQTCNQMLFDYPREE-------------RQTVPKLVIAMTDGESDSDFHTVNEA 202 Query: 352 KICDKAKENF--IKIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVFQNISQ 407 K+ +E I ++++ + + N R + C SSP Y L+ I + Sbjct: 203 KV---IRERGGIITVLSVGMYVNHNECRSMCGCRNDSSPCPLYLQTEWSQLLPSISPILK 259 Query: 408 LMVH 411 + Sbjct: 260 EVCK 263 >gi|293377912|ref|ZP_06624093.1| LPXTG-motif cell wall anchor domain protein [Enterococcus faecium PC4.1] gi|292643459|gb|EFF61588.1| LPXTG-motif cell wall anchor domain protein [Enterococcus faecium PC4.1] Length = 1498 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 70/223 (31%), Gaps = 30/223 (13%) Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 N + + E N+ + + + N+ + YN + + + Sbjct: 290 ANWRNYDYAADNGSTAGEEDVNV----PAQTVQLWGDERNFENSYLDYNGAYIKKWVEPV 345 Query: 159 --QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 L+ + + ++ E E I V+D S SM + Sbjct: 346 LSNNPTSDLHPEDATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASM------------N 393 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--------TTRVEKN-- 266 + ++K AAL A+ + +LLS D+ +G++ + + Sbjct: 394 EGTLEGGGQSKNAALIEAVNEISE--NLLSDPNMDIRIGMVNFYHNSTVINNQEQISSDI 451 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ ++ + T T +K Y+ L +D Sbjct: 452 FPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYADN 494 >gi|119478003|ref|ZP_01618103.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2143] gi|119448916|gb|EAW30158.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2143] Length = 341 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 72/225 (32%), Gaps = 43/225 (19%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + VDLSGSM Q +K ++ A+K L F S Sbjct: 101 RSARDLMVAVDLSGSMEAQ---------DFTTEQGEKIDRLTAVKQVLTEF-------SQ 144 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ +GLI + + + + + ++ T A+ + Sbjct: 145 RRDGDRLGLIVFGSAAYLQAPFTADKDTWLTLLQETEIAMAGASTSIGDAIGLSISTFEH 204 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 S + +I LTDG + + A++ +KI TI Sbjct: 205 ------------------SDTDNRVLIVLTDGNDTGSRVPPVDAARVANARD--VKIYTI 244 Query: 368 SINASPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNI 405 +I G+ + + + ++ ++ +L + +I Sbjct: 245 AIGDPETIGEDAMDVDTLKQVSDITGGAYFEALDRQALERAYLDI 289 >gi|47216147|emb|CAG10021.1| unnamed protein product [Tetraodon nigroviridis] Length = 1453 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 73/187 (39%), Gaps = 29/187 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 ++ +K L ++ +D+ + +G++ Y +RV+ + T + + + + + Sbjct: 557 PSEFEQVKVFLAKVIEGLDVGPNA---TRVGVVNYASRVKNEVSLK--THRTKAGLIKAV 611 Query: 285 D--SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T + A++ A + S+ + G ++ S K I +TDG Sbjct: 612 TKIEPLSTGTMTGLAIQFAMNVAFSEAE----------GARLRSPDISKVAIVVTDG--- 658 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADS 397 + NV + +A++ I+I I + L+ S P ++ V + + Sbjct: 659 RPQDNVKDVAQ--RARDAGIEIFAIGVG--RVEMSTLRQMASDPLDDHVDYVESYSVIEK 714 Query: 398 LIHVFQN 404 L FQ Sbjct: 715 LTKKFQE 721 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 69/176 (39%), Gaps = 31/176 (17%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVT 281 + +K + +D +D+ ++GL+ Y++ V++ + +++ V Sbjct: 785 RPENFELVKKWINQIIDKLDVSD---NKAHVGLVQYSSAVKQEFPLGRYNNKKDLKEAVK 841 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + + + T + A++ + + QG + I+F TDG + Sbjct: 842 KM--AYMERGTMTGQALRY----------LTDNSFGPGQGARPGVTKVG--IVF-TDGRS 886 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSP--EYHYNVVN 394 ++ + KAK+ K+ + + NA + LK S P E+++ + Sbjct: 887 QDYIGDAA-----KKAKDQGFKMYAVGVGNAVEDE---LKEIASEPTAEHYFYTAD 934 >gi|238011090|gb|ACR36580.1| unknown [Zea mays] Length = 516 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 29/224 (12%), Positives = 67/224 (29%), Gaps = 54/224 (24%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 +R + V+D+SGSM K+ +K A+ + + Sbjct: 55 STSDRSGLDLVAVLDVSGSMQGE--------------------KIEKMKTAMKFVVKKLS 94 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAY 302 + + ++ + + TE + + + +D+L T+ + ++ Sbjct: 95 SID------RLSIVTFLDTANRICPLRQVTEDSQPQLLKLIDALQPGGNTNISDGLQTGL 148 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++L K S ++ ++DG+ N + N K + Sbjct: 149 KVLADRKLSSGRVVG---------------VMLMSDGQQNRGEPAAN-------VKIGNV 186 Query: 363 KIVTISINA---SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + T A + + S V + + L F Sbjct: 187 PVYTFGFGADYDPTVLNAVARN--SMGGTFSVVNDVNLLSMAFS 228 >gi|284163331|ref|YP_003401610.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284012986|gb|ADB58937.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 1446 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 67/212 (31%), Gaps = 50/212 (23%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V D SGSM P A K + DS G Sbjct: 539 VNDESGSMS----GSPTHYAEL------------AGKRFVGALTDSE----------RAG 572 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 +GY + + + + V + R S T++ ++ L + + Sbjct: 573 RVGYASGANLDQPLTTDHDAVNSSLERLSAS---GGTNTRAGLRVGLNHLEEEGWENRSA 629 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 +I L+DG KS + + + + A E ++I T+ + + N Sbjct: 630 ----------------VMILLSDG-----KSGSDPLPVAEDAAEAGVEISTVGLGNNINE 668 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 L + + Y+V + L F+ +++ Sbjct: 669 NELREIAAITGGDFYHVEREEDLPDTFERVAE 700 >gi|212720733|ref|NP_001132911.1| collagen, type XXII, alpha 1 [Gallus gallus] Length = 1599 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 59/161 (36%), Gaps = 23/161 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMK 299 ++ + +G++ Y+ R + G K + + + T++ A++ Sbjct: 65 VETFEIGPDKTRVGVVRYSDRPTTEFDL--GKYKTCEEIKEAARKIRYYGGNTNTGDALR 122 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 S + G ++ +K I LTDG + ++ + + A++ Sbjct: 123 YINTYSFSKEA----------GGRLSDRTVKKVAILLTDGRSQDYVLDPA-----NAARQ 167 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 I+I + + + L S P + ++V + +++ Sbjct: 168 AGIRIFAVGVG--EALKEELDEIASEPKSAHVFHVSDYNAI 206 >gi|302669471|ref|YP_003829431.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] gi|302393944|gb|ADL32849.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] Length = 561 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 44/199 (22%), Positives = 72/199 (36%), Gaps = 45/199 (22%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + G RP + V D+SGSM T++ +LKN+LL + ID Sbjct: 378 KNGTRPTIAV-FVTDISGSM--------------------NGTRIKSLKNSLLSTMQYID 416 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAY 302 S Y+GL+ Y+ +V N+ + K R Y + + L + T + A+ Sbjct: 417 SSS------YIGLVSYSDKVYINLPIAQFDNKQRAYFSGAVKDLDVGGQTATYDAVLVGM 470 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 Q+L K N + L+DG N + +I I Sbjct: 471 QMLMEKSKEVPDANMM--------------LFVLSDGAQN---AGFELKRITPIVGGLGI 513 Query: 363 KIVTISINASPNGQRLLKT 381 + TI + + + LK Sbjct: 514 SVYTIGYEMTDSDKEDLKA 532 >gi|227552322|ref|ZP_03982371.1| von Willebrand factor, type A [Enterococcus faecium TX1330] gi|227178545|gb|EEI59517.1| von Willebrand factor, type A [Enterococcus faecium TX1330] Length = 1518 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 70/223 (31%), Gaps = 30/223 (13%) Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 N + + E N+ + + + N+ + YN + + + Sbjct: 310 ANWRNYDYAADNGSTAGEEDVNV----PAQTVQLWGDERNFENSYLDYNGAYIKKWVEPV 365 Query: 159 --QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 L+ + + ++ E E I V+D S SM + Sbjct: 366 LSNNPTSDLHPEDATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASM------------N 413 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--------TTRVEKN-- 266 + ++K AAL A+ + +LLS D+ +G++ + + Sbjct: 414 EGTLEGGGQSKNAALIEAVNEISE--NLLSDPNMDIRIGMVNFYHNSTVINNQEQISSDI 471 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ ++ + T T +K Y+ L +D Sbjct: 472 FPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYADN 514 >gi|148652289|ref|YP_001279382.1| von Willebrand factor, type A [Psychrobacter sp. PRwf-1] gi|148571373|gb|ABQ93432.1| von Willebrand factor, type A [Psychrobacter sp. PRwf-1] Length = 571 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 41/256 (16%), Positives = 85/256 (33%), Gaps = 53/256 (20%) Query: 165 LNQRYNQKI-VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 NQ + V L ++P + +VD+SGSM Sbjct: 188 PWHPTNQIVKVGIKAEDLLTAKQKQPPANLVFLVDVSGSMD------------------- 228 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K+ K++L + + + Y G T+V P T+K+ + Sbjct: 229 TEDKLQLAKSSLKMLTKQLRAQDSITLITYAG----NTKVVLPSTPGNQTQKI---LNAI 281 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NN 342 + T+ A+K AYQ + K + I+ LTDG+ N Sbjct: 282 DNLTASGSTNGEAAIKLAYQQ-------------ATEHFKKDGINR---ILMLTDGDFNV 325 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA------- 395 S + ++I ++ I + T+ +++ + +Y+ +++ Sbjct: 326 GVSSVKDMLQIIRSNRDKGISLSTLGFGQGNYNDHMMEQVADNGNGNYSYIDSLSEAKKV 385 Query: 396 --DSLIHVFQNISQLM 409 D + F +++ + Sbjct: 386 LIDEMSATFNTVAKDV 401 >gi|329922540|ref|ZP_08278115.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328942084|gb|EGG38366.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 595 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 51/241 (21%), Positives = 90/241 (37%), Gaps = 53/241 (21%) Query: 178 PALLRIEMGERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 AL + + LV+D S SM NSDPE ++ A+K + Sbjct: 24 GALPKAAAASQGSNIDAVLVMDASNSMK---NSDPERISG------------EAMKMFID 68 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKP-TDS 294 + + D + G++ YT R+++ +E + + +D L P TD Sbjct: 69 MLATTGDKV---------GIVSYTDRIQREKALLEIQSEADKTALKEFIDQLDRGPYTDM 119 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT---- 350 + + +A ++L +QG+ P I+ L DG N+ + T Sbjct: 120 SVGLDEAVKVL-------------KQGMDPAHAPM---IVVLADGNNDLDPNTGRTSKEA 163 Query: 351 ----IKICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + +AK + I I TI +NA N + L + + + +AD L + Sbjct: 164 SEQLAQAVKEAKGSGIPIYTIGLNADGKLNKETLAELANQTGGKSFTTSSADDLPQILSE 223 Query: 405 I 405 I Sbjct: 224 I 224 >gi|241760758|ref|ZP_04758849.1| type IV fimbrial tip adhesin [Neisseria flavescens SK114] gi|241318655|gb|EER55207.1| type IV fimbrial tip adhesin [Neisseria flavescens SK114] Length = 1065 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 28/113 (24%), Positives = 45/113 (39%), Gaps = 5/113 (4%) Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII---FLTDGENNNFKSNVNTIKICDKAK 358 Y+ S +KR + F K + F TDG + KS K Sbjct: 224 YRQAGSCQKRELGAYDTSWDIDDGLAFFSKKLAEQDFKTDGTDAAGKSWNGDPKDPADYS 283 Query: 359 ENFIKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + ++ T+ +P G+R L+ S PEY+Y +SL VF +I + + Sbjct: 284 KQLVQTFTVGFGQGITPTGKRYLQLAASRPEYYYEADKPESLSKVFNDIVEQI 336 >gi|118394228|ref|XP_001029494.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89283721|gb|EAR81831.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 406 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 80/220 (36%), Gaps = 48/220 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP + V+D SGSM +K+ +KN +L L+ ++ Sbjct: 158 RPNLDLICVIDNSGSMS-------------------GCSKIENVKNTILQLLEMLNEND- 197 Query: 248 VKEDVYMGLIGYTTRVEKN---IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + LI + T+ ++ + + ++ Q +T+ + + TD T ++ A+QI Sbjct: 198 -----RLSLITFNTKAKQLCGLKKVNNQNKESLQTITKSIKAD--GGTDITSGLEIAFQI 250 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN--TIKICDKAKENFI 362 L S K+++ ++ I L+DG+++ + + +E Sbjct: 251 LQSRKQKNSVSS----------------IFLLSDGQDDGADIKIKNLLKTTYQQLQEESF 294 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I + +G + K Y V D + F Sbjct: 295 TIHSFGFGNDHDGPLMQKIAQIKDGSFYFVEKNDQVDEFF 334 >gi|220897447|emb|CAX15331.1| complement component 2 (within H-2S) [Mus musculus] Length = 1276 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 80/239 (33%), Gaps = 45/239 (18%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 GE+ ++V+D SGSM+ + D D + K L ++ + Sbjct: 764 SPGEQQK--RKIVLDPSGSMNIYLVLDGSDSIGSSNFTG--------AKRCLTNLIEKVA 813 Query: 244 LLSHVKEDVYMGLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 GL+ Y T E++ + W TEK+ Q D + T++ Sbjct: 814 SYGVRPRY---GLLTYATVPKVLVRVSDERSSDADWVTEKLNQ--ISYEDHKLKSGTNTK 868 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK--- 352 A++ Y ++ ++ + II +TDG +N + V I+ Sbjct: 869 RALQAVYSMM----------SWAGDAPPEGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIR 918 Query: 353 ----ICDKAKEN-----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I K + + + L + + + + V + + L +VF Sbjct: 919 ALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVF 977 >gi|220897445|emb|CAX15329.1| complement component 2 (within H-2S) [Mus musculus] Length = 970 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 80/239 (33%), Gaps = 45/239 (18%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 GE+ ++V+D SGSM+ + D D + K L ++ + Sbjct: 458 SPGEQQK--RKIVLDPSGSMNIYLVLDGSDSIGSSNFTG--------AKRCLTNLIEKVA 507 Query: 244 LLSHVKEDVYMGLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 GL+ Y T E++ + W TEK+ Q D + T++ Sbjct: 508 SYGVRPRY---GLLTYATVPKVLVRVSDERSSDADWVTEKLNQ--ISYEDHKLKSGTNTK 562 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK--- 352 A++ Y ++ ++ + II +TDG +N + V I+ Sbjct: 563 RALQAVYSMM----------SWAGDAPPEGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIR 612 Query: 353 ----ICDKAKEN-----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I K + + + L + + + + V + + L +VF Sbjct: 613 ALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVF 671 >gi|218156291|ref|NP_001136178.1| complement factor B isoform 2 [Mus musculus] gi|220897443|emb|CAX15327.1| complement factor B [Mus musculus] Length = 713 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 80/239 (33%), Gaps = 45/239 (18%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 GE+ ++V+D SGSM+ + D D + K L ++ + Sbjct: 251 SPGEQQK--RKIVLDPSGSMNIYLVLDGSDSIGSSNFTG--------AKRCLTNLIEKVA 300 Query: 244 LLSHVKEDVYMGLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 GL+ Y T E++ + W TEK+ Q D + T++ Sbjct: 301 SYGVRPRY---GLLTYATVPKVLVRVSDERSSDADWVTEKLNQ--ISYEDHKLKSGTNTK 355 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK--- 352 A++ Y ++ ++ + II +TDG +N + V I+ Sbjct: 356 RALQAVYSMM----------SWAGDAPPEGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIR 405 Query: 353 ----ICDKAKEN-----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I K + + + L + + + + V + + L +VF Sbjct: 406 ALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVF 464 >gi|218156289|ref|NP_032224.2| complement factor B isoform 1 [Mus musculus] gi|220897442|emb|CAX15326.1| complement factor B [Mus musculus] Length = 763 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 80/239 (33%), Gaps = 45/239 (18%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 GE+ ++V+D SGSM+ + D D + K L ++ + Sbjct: 251 SPGEQQK--RKIVLDPSGSMNIYLVLDGSDSIGSSNFTG--------AKRCLTNLIEKVA 300 Query: 244 LLSHVKEDVYMGLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 GL+ Y T E++ + W TEK+ Q D + T++ Sbjct: 301 SYGVRPRY---GLLTYATVPKVLVRVSDERSSDADWVTEKLNQ--ISYEDHKLKSGTNTK 355 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK--- 352 A++ Y ++ ++ + II +TDG +N + V I+ Sbjct: 356 RALQAVYSMM----------SWAGDAPPEGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIR 405 Query: 353 ----ICDKAKEN-----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I K + + + L + + + + V + + L +VF Sbjct: 406 ALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVF 464 >gi|148694792|gb|EDL26739.1| complement factor B, isoform CRA_b [Mus musculus] Length = 760 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 80/239 (33%), Gaps = 45/239 (18%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 GE+ ++V+D SGSM+ + D D + K L ++ + Sbjct: 248 SPGEQQK--RKIVLDPSGSMNIYLVLDGSDSIGSSNFTG--------AKRCLTNLIEKVA 297 Query: 244 LLSHVKEDVYMGLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 GL+ Y T E++ + W TEK+ Q D + T++ Sbjct: 298 SYGVRPRY---GLLTYATVPKVLVRVSDERSSDADWVTEKLNQ--ISYEDHKLKSGTNTK 352 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK--- 352 A++ Y ++ ++ + II +TDG +N + V I+ Sbjct: 353 RALQAVYSMM----------SWAGDAPPEGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIR 402 Query: 353 ----ICDKAKEN-----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I K + + + L + + + + V + + L +VF Sbjct: 403 ALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVF 461 >gi|148694794|gb|EDL26741.1| complement factor B, isoform CRA_d [Mus musculus] Length = 731 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 80/239 (33%), Gaps = 45/239 (18%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 GE+ ++V+D SGSM+ + D D + K L ++ + Sbjct: 219 SPGEQQK--RKIVLDPSGSMNIYLVLDGSDSIGSSNFTG--------AKRCLTNLIEKVA 268 Query: 244 LLSHVKEDVYMGLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 GL+ Y T E++ + W TEK+ Q D + T++ Sbjct: 269 SYGVRPRY---GLLTYATVPKVLVRVSDERSSDADWVTEKLNQ--ISYEDHKLKSGTNTK 323 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK--- 352 A++ Y ++ ++ + II +TDG +N + V I+ Sbjct: 324 RALQAVYSMM----------SWAGDAPPEGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIR 373 Query: 353 ----ICDKAKEN-----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I K + + + L + + + + V + + L +VF Sbjct: 374 ALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVF 432 >gi|119628048|gb|EAX07643.1| matrilin 1, cartilage matrix protein, isoform CRA_b [Homo sapiens] Length = 519 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 51/145 (35%), Gaps = 17/145 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I+ L +G++ Y + V++ K + T + A++ A Sbjct: 68 IESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 D + G + S K +I +TDG + +V+ +A+ + Sbjct: 128 ITKAFGDAE----------GGRSRSPDISKVVIVVTDGRPQDSVQDVS-----ARARASG 172 Query: 362 IKIVTISINASPNGQRLLKTCVSSP 386 +++ I + + + L+ S P Sbjct: 173 VELFAIGVGSVD--KATLRQIASEP 195 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 82/223 (36%), Gaps = 40/223 (17%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKED 251 +NSD + N T + L + + F+ I D L + Sbjct: 252 GFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFISQIVDTLDVSDKL 311 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDK 309 +GL+ Y++ V + G ++ + + S + K T + A+K D Sbjct: 312 AQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----LIDN 364 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + K+ I+F TDG + ++ ++ KAK+ K+ + + Sbjct: 365 SFTVSSGARPGAQKVG-------IVF-TDGRSQDYINDAA-----KKAKDLGFKMFAVGV 411 Query: 370 -NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 NA + L+ S P E+++ + ++ + + + + + Sbjct: 412 GNAVEDE---LREIASEPVAEHYFYTADFKTINQIGKKLQKKI 451 >gi|110678222|ref|YP_681229.1| hypothetical protein RD1_0875 [Roseobacter denitrificans OCh 114] gi|109454338|gb|ABG30543.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 320 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 62/178 (34%), Gaps = 37/178 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +V+DLSGSM V ++ T+ A+ + Sbjct: 92 DLAIVLDLSGSM----------VRDDFDLDGRQVTRRDAVAT-VGADFARRRGGD----- 135 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + L+ + + S+ E + + + + + T + + A + + + + Sbjct: 136 -RVALVVFGSEAYFAAPFSFDVEAIARQIEGAQIGISGRATSISDGLGLALKRMENSEAA 194 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 S + +I L+DG NN +N + + A + +++ TI++ Sbjct: 195 S------------------RVVILLSDGVNNAGATNPRGVA--ELAAQMGVRVHTIAL 232 >gi|116126|sp|P04186|CFAB_MOUSE RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|192412|gb|AAA37379.1| factor B [Mus musculus] gi|192414|gb|AAA63293.1| factor B [Mus musculus] gi|2944424|gb|AAC05283.1| complement factor B [Mus musculus] gi|3986764|gb|AAC84160.1| Bf [Mus musculus] gi|13529437|gb|AAH05451.1| Complement factor B [Mus musculus] gi|74146351|dbj|BAE28943.1| unnamed protein product [Mus musculus] gi|148694791|gb|EDL26738.1| complement factor B, isoform CRA_a [Mus musculus] Length = 761 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 80/239 (33%), Gaps = 45/239 (18%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 GE+ ++V+D SGSM+ + D D + K L ++ + Sbjct: 249 SPGEQQK--RKIVLDPSGSMNIYLVLDGSDSIGSSNFTG--------AKRCLTNLIEKVA 298 Query: 244 LLSHVKEDVYMGLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 GL+ Y T E++ + W TEK+ Q D + T++ Sbjct: 299 SYGVRPRY---GLLTYATVPKVLVRVSDERSSDADWVTEKLNQ--ISYEDHKLKSGTNTK 353 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK--- 352 A++ Y ++ ++ + II +TDG +N + V I+ Sbjct: 354 RALQAVYSMM----------SWAGDAPPEGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIR 403 Query: 353 ----ICDKAKEN-----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I K + + + L + + + + V + + L +VF Sbjct: 404 ALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVF 462 >gi|242072180|ref|XP_002446026.1| hypothetical protein SORBIDRAFT_06g000656 [Sorghum bicolor] gi|241937209|gb|EES10354.1| hypothetical protein SORBIDRAFT_06g000656 [Sorghum bicolor] Length = 434 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 76/226 (33%), Gaps = 35/226 (15%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ER + V+D+SGSM + N ++ K A+ + S+ + Sbjct: 8 ERAPVDVVAVLDVSGSMAWDYGNGTTVENH----------RLELAKEAMAKAIQSLGPAA 57 Query: 247 HVKEDV-----YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQ 300 + ++ ++ V++ + + +Q V +D+L D +K Sbjct: 58 AAVAAGGARRNRLAVVPFSNVVKQVTPLTEMDMEGQQTVKNAVDALKPGGQADYLMPLKI 117 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A +IL K IIF++DG+++ F+ + + + K Sbjct: 118 AAKILDERKAEEKDRLA--------------IIIFVSDGQDHYFRDTDDMKETLTQHKLI 163 Query: 361 FIKIVTISINASP-----NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 I ++ S G + L+ + Y + D + Sbjct: 164 KYPIHAFGVSVSEQDSSGGGAKALRAMADATSGSYTSITQDDDVDT 209 >gi|156405002|ref|XP_001640521.1| predicted protein [Nematostella vectensis] gi|156227656|gb|EDO48458.1| predicted protein [Nematostella vectensis] Length = 308 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 47/289 (16%), Positives = 96/289 (33%), Gaps = 45/289 (15%) Query: 118 IQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + + ++ D N + + +F+E+ L+ R +++ Sbjct: 26 KDSFERNYYDRISQEWRFAKDKWNGDDPGYQKIPPRIETARKFMENFLD-RDIKELDDGA 84 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + R +G+ + + V+D S S+ +K+ Sbjct: 85 KNIRRRSVGDNIFYDVVFVLDSSASVG--------------------------VKDYKNG 118 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 L L++ KED I ++T T+ + + T++ A Sbjct: 119 ILALQTLITRAKEDTRYAGITFSTEANITFYF---TDPLDAMKGLGGITYAPGMTNTQAA 175 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + L +KK F L F++ I+ +TDG++N + T+ + Sbjct: 176 LDICRTQLWLNKKSGFR-----------RLSFKR-ILIVTDGQSNI--NMERTLYNAFQL 221 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLIHVFQNI 405 K I+I +++ G + SS + Y V N L+ V I Sbjct: 222 KNMGIEIFVVAVGKYLRGIAEIVGLASSTDAHLYRVRNLRGLLEVVHLI 270 >gi|149641369|ref|XP_001505343.1| PREDICTED: similar to matrilin-3, partial [Ornithorhynchus anatinus] Length = 354 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 65/178 (36%), Gaps = 27/178 (15%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKP 291 + FL ID L + + ++ Y + V+ E ++Q V+R + Sbjct: 170 VKTFLSQVIDTLDIGETATRVAVVNYASTVKVEFHLQTHSDKESLKQAVSRIAP--LATG 227 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T S A++ A + + + G + P+ K ++ +TDG + V Sbjct: 228 TMSGLAIRTAMDEVFTVEA----------GARAPAFNIPKVVVIVTDG---RPQDQVQEA 274 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 +A+ + I+I + + + L+ S P + + V + L F+ Sbjct: 275 V--AQAQASGIEIYAVGVG--RADMQSLRQLASEPVETHAFYVETYGVIEKLTSTFRK 328 >gi|3766289|emb|CAA06890.1| matrilin-4 precursor, alternate splice product [Mus musculus] Length = 434 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 64/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 225 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 282 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + L + + TDG + +++ +AKE Sbjct: 283 HMVEHSFSE----------AQGARPRDLNVPRVGLVFTDG---RSQDDISV--WAARAKE 327 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + L+ S P + + D F ++ L+ + K S+ Sbjct: 328 EGIVMYAVGVG--KAVEEELREIASEPSELHVSYSPD-----FSTMTHLLENLKGSI 377 >gi|261409467|ref|YP_003245708.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285930|gb|ACX67901.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 595 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 52/241 (21%), Positives = 90/241 (37%), Gaps = 53/241 (21%) Query: 178 PALLRIEMGERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 AL + + LV+D S SM NSDPE ++ A+K + Sbjct: 24 GALPKAAAASQGSNIDAVLVMDASNSMK---NSDPERISG------------EAMKMFID 68 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKP-TDS 294 + + D + G++ YT R+++ +E + + +D L P TD Sbjct: 69 MLATTGDKV---------GIVSYTDRIQREKALLEIQSEADKTALKEFIDQLDRGPYTDM 119 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-- 352 + + +A ++L +QG+ P I+ L DG N+ + T K Sbjct: 120 SVGLDEAVKVL-------------KQGMDPAHAPM---IVVLADGNNDLDPNTGRTSKEA 163 Query: 353 ------ICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +AK + I I TI +NA N + L + + + +AD L + Sbjct: 164 SEQLNQAVKEAKGSGIPIYTIGLNADGKLNKETLAELAKQTGGKSFTTSSADDLPQILSE 223 Query: 405 I 405 I Sbjct: 224 I 224 >gi|223974345|gb|ACN31360.1| unknown [Zea mays] Length = 516 Score = 44.9 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 29/224 (12%), Positives = 67/224 (29%), Gaps = 54/224 (24%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 +R + V+D+SGSM K+ +K A+ + + Sbjct: 55 STSDRSGLDLVAVLDVSGSMQGE--------------------KIEKMKTAMKFVVKKLS 94 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAY 302 + + ++ + + TE + + + +D+L T+ + ++ Sbjct: 95 SID------RLSIVTFLDTANRICPLRQVTEDSQPQLLKLIDALQPGGNTNISDGLQTGL 148 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++L K S ++ ++DG+ N + N K + Sbjct: 149 KVLADRKLSSGRVVG---------------VMLMSDGQQNRGEPAAN-------VKIGNV 186 Query: 363 KIVTISINA---SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + T A + + S V + + L F Sbjct: 187 PVYTFGFGADYDPTVLNAVARN--SMGGTFSVVNDVNLLSMAFS 228 >gi|189347154|ref|YP_001943683.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341301|gb|ACD90704.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 6006 Score = 44.9 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 50/152 (32%), Gaps = 16/152 (10%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 VS + + + + +D+SGSM +N+D T+M Sbjct: 5437 VSPTSNSYEVPLQNTNLL---ITLDVSGSMSRNLNND-------SHPTGNDPTRMDIAVE 5486 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 ++ L D + DV + L+ ++T + W T + + + + + T+ Sbjct: 5487 SIAEMLSQYDY----RGDVSVKLVIFSTNGQSLTTAEWVTVEEAKIMLNSLVAN--GGTN 5540 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 A++ A + + + Sbjct: 5541 YDGAIEAADDAFVNTNGMIANADNIAYFISDG 5572 >gi|326328639|ref|ZP_08194979.1| LigA [Nocardioidaceae bacterium Broad-1] gi|325953600|gb|EGD45600.1| LigA [Nocardioidaceae bacterium Broad-1] Length = 871 Score = 44.9 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 42/228 (18%), Positives = 73/228 (32%), Gaps = 48/228 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L++D SGSM+ + K+ +++A + LL+ E Sbjct: 651 VILLIDNSGSMNDEVAPG--------------AAKIDRVQSAANAAIG---LLAPKDE-- 691 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + + + + V K G V ++ ++ T + A Q Sbjct: 692 -LAVWTFGSSVHKTALAPMGNR--ISQVRAEIGAIEAGGTTTQLP--AAVQA-------- 738 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS--NVNTIKICDKAKENFI-------K 363 + P P K ++ LTDG N + K + A I + Sbjct: 739 --AHDALAQTNDPDNPKTKAVVLLTDGATNLTPDGADEEENKAANDALVADIRGSESHVR 796 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL-----IHVFQNIS 406 I TI S + L K +S +Y +SL + VF N Sbjct: 797 IYTIPYGNSADKCLLEKVAAASGARYYGAGARESLINDVMLAVFGNFG 844 >gi|293571291|ref|ZP_06682325.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291608698|gb|EFF37986.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1364 Score = 44.9 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 71/223 (31%), Gaps = 30/223 (13%) Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 N + + + E N+ + + + N+ + YN + + + Sbjct: 152 ANWRNYDYAADNGSAASEEDVNV----PAQTVQLWGDERNFENSYLDYNGAYIKKWVEPV 207 Query: 159 --QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 L+ + + ++ E E I V+D S SM + Sbjct: 208 PSNNPTADLHPEDATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASM------------N 255 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--------TTRVEKN-- 266 + ++K AAL A+ + +LLS D+ +G++ + + Sbjct: 256 EGTLEGGGQSKNAALIEAVNEISE--NLLSDPNMDIRIGMVNFYHNSTVINNQEQISSDI 313 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ ++ + T T +K Y+ L +D Sbjct: 314 FPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYADN 356 >gi|288919019|ref|ZP_06413360.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288349559|gb|EFC83795.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 319 Score = 44.9 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 39/237 (16%), Positives = 74/237 (31%), Gaps = 52/237 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R I L +D+S SM D + T++ A K F+D + Sbjct: 84 RERATIILAIDVSNSMAA---------------TDIQPTRLEAAKQGAQAFVDQL----- 123 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILT 306 + +GL+ + + R+ V + L L P T + + Q +T Sbjct: 124 -PPRINLGLVSFAGSAAVLVP----ASTDRESVRSGIRGLQLGPATAVGEGIFASLQAIT 178 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + +R P I+ L+DGE + N A++ + + T Sbjct: 179 TAGERMSDE---------GQPPPPAAIVLLSDGETTRGRPNTQAAT---AARDAEVPVDT 226 Query: 367 ISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I+ N L + + ++ D L V++ + + Sbjct: 227 IAYGTSDGTLDVGGQQIPVPVNEDALRELAEQTGGSYHRATTGDELQSVYRGLGSSI 283 >gi|257895102|ref|ZP_05674755.1| von Willebrand factor domain-containing protein [Enterococcus faecium Com12] gi|257831667|gb|EEV58088.1| von Willebrand factor domain-containing protein [Enterococcus faecium Com12] Length = 1341 Score = 44.9 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 70/223 (31%), Gaps = 30/223 (13%) Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 N + + E N+ + + + N+ + YN + + + Sbjct: 133 ANWRNYDYAADNGSTAGEEDVNV----PAQTVQLWGDERNFENSYLDYNGAYIKKWVEPV 188 Query: 159 --QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 L+ + + ++ E E I V+D S SM + Sbjct: 189 LSNNPTSDLHPEDATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASM------------N 236 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--------TTRVEKN-- 266 + ++K AAL A+ + +LLS D+ +G++ + + Sbjct: 237 EGTLEGGGQSKNAALIEAVNEISE--NLLSDPNMDIRIGMVNFYHNSTVINNQEQISSDI 294 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ ++ + T T +K Y+ L +D Sbjct: 295 FPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYADN 337 >gi|229816811|ref|ZP_04447093.1| hypothetical protein BIFANG_02059 [Bifidobacterium angulatum DSM 20098] gi|229785827|gb|EEP21941.1| hypothetical protein BIFANG_02059 [Bifidobacterium angulatum DSM 20098] Length = 1185 Score = 44.9 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 50/148 (33%), Gaps = 37/148 (25%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV+D SGSM+ +D K A K+ L + ++ Sbjct: 492 DIVLVMDKSGSMN-------------ENNRDANAQK--AAKDLAKKLLTGTNSKLPPEQQ 536 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V M ++ ++T + + + + + T+ A+KQA Sbjct: 537 VQMAVVTFSTEASLKQKFTTNVSE----INNAVRGNPDGGTNWEAALKQA---------- 582 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 ++ +K IIFL+DG Sbjct: 583 --------NDMQGGRRGVKKHIIFLSDG 602 >gi|163735880|ref|ZP_02143308.1| hypothetical protein RLO149_07941 [Roseobacter litoralis Och 149] gi|161390816|gb|EDQ15157.1| hypothetical protein RLO149_07941 [Roseobacter litoralis Och 149] Length = 320 Score = 44.9 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 75/222 (33%), Gaps = 49/222 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +V+DLSGSM V ++ T+ A+ + Sbjct: 92 DLAIVLDLSGSM----------VRDDFDLDGRQVTRREAVAT-VGADFARRRGGD----- 135 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + L+ + + S+ E + + + + + T + + A + + + + Sbjct: 136 -RVALVVFGSEAYFAAPFSFDVEAIARQIESAQIGVSGRATSISDGLGIALKRMENSEAA 194 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 S + +I L+DG NN +N + + A + +++ TI++ Sbjct: 195 S------------------RVVILLSDGVNNAGATNPRGVA--ELAAQMGVRVHTIALGP 234 Query: 370 ----NASPNGQ-----RLLKTCVS-SPEYHYNVVNADSLIHV 401 +A P + L+ S + V + L+ V Sbjct: 235 KDLSSADPGERGVVDAATLRAISEISGGESFRVRTTEDLVAV 276 >gi|73538307|ref|YP_298674.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72121644|gb|AAZ63830.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 353 Score = 44.9 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 34/256 (13%), Positives = 76/256 (29%), Gaps = 65/256 (25%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + L +D+S SM D T+++A + A DL+ + Sbjct: 86 ITLVLAMDVSRSMEA---------------TDVAPTRISAAQQAAR------DLIVGLPA 124 Query: 251 DVYMGLIGY-----------TTRVE-----KNIEPSWGTEKVRQYVTRDMDSLILKPTD- 293 V +G++ + + R + + GT + D Sbjct: 125 SVRLGIVSFAGTATVVLRPTSNRQDMLDAIDRFQLQRGTATGSGLIQALAVLFPDDGIDL 184 Query: 294 -----STPAMKQAYQILTSDKKRSFFT----NFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + + + D+ + Q + S I L+DG Sbjct: 185 EAILFADEPVFSTRRAVPLDEAAAADAVRKREQATQSAQPGSYRHGAVI-LLSDGRRTVG 243 Query: 345 KSNVNTIKICDKAKENFIKIVTISI-----NASPNGQR--------LLKTCVS-SPEYHY 390 V+ ++ A + +++ T+ NA L+ + + ++ Sbjct: 244 PDPVDAARM---AAQRGVRVYTVGFGTLGGNAPETSLSYYMQLDEPALRAVATITGAEYF 300 Query: 391 NVVNADSLIHVFQNIS 406 +A L V++ +S Sbjct: 301 QAGSAADLSQVYRQLS 316 >gi|183985704|gb|AAI66222.1| LOC100158554 protein [Xenopus (Silurana) tropicalis] Length = 899 Score = 44.9 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 82/217 (37%), Gaps = 51/217 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+SGSM ++ L A +++ I V++ Sbjct: 309 VSLVLDISGSM-------------------TNANRITRLYQASEVYIMQI-----VEQGA 344 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKR 311 Y+G++ ++ E + T+ ++ + + ++ T+ ++Q Q+ + + Sbjct: 345 YVGIVTFSNVAEIKSQLVKITDTFQRESLKLKLPTVATGGTNICAGVQQGLQVNRNLDQS 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISI- 369 + T I+ LTDGE++ S C ++ I TI++ Sbjct: 405 THGTE----------------IVLLTDGEDSGISS-------CFPDITKSGAIIHTIALG 441 Query: 370 -NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 NA P ++L Y + V+ + LI F I Sbjct: 442 NNADPGLEKLADLTGGLKLYASDKVDVNGLIDSFSGI 478 >gi|85374662|ref|YP_458724.1| von Willebrand factor type A domain-containing protein [Erythrobacter litoralis HTCC2594] gi|84787745|gb|ABC63927.1| von Willebrand factor type A domain protein [Erythrobacter litoralis HTCC2594] Length = 580 Score = 44.9 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 41/250 (16%), Positives = 88/250 (35%), Gaps = 47/250 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + +++ A IE ERP + ++D+SGSM + K Sbjct: 192 NEDTRLIRIGLAGYDIERSERPPANLVFLMDVSGSMG-------------------RPDK 232 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K AL + V VY G G +EP+ T K+R + + Sbjct: 233 LPLVKTALAGLAGELQPQDKVSIVVYAGAAGL------VLEPTNDTRKIRAALNQLQAGG 286 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKS 346 ++ AYQ + +F + +I TDG+ N S Sbjct: 287 ST---AGGAGIQLAYQ----IAEDNFIEGGVNR------------VILATDGDFNVGVSS 327 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 I++ +K +++ I + T+ + +++ + +Y +++ + + + Sbjct: 328 RDALIEMIEKKRDSGITLTTLGFGTGNYNEAMMEQIANHGNGNYAYIDSA--LEAKKVLG 385 Query: 407 QLMVHRKYSV 416 M +++ Sbjct: 386 DEMSSTLFTI 395 >gi|288942396|ref|YP_003444636.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288897768|gb|ADC63604.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 346 Score = 44.9 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 33/239 (13%), Positives = 74/239 (30%), Gaps = 56/239 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P + + + VD S SM + + ++ +MA +K + F+D+ Sbjct: 97 PGYDLMIAVDASHSMEALDFTV----------EGRQVNRMAVVKGVMGRFIDA------- 139 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + ++ + RQ + + S+ T A+ L Sbjct: 140 RQGDRVGLILFGSQAFILSPLTLDRHAARQLLDGVVPSIAGPATALGDAIALGVSKLRER 199 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + S + +I + DG+NN A+ +I I Sbjct: 200 PEGS------------------RVMIVIADGDNNAGSFAPKEAAR--LARATGTRIYVIG 239 Query: 369 I-----NASPNGQRLLKTCVS--------------SPEYHYNVVNADSLIHVFQNISQL 408 + + + ++ + ++ + +L + I QL Sbjct: 240 VGSKQPSIPILEEGSVRYRDDLTMDEGTLQEIADLTGGGYFRATDTRALEEISSRIGQL 298 >gi|226496057|ref|NP_001151334.1| LOC100284967 [Zea mays] gi|195645892|gb|ACG42414.1| protein binding protein [Zea mays] Length = 516 Score = 44.9 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 29/224 (12%), Positives = 67/224 (29%), Gaps = 54/224 (24%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 +R + V+D+SGSM K+ +K A+ + + Sbjct: 55 STSDRSGLDLVAVLDVSGSMQGE--------------------KIEKMKTAMKFVVKKLS 94 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAY 302 + + ++ + + TE + + + +D+L T+ + ++ Sbjct: 95 SID------RLSIVTFLDTANRICPLQQVTEDSQPQLLKLIDALQPGGNTNISDGLQTGL 148 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++L K S ++ ++DG+ N + N K + Sbjct: 149 KVLADRKLSSGRVVG---------------VMLMSDGQQNRGEPAAN-------VKIGNV 186 Query: 363 KIVTISINA---SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + T A + + S V + + L F Sbjct: 187 PVYTFGFGADYDPTVLNAVARN--SMGGTFSVVNDVNLLSMAFS 228 >gi|149636528|ref|XP_001511995.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Ornithorhynchus anatinus] Length = 800 Score = 44.9 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 66/177 (37%), Gaps = 49/177 (27%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ + A+ LFL I ++ ++ Sbjct: 309 LVLDKSGSMAAM-------------------DRLNRMNQAVKLFLLQI-----TEKGSWV 344 Query: 255 GLIGYTTR-VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + R + +N +E R Y+ + T ++ A+Q + + + Sbjct: 345 GIVLFDERAIIRNPLIQIISEDDRNYLMTRLPEAAGGGTSICSGVQAAFQAIKQKFQTTD 404 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISI 369 + I+ LTDGE+ T+ C ++ K++ I T+++ Sbjct: 405 GSE----------------IVLLTDGED-------VTVSSCFEEVKQSGATIHTVAL 438 >gi|309792347|ref|ZP_07686816.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225613|gb|EFO79372.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 845 Score = 44.9 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 25/202 (12%), Positives = 66/202 (32%), Gaps = 42/202 (20%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD------------------- 222 +I + +V+D SGSM + + C + Sbjct: 394 QIIPTSQKPIQYVVVLDASGSMSANFDGQCNNSGGVKQCANGPSGFPDVQVSNTGYDYWW 453 Query: 223 --KKRTKMAALKNALLLFLDSIDLLSHVKEDV------YMGLIGYTTRV--EKNIEPSWG 272 + + ++ K AL + + + M ++ + V + + Sbjct: 454 TTESQRRIYVAKKALERLVT-LSNMPGNPGYTNTRPSDQMAVVWFNDGVSSSQTQAFTNN 512 Query: 273 TEKVRQYVTRDMDSL----ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 ++ Y+T + T+ + +A +L + ++ N ++ Sbjct: 513 PTTLKNYITTLNNVNGNYRSAGGTNGAGGLYRA-SLLYQNAPKTVSFNGT-------NVE 564 Query: 329 FQKFIIFLTDGENNNFKSNVNT 350 +++ ++F+TDG +N F + + Sbjct: 565 YKRVVLFVTDGVSNYFLNTSAS 586 >gi|254458905|ref|ZP_05072328.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084176|gb|EDZ61465.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 309 Score = 44.9 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 36/251 (14%), Positives = 73/251 (29%), Gaps = 37/251 (14%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 +L F L + + + + I L++D S SM Sbjct: 50 KLLFFLKWLGIIMLILALMSPVKDEPYTLEPKDGYEIALILDASQSM----------KAQ 99 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + + T+ +K+ + F+ + +GL+ + ++ + Sbjct: 100 GFDVTNPQLTRFDVVKDIVSNFIKERQNDN-------IGLVVFGAYSFIASPLTYDENIL 152 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + V++ M Y L + + + K I L Sbjct: 153 NKIVSQL-----------YIGMAGKYTALFTSLAQGVNLLKMSESK-------SKVGILL 194 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKTCVS-SPEYHYNVVN 394 TDG + + D A + IKI I I +L+ + + + Sbjct: 195 TDGFSTPEVDKIPFDVALDMAIKEKIKIYPIGIGMPHEYNIEVLRKIAEKTGGKAFGAAS 254 Query: 395 ADSLIHVFQNI 405 A L V++ I Sbjct: 255 ATELKEVYKEI 265 >gi|281340555|gb|EFB16139.1| hypothetical protein PANDA_003424 [Ailuropoda melanoleuca] Length = 191 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 20/147 (13%), Positives = 48/147 (32%), Gaps = 17/147 (11%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V + I Y+T ++ + ++R+ + + + + T +++A + + Sbjct: 31 DNPKVRISFITYSTDGHTLMKITSDKNEIRENLAKLQNVVPSGATHMQEGLRKANEQIEQ 90 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + I+ LTDG T ++++ + I Sbjct: 91 ENAGEKKAPI--------------VILALTDG-TLLPFPFEETKMEAEESRRLGATVYCI 135 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVN 394 + Q L SP++ + V N Sbjct: 136 GVKDYRKDQLL--DIADSPDHMFGVDN 160 >gi|332817903|ref|XP_003310057.1| PREDICTED: collagen alpha-6(VI) chain [Pan troglodytes] Length = 2263 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 55/353 (15%), Positives = 108/353 (30%), Gaps = 66/353 (18%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 SN LG E+I A F + ++++ + + N Sbjct: 494 SNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQRGNK-VPCHLVVLTNGMSKDS 552 Query: 130 THMANNRL------------DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 NRL +N T + Y + + L + R + Sbjct: 553 ILEPANRLREEHIRVYAIGIKEANQTQLREIAGEEKRVYYVHNFDALKDIRN-----QVV 607 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + E + I +VD SGS+ + +KM K + Sbjct: 608 QEICTEEACKDMKADIMFLVDSSGSIG-----------------PENFSKM---KTFMKN 647 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + + V +G++ ++ ++ + R M + ++ Sbjct: 648 LVSKSQIGP---DRVQIGVVQFSDINKEEFQL-----------NRFMSQSDIS--NAIDQ 691 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 M Q + SF + +F K +KF+I +TDGE + + Sbjct: 692 MAHIGQTTLTGSALSFVSQYF-SPTKGARPNIRKFLILITDGEAQDIVKEPAVV-----L 745 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 ++ + I ++ + S Q L+ PE + V N + Q I +V Sbjct: 746 RQEGVIIYSVGVFGSNVTQ--LEEISGRPEMVFYVEN----FDILQRIEDDLV 792 >gi|254776724|ref|ZP_05218240.1| hypothetical protein MaviaA2_18936 [Mycobacterium avium subsp. avium ATCC 25291] Length = 335 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 75/234 (32%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D ++AA K A F D + + Sbjct: 99 VMLVIDVSESMAS---------------TDVPPNRLAAAKEAGKQFADQLTPA------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + P+ V+ + + T + + A Q + + Sbjct: 138 NLGLVEFAANATLLVPPTTNRSAVKAGIDSLQPAPK---TATGEGIFTALQAIAT----- 189 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNTIKICDKAKENFIKIVTISI 369 + P I+ +DG N + + AK ++I TIS Sbjct: 190 -----VGSVMGGGEGPPPARIVLESDGAENVPLDPNAPQGAFTAARAAKAEGVQISTISF 244 Query: 370 N--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + Q L K C + ++ + DSL +V+ + + + Sbjct: 245 GTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADSLDSLKNVYSTLQRQI 298 >gi|257785062|ref|YP_003180279.1| Cna B domain-containing protein [Atopobium parvulum DSM 20469] gi|257473569|gb|ACV51688.1| Cna B domain protein [Atopobium parvulum DSM 20469] Length = 863 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 39/254 (15%), Positives = 82/254 (32%), Gaps = 53/254 (20%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 I E+ + +V D S SM ++ A K + Sbjct: 74 IKGETSAASEEQKA-NVLVVFDNSSSMTAQTGGGEM--------------RLDAAKRVVN 118 Query: 237 LFLDSIDLLSHV--KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 +I ++ K+ V M L+ + + N+E W T + ++ + T+ Sbjct: 119 QLSSTILGINRNAQKDVVEMALLSFNEK--PNLECGW-TADLNEFQRATNNMGFHTGTNW 175 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG----ENNNFKSNVNT 350 A+++A + +D+K + ++IF+TDG + N + N Sbjct: 176 ESALERA--KVLADQKAANGNPT--------------YVIFVTDGLPTQDRNGWVRNNQI 219 Query: 351 --IKICDKAK---ENFIKIVTISINASPNGQRLLKTCVSSPE--------YHYNVVNADS 397 D+A+ ++ + R L + Y+Y N Sbjct: 220 GYEHALDEARAIGSAGYHFYSVYMYGGHAYLRQLTNYAYTGNPFGNPGGTYYYEANNTAQ 279 Query: 398 LIHVFQNISQLMVH 411 + F+ I+ ++ Sbjct: 280 MEQAFKEIASVITK 293 >gi|218679029|ref|ZP_03526926.1| hypothetical protein RetlC8_09174 [Rhizobium etli CIAT 894] Length = 151 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 17/64 (26%), Positives = 34/64 (53%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 ++R +F + I + +II L + LL +G I + + + ++SA ++A+LA Sbjct: 1 MTRRLSFFSRLIDNRDGAVAIIVILVAVPLLLAVGASIDFIRAYNNRVDLQSAVDSAVLA 60 Query: 64 GASK 67 A+K Sbjct: 61 AAAK 64 >gi|326334019|ref|ZP_08200248.1| BatA protein [Nocardioidaceae bacterium Broad-1] gi|325948168|gb|EGD40279.1| BatA protein [Nocardioidaceae bacterium Broad-1] Length = 336 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 44/261 (16%), Positives = 88/261 (33%), Gaps = 50/261 (19%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + P + ER L+ + D S SM +D ++ A K Sbjct: 70 VATTQPQVRAQVPYERATILVAI--DTSASM---------------TAEDMPPNRLEAAK 112 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A + F+D + + +V GL+ +++ P+ V + + T Sbjct: 113 AAAISFIDQL----PARYNV--GLVSFSSSARVVTSPTTDHALVVRSIDGLGP--PDGGT 164 Query: 293 DSTPAMKQAYQILTSDKKRSFFT---------NFFRQGVKIPSLPFQKFIIFLTDGENNN 343 A+ + L + + + + + ++ L+DG N+ Sbjct: 165 AIGEAVYSSIDDLQQILEEAAASGPSQEPEESEGSEKSEESEEERSPAHLVLLSDGGNSA 224 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINAS------PNGQRL------LKTCV-SSPEYHY 390 +S V + A+E + TI+ P GQ + L+ ++ Y Sbjct: 225 GRSPVAAAEA---AREAGLPTSTIAYGTEGSSATLPGGQSVEVREDTLRNLADTTGGRFY 281 Query: 391 NVVNADSLIHVFQNISQLMVH 411 +AD L V+ +I L+ H Sbjct: 282 RASSADELREVYDDIGTLVGH 302 >gi|224049043|ref|XP_002191793.1| PREDICTED: similar to capillary morphogenesis protein 2 [Taeniopygia guttata] Length = 554 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 58/187 (31%), Gaps = 43/187 (22%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 E F + V+D SGS+ + VN + Sbjct: 111 EPSCHGAFDMYFVLDKSGSVATNWREIFDFVNQLTERFVSPK------------------ 152 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + I ++T+ + + EK+++ + + T +KQA Sbjct: 153 --------MRLSFIVFSTQAHVIMPLTGDREKIKKGLKDLEEVKPAGETYIHEGLKQA-- 202 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 Q K + F II LTDG+ + + + K K++E + Sbjct: 203 --------------NEQIAKQGASRFSSIIIALTDGK-LDGQIPLYAEKEAKKSRELGAR 247 Query: 364 IVTISIN 370 + + + Sbjct: 248 VYCVGVQ 254 >gi|326332116|ref|ZP_08198400.1| putative von Willebrand factor type A domain protein [Nocardioidaceae bacterium Broad-1] gi|325950087|gb|EGD42143.1| putative von Willebrand factor type A domain protein [Nocardioidaceae bacterium Broad-1] Length = 338 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 54/194 (27%), Gaps = 34/194 (17%) Query: 171 QKIVSFIPALLRIEMGERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 IV P+ R + +VVD + SM E +M Sbjct: 59 TVIVMVRPSWGAAPSEMRTADLDVLVVVDRTRSMVAEDGPGGEA-------------RMT 105 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI-EPSWGTEKVRQYVTRDMDSLI 288 LK L ++ V G I + V + ++ T +V Sbjct: 106 QLKKDLKALSAALPS-------VRFGAITFGGEVVRTEMPFTYDTTAFNAWVDGLYAERA 158 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + S + + + R + P ++ ++F +DGEN + Sbjct: 159 FDGSGS----------MVDAPRDEVISALERDQERFP--ERRRIVVFASDGENTREGVDQ 206 Query: 349 NTIKICDKAKENFI 362 + D + Sbjct: 207 QSFSEIDDLSAGGV 220 >gi|326918656|ref|XP_003205604.1| PREDICTED: anthrax toxin receptor 2-like, partial [Meleagris gallopavo] Length = 480 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 37/261 (14%), Positives = 84/261 (32%), Gaps = 50/261 (19%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 ++ L R+N+ + PA + F + V+D SGS+ + + VN Sbjct: 17 VRGQCQTLGLRFNRSFRGYXPA--DEQPSCHGAFDMYFVLDKSGSVAQNWHEIFDFVNQL 74 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + + + I ++++ + + + EK++ Sbjct: 75 TERFVSPK--------------------------MRLSFIVFSSQAQVIMPLTGDREKIK 108 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + T +KQA Q K + F II LT Sbjct: 109 EGLKNLSEVKPAGDTYIHEGLKQA----------------NLQIEKQGASRFSSIIIALT 152 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG+ + + + K +++ ++ + + Q L+ + E + V Sbjct: 153 DGK-LDGQIPLYAEKEAKTSRQLGARVYCVGVL--DFVQAQLERIADTKEQVFPVTGGFQ 209 Query: 398 LIHVFQNISQLMVHRKYSVIL 418 + I ++ + + IL Sbjct: 210 ---ALKGIINSVLKQSCTEIL 227 >gi|325927536|ref|ZP_08188772.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas perforans 91-118] gi|325542075|gb|EGD13581.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas perforans 91-118] Length = 501 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 36/238 (15%), Positives = 81/238 (34%), Gaps = 54/238 (22%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 I + P + +VD+SGSM K+ L+++L L + + Sbjct: 124 IATADLPPANLVFLVDVSGSMDA-------------------PDKLPLLQSSLKLLVRQL 164 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQA 301 + L+ Y + + P+ G + + + +D+L T ++ A Sbjct: 165 RAQD------RITLVTYAGNISVVLPPTPGDQ--QGRIVEAIDALQSGGSTAGASGIELA 216 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-ICDKAKEN 360 Y+ +QG + I+ TDG+ N +N + +K + + + + Sbjct: 217 YKA-------------AQQGYLRGGINR---ILLATDGDFNVGVTNFDQLKGMVAEKRRS 260 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---------ADSLIHVFQNISQLM 409 I + T+ L++ + + Y ++ L I++ + Sbjct: 261 GIALSTLGFGTGNYNDNLMEQLADAGDGAYAYIDTALEARKVLTHELGATLATIARDV 318 >gi|296207278|ref|XP_002750612.1| PREDICTED: cartilage matrix protein [Callithrix jacchus] Length = 496 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 58/168 (34%), Gaps = 22/168 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I+ L +G++ Y + V++ K + T + A++ A Sbjct: 68 IESLDVGPNATRVGVVNYASTVKQEFPLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 SD + G + S K +I +TDG + +V+ +A+ + Sbjct: 128 ITKAFSDAE----------GGRSRSPDISKVVIVVTDGRPQDSVRDVS-----ARARASG 172 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEY----HYNVVNA-DSLIHVFQN 404 +++ I + + L+ S P+ + N + L FQ Sbjct: 173 VELFAIGVG--RVDKATLQQIASEPQEEHVDYVESYNVIEKLSRKFQE 218 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 36/223 (16%), Positives = 83/223 (37%), Gaps = 40/223 (17%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKED 251 +N+D + N T + L + + F++ I D L + Sbjct: 252 GFTLNTDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKL 311 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDK 309 +GL+ Y++ V + G ++ + + S + K T + A+K D Sbjct: 312 AQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----LIDN 364 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + K+ I+F TDG + ++ ++ KAK+ K+ + + Sbjct: 365 SFTVSSGARPGAQKVG-------IVF-TDGRSQDYINDAA-----KKAKDLGFKMFAVGV 411 Query: 370 -NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 NA + L+ S P E+++ + ++ + + + + + Sbjct: 412 GNAVEDE---LREIASEPVAEHYFYTADFKTINQIGKKLQKKI 451 >gi|317502942|ref|ZP_07961034.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] gi|315665941|gb|EFV05516.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] Length = 332 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 72/244 (29%), Gaps = 68/244 (27%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D K ++ A KN F+ D + Sbjct: 88 IDIMLAMDVSTSMLAE---------------DLKPNRLEAAKNVASEFIS--DRPNDN-- 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-----SLILKPTDSTPAMKQAYQIL 305 +GL + + + + LI T + A L Sbjct: 129 ---IGLTIFAGEAFTQCPMTTDHASLINMLRSVRTDIAARGLISDGTAIGMGLANAVSRL 185 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 K +S K +I LTDG NN + T AK I++ Sbjct: 186 KDSKAKS------------------KVVILLTDGSNNMGDISPLTSAQ--IAKSLGIRVY 225 Query: 366 TISI---------NASPNGQRL-----------LKTCV-SSPEYHYNVVNADSLIHVFQN 404 TI + G + LK ++ +Y + + L ++++ Sbjct: 226 TIGVGTNKVAPYPMPVAGGVQYVNIPVEIDSKTLKNIAETTDGNYYRATSNNQLKQIYKD 285 Query: 405 ISQL 408 I +L Sbjct: 286 IDKL 289 >gi|291242482|ref|XP_002741137.1| PREDICTED: chloride channel calcium activated 3-like [Saccoglossus kowalevskii] Length = 975 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 40/285 (14%), Positives = 84/285 (29%), Gaps = 58/285 (20%) Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + + N + + + + + + + I P Sbjct: 279 DDPSDMYSFHNRMAPNRQNIICEGQSAWGVMLENT--------DFKDGNNPTRIIDDTTP 330 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + G R + LV+D SGSM ++ L + F Sbjct: 331 NFIIKRSGVRR---VVLVLDTSGSMDGD--------------------RIQRLYQSATYF 367 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 ++ + + +G Y + E +G + R V+ + T + Sbjct: 368 IE--NRIEDGSFVGIVGFSSYAVILASMTELKYGYQ--RSEVSSKVPQEADGATSIGGGV 423 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKA 357 + A Q+L S + ++ +TDG EN + + + Sbjct: 424 RLALQVLQDGNVTSEGAS----------------LLLITDGVENTYP----FLMNVMQEV 463 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ +++ TI+ + Q L+ S+ YN V D F Sbjct: 464 YDSGVRVDTIAF--TEAAQSTLQQLSSNTGGLYNYVPEDDNSTAF 506 >gi|220933243|ref|YP_002512142.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] gi|219994553|gb|ACL71155.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] Length = 325 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 36/223 (16%), Positives = 72/223 (32%), Gaps = 42/223 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVD SGSM + + + +++ L+ + F+ + +E Sbjct: 98 DLVLVVDTSGSM----------LVRDYRAEGRPVSRIEVLQGVVTRFVRA-------REG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 LI + + E V + R ++ T + A + L + Sbjct: 141 DRFALIPMAEEAATLVPLTGDRELVASQLARLRAGMLGDDTAIGDGIALALRQLQASGAE 200 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-- 369 +I +DGE+N + + A+ + + T+ I Sbjct: 201 RRPA-----------------LILFSDGESNAGL--LRPSEALALARAAGVALYTVEITG 241 Query: 370 ---NASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 A G+ L ++ H++V + L V I +L Sbjct: 242 GQALAPVEGEPSLADMAETTGGRHFHVTRSADLEAVIATIDRL 284 >gi|56797859|emb|CAG27402.1| matrilin-3b [Danio rerio] Length = 434 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 62/178 (34%), Gaps = 34/178 (19%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDST 295 + +D L + + L+ Y + V + +V+Q +R + T + Sbjct: 227 LSEMVDSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDP--LSTGTMTG 284 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI----IFLTDGENNNFKSNVNTI 351 A+K A + ++ + P +K I I +TDG + V + Sbjct: 285 MAIKTAMDQVFTENA--------------GARPLKKGIGKVAIIVTDG---RPQDKVEEV 327 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 A+ + I+I + + R LK S P ++ + V + L F+ Sbjct: 328 S--AAARASGIEIYAVGV--DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRE 381 >gi|223973011|gb|ACN30693.1| unknown [Zea mays] Length = 481 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 29/224 (12%), Positives = 67/224 (29%), Gaps = 54/224 (24%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 +R + V+D+SGSM K+ +K A+ + + Sbjct: 20 STSDRSGLDLVAVLDVSGSMQGE--------------------KIEKMKTAMKFVVKKLS 59 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAY 302 + + ++ + + TE + + + +D+L T+ + ++ Sbjct: 60 SID------RLSIVTFLDTANRICPLRQVTEDSQPQLLKLIDALQPGGNTNISDGLQTGL 113 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++L K S ++ ++DG+ N + N K + Sbjct: 114 KVLADRKLSSGRVVG---------------VMLMSDGQQNRGEPAAN-------VKIGNV 151 Query: 363 KIVTISINA---SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + T A + + S V + + L F Sbjct: 152 PVYTFGFGADYDPTVLNAVARN--SMGGTFSVVNDVNLLSMAFS 193 >gi|157105665|ref|XP_001648969.1| hypothetical protein AaeL_AAEL014547 [Aedes aegypti] gi|108868963|gb|EAT33188.1| hypothetical protein AaeL_AAEL014547 [Aedes aegypti] Length = 541 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 38/288 (13%), Positives = 90/288 (31%), Gaps = 67/288 (23%) Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI-------PALLRIE 184 + + + F ++ V +S + + + + N I + + + Sbjct: 69 DPTSHISMRDMLSFRSIAVDSSGSQEGLSMSNEIFKNENSVIKAKVDRLSGSFKKSVEKI 128 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + I ++D S S+ + +++ +K L F Sbjct: 129 KIKHKRVDIVFLIDASSSVG----------------KANFYSEIKFVKKLLSDF------ 166 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-----VTRDMD--SLILKPTDSTPA 297 + + +I ++++++ + V + + T + A Sbjct: 167 -NVSYNYTRVAVITFSSQMKIFRHIDQISTSVEDNDKCLLLNYQIPKIEFSGGGTYTYGA 225 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 +K+A +I + + S +K I +TDG F + + I + + Sbjct: 226 LKEAEEIFQNARADS-----------------KKIIFLITDG----FSNGRDPIPLAESL 264 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS-LIHVFQN 404 K + I +I I L SSP ++ S L+ F + Sbjct: 265 KRKNVVIYSIGIQ--SGNYAELYNMSSSPG------DSHSFLLDSFDH 304 >gi|109899476|ref|YP_662731.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like [Pseudoalteromonas atlantica T6c] gi|109701757|gb|ABG41677.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Pseudoalteromonas atlantica T6c] Length = 1364 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 51/352 (14%), Positives = 104/352 (29%), Gaps = 51/352 (14%) Query: 89 ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNM 148 A + + I+ + F N + N ++ + + + N + + Sbjct: 358 AYQNSSNNSASMTIRAANEADPNSFNNYPRYLLRNKAKTASVNWSGIERWYRNRD-YQSP 416 Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 V + + ++ + L + F ERP +LV++ G Sbjct: 417 SVASIINQLVERNDWQLGNDMVFILSDFSNTRGAYTYAERPSGAAKLVIEFQG------- 469 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 ++ +K+ L + + + +Y G++ Y ++ N Sbjct: 470 --QATPGQTSTVREHLVSKVDELSASGYT---------PIVDTLYEGVMYYGG-LDVNYG 517 Query: 269 PSWGTEKVRQYV---TRDMDSLILKPTDSTPAMKQAYQILTSDK---KRSFFTNFFRQGV 322 + G V V TR L D+T L+S ++ + + Sbjct: 518 LARGNNSVSNTVRRNTRVSHRLSYSGQDATLPSGCEEDNLSSSNCITQQIPQGARYISPI 577 Query: 323 KIPSLPFQKFIIFLTDGE--NNNFKSNVNTI--KIC--------------DKAKENF--- 361 I+ L+DGE NN+ + ++ C + A Sbjct: 578 SDRQCQVNNHIVLLSDGEANNNHSVDEIESLLSASCTGSGGEKCGLSLVRNVADSEGSVI 637 Query: 362 ---IKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLM 409 I TI A+ L Y N+ L+ FQ+I + + Sbjct: 638 DSRIITHTIGFAANTEANSFLNQIALQGGGGFYQADNSQELLGAFQSILKTV 689 >gi|329117975|ref|ZP_08246688.1| von Willebrand factor type A domain protein [Neisseria bacilliformis ATCC BAA-1200] gi|327465863|gb|EGF12135.1| von Willebrand factor type A domain protein [Neisseria bacilliformis ATCC BAA-1200] Length = 562 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 90/235 (38%), Gaps = 46/235 (19%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 Q + I I A + P L+ LV D+SGSM Sbjct: 171 PWQSEAKLIKIGIQAQDTAKKDLPPANLVFLV-DVSGSM-------------------TD 210 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K+ +K L + + + V LI Y + + + P+ G + + + R + Sbjct: 211 PDKLPLVKKTLRILTEQLRPQDKVT------LITYASGEQLVLPPTSGKD--KDTILRAL 262 Query: 285 DSLILKPTDSTP-AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 ++L S A++ AY+ ++++ N + II TDG+ N Sbjct: 263 NALHAGGATSGERALRMAYEQ----AEKAYVKNGINR------------IILATDGDFNV 306 Query: 344 FKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 S+ T+K + + +++ I + T+ + +++ + + +Y+ ++++ Sbjct: 307 GVSDTETLKSLVAEKRKSGISLSTLGYGTGNYNEAMMEQIADAGDGNYSYIDSEK 361 >gi|310641808|ref|YP_003946566.1| von willebrand factor type a [Paenibacillus polymyxa SC2] gi|309246758|gb|ADO56325.1| von Willebrand factor type A [Paenibacillus polymyxa SC2] Length = 600 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 43/220 (19%), Positives = 79/220 (35%), Gaps = 50/220 (22%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 ++ I+++ P LVVD+S SM+ SDP + + Sbjct: 13 GIICTMIMSSILAWQPQTANAASPSASKVDAVLVVDVSNSMN---TSDPGKIGN------ 63 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVT 281 A+K + + D + G++ YT V++ + + +Q + Sbjct: 64 ------EAMKMFIDMLSTQNDKV---------GIVAYTDVVQREKALLNISSEADKQELK 108 Query: 282 RDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 +D L TD++ +K+A +IL K I+ L DG Sbjct: 109 TFIDGLNRGAYTDTSVGVKEAIRILQDGKTAGHAP----------------MIVMLADGN 152 Query: 341 NNNFKSNVNT--------IKICDKAKENFIKIVTISINAS 372 N+ K+ T + +AK++ + I TI +NA Sbjct: 153 NDFNKTTGRTESQSAQDMAQAVAEAKKSGVPIYTIGLNAD 192 >gi|298370192|ref|ZP_06981508.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] gi|298281652|gb|EFI23141.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] Length = 1092 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 12/76 (15%), Positives = 28/76 (36%), Gaps = 10/76 (13%) Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNA 395 DG+ ++ + K ++ T+ S G+ L+ S ++N + Sbjct: 283 DGDYSDPRG--------TNFKNQLVQTYTVGFGSGVSATGEAYLRNGASGSNNYFNARDE 334 Query: 396 DSLIHVFQNISQLMVH 411 + L F I+ + + Sbjct: 335 NGLFAAFDAITDSIAN 350 >gi|229523713|ref|ZP_04413118.1| hypothetical protein VCA_001283 [Vibrio cholerae bv. albensis VL426] gi|229337294|gb|EEO02311.1| hypothetical protein VCA_001283 [Vibrio cholerae bv. albensis VL426] Length = 886 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 60/173 (34%), Gaps = 27/173 (15%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 E + ++L++D+SGSM + +T++ +K + + ++ Sbjct: 267 ESETKQGANVQLMLDVSGSMGRDAGNG--------------KTRLQVMKESAIQLIEQYQ 312 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 L + V + L ++ W T + + Y+ + TD A+K A Sbjct: 313 AL--GQTKVQLILFSSDASIKTASGLLWMTVAEAKNYINALSAN---GGTDYDDAIKLAQ 367 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE--NNNFKSNVNTIKI 353 + + + + + + D E N+ + N NTI+ Sbjct: 368 ESWSGTINGQPLSGATNVSYFLSDGVPEGY-----DWELKNSQWVQNQNTIET 415 >gi|294781746|ref|ZP_06747079.1| phage/colicin/tellurite resistance cluster TerY protein [Fusobacterium sp. 1_1_41FAA] gi|294481856|gb|EFG29624.1| phage/colicin/tellurite resistance cluster TerY protein [Fusobacterium sp. 1_1_41FAA] Length = 229 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 29/228 (12%), Positives = 77/228 (33%), Gaps = 40/228 (17%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + L++D+SGSM K+ L +A + Sbjct: 9 KPATAKHLPVVLLLDVSGSMSGE--------------------KIENLYDATNEMIKVFS 48 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAY 302 ++ + + +I + VE + + V + +R ++ + T A++ A Sbjct: 49 DAVSKEKIIDIAIITFGENVELHTPYT----SVVDFKSRGLNPFLASGMTPLGTALRMAK 104 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++ + PS ++ ++ ++DG + + + + Sbjct: 105 DMIEDKE-------------TTPSNIYRPAVVLVSDGVPTDEWRGP-LDNFKNNGRSSKC 150 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++I + ++LK+ E + N ++ F+ IS + Sbjct: 151 QRFAVAI-GNDADNQMLKSFAECNENFFIAENVSDIVDKFKQISMSVS 197 >gi|118375014|ref|XP_001020694.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89302461|gb|EAS00449.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 610 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 80/229 (34%), Gaps = 46/229 (20%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 L I RP + ++D S SM +K+ +KN +L Sbjct: 143 TQKLYISTSSRPNLDLVCIIDNSESMS-------------------GCSKIENVKNTILQ 183 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTP 296 L+ ++ + LI + + ++ + ++ + +S+ T+ T Sbjct: 184 LLEMLNEND------RLSLITFNSYAKQLCGLKKVSNLNKETLQAITNSIKAYGGTNITS 237 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN--TIKIC 354 ++ A+QIL S KK++ ++ I L+DG+++ + + Sbjct: 238 GLEIAFQILQSRKKKNSVSS----------------IFLLSDGQDDGADTKIKNLLKITY 281 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVF 402 + +E I + S S + L++ Y V D + F Sbjct: 282 QQLQEESFTIHSFSF-GSDHDCPLMQKIAQIKDGSFYFVEKNDQVDEFF 329 >gi|297671963|ref|XP_002814093.1| PREDICTED: collagen alpha-6(VI) chain-like [Pongo abelii] Length = 2291 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 56/341 (16%), Positives = 105/341 (30%), Gaps = 62/341 (18%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 SN LG E+I A F + ++++ + + N Sbjct: 522 SNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQRGNK-VPCHLVVLTNGMSKDS 580 Query: 130 THMANNRL------------DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 NRL +N T + Y + + L + R + Sbjct: 581 ILEPANRLREEHIRVYAIGVKEANQTQLREIAGEEKRVYYVHDFDALKDIRN-----QVV 635 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + E + I +VD SGS+ + +KM K + Sbjct: 636 QEICTEEACKEMKADIMFLVDSSGSIG-----------------PENFSKM---KTFMKN 675 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + + V +G + ++ ++ + R M + ++ Sbjct: 676 LVSKSQIGP---DRVQIGAVQFSDINKEEFQL-----------NRFMSQSDIS--NAIDQ 719 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 M Q + SF + +F K +KF+I +TDGE + V I Sbjct: 720 MAHIGQTTLTGSALSFVSQYF-SPTKGARPNVRKFLILITDGE---AQDIVKEPAI--AL 773 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 ++ + I ++ + S Q L+ PE + V N D L Sbjct: 774 RQEGVIIYSVGVFGSNVTQ--LEEISGRPEMVFYVENFDIL 812 >gi|148676906|gb|EDL08853.1| matrilin 2, isoform CRA_b [Mus musculus] Length = 941 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 61/164 (37%), Gaps = 21/164 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 ID L+ + +GL+ Y+T+V G ++ + + + Sbjct: 678 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGSMTGL 736 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K ++ + + G + PS + I TDG + +V+ + K Sbjct: 737 ALKHMFERSFTQVE----------GARPPSTQVPRVAIVFTDG---RAQDDVS--EWASK 781 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 AK N I + + + + + L+ S P ++ + + ++ Sbjct: 782 AKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTM 823 >gi|120407045|ref|NP_058042.2| matrilin-2 [Mus musculus] Length = 937 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 61/164 (37%), Gaps = 21/164 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 ID L+ + +GL+ Y+T+V G ++ + + + Sbjct: 678 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGSMTGL 736 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K ++ + + G + PS + I TDG + +V+ + K Sbjct: 737 ALKHMFERSFTQVE----------GARPPSTQVPRVAIVFTDG---RAQDDVS--EWASK 781 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 AK N I + + + + + L+ S P ++ + + ++ Sbjct: 782 AKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTM 823 >gi|20136122|gb|AAM11539.1| matrilin-2 [Mus musculus] Length = 956 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 61/164 (37%), Gaps = 21/164 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 ID L+ + +GL+ Y+T+V G ++ + + + Sbjct: 678 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGSMTGL 736 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K ++ + + G + PS + I TDG + +V+ + K Sbjct: 737 ALKHMFERSFTQVE----------GARPPSTQVPRVAIVFTDG---RAQDDVS--EWASK 781 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 AK N I + + + + + L+ S P ++ + + ++ Sbjct: 782 AKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTM 823 >gi|62185620|gb|AAH92298.1| Matrilin 2 [Mus musculus] Length = 937 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 61/164 (37%), Gaps = 21/164 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 ID L+ + +GL+ Y+T+V G ++ + + + Sbjct: 678 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGSMTGL 736 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K ++ + + G + PS + I TDG + +V+ + K Sbjct: 737 ALKHMFERSFTQVE----------GARPPSTQVPRVAIVFTDG---RAQDDVS--EWASK 781 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 AK N I + + + + + L+ S P ++ + + ++ Sbjct: 782 AKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTM 823 >gi|74202868|dbj|BAE37504.1| unnamed protein product [Mus musculus] Length = 928 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 61/164 (37%), Gaps = 21/164 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 ID L+ + +GL+ Y+T+V G ++ + + + Sbjct: 678 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGSMTGL 736 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K ++ + + G + PS + I TDG + +V+ + K Sbjct: 737 ALKHMFERSFTQVE----------GARPPSTQVPRVAIVFTDG---RAQDDVS--EWASK 781 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 AK N I + + + + + L+ S P ++ + + ++ Sbjct: 782 AKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTM 823 >gi|7387906|sp|O08746|MATN2_MOUSE RecName: Full=Matrilin-2; Flags: Precursor gi|2072792|gb|AAC53163.1| matrilin-2 precursor [Mus musculus] Length = 956 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 61/164 (37%), Gaps = 21/164 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 ID L+ + +GL+ Y+T+V G ++ + + + Sbjct: 678 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGSMTGL 736 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K ++ + + G + PS + I TDG + +V+ + K Sbjct: 737 ALKHMFERSFTQVE----------GARPPSTQVPRVAIVFTDG---RAQDDVS--EWASK 781 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 AK N I + + + + + L+ S P ++ + + ++ Sbjct: 782 AKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTM 823 >gi|312110043|ref|YP_003988359.1| hypothetical protein GY4MC1_0934 [Geobacillus sp. Y4.1MC1] gi|311215144|gb|ADP73748.1| Ig domain protein group 2 domain protein [Geobacillus sp. Y4.1MC1] Length = 932 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 39/186 (20%), Positives = 67/186 (36%), Gaps = 46/186 (24%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + R++ RP + V D+SGSM K+ + K AL Sbjct: 66 LTPQGRVDNIIRPPIDVVFVFDVSGSM------------------VMPSLKLDSAKYALQ 107 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI--EPSWGTEKVRQYVTRDMDSL----ILK 290 +D ++ + L+ ++ V+ + GT V+Q++ Sbjct: 108 SAVDYFKANANPND--RFALVPFSDGVQSDKVVPFPSGTYDVKQHLNWIATVANSLRANG 165 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVN 349 T+ T A++QA Q +D R +K+IIFLTDG + Sbjct: 166 GTNYTQALQQA-QSFFNDPAR------------------KKYIIFLTDGMPTVSIAKEPI 206 Query: 350 TIKICD 355 T K+C+ Sbjct: 207 TYKVCE 212 >gi|108760959|ref|YP_630756.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108464839|gb|ABF90024.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 476 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 35/241 (14%), Positives = 82/241 (34%), Gaps = 51/241 (21%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 ++ +F + ++ +R + LV+D SGSM K+ Sbjct: 73 TSEVFATFDLSGAQVPGAQRSPVNLALVIDRSGSMSGY--------------------KL 112 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP---SWGTEKVRQYVTRDMD 285 A K A + ++ + +I Y + V+ + E++ QYV D Sbjct: 113 AQAKQAARHLIGLLNDQD------RLAIIHYGSDVKSLPSLEATAANRERMFQYVDGIWD 166 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNF 344 T+ + A + S +R++ N +I ++DG+ Sbjct: 167 E---GGTNIGAGLS-AGRYQLSTAQRTYGVNR---------------LILMSDGQPTEGL 207 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQ 403 ++ ++ + + + + I + + + L++ + + +A L +FQ Sbjct: 208 TADEELTRMARELRATGLTLSAIGV-GTDFNEDLMQAFAEYGAGAYGFLEDAAQLSTLFQ 266 Query: 404 N 404 Sbjct: 267 K 267 >gi|260837294|ref|XP_002613640.1| hypothetical protein BRAFLDRAFT_227016 [Branchiostoma floridae] gi|229299026|gb|EEN69649.1| hypothetical protein BRAFLDRAFT_227016 [Branchiostoma floridae] Length = 216 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 58/171 (33%), Gaps = 23/171 (13%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K L + D+ +G++ Y+ + + + + + Sbjct: 21 EKMKTFLNNIVGQFDIGPTA---TQVGVVQYSWFIRQECAL-NAHSSLASLQQAISNITV 76 Query: 289 LK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 L T + A+ A + N R GV K ++ +TDG + Sbjct: 77 LGLGTHTGAALTFARNTALTA------ANGARPGVP-------KIVVVMTDG----ASED 119 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 T+ + + + IS++ S RLL+ SP+ + + D+L Sbjct: 120 DVTLPS-QNLRNDGVITFAISVSWSLPNDRLLQDIAGSPDRIFAATDFDAL 169 >gi|332232509|ref|XP_003265447.1| PREDICTED: collagen alpha-6(VI) chain [Nomascus leucogenys] Length = 2264 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 55/353 (15%), Positives = 108/353 (30%), Gaps = 66/353 (18%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 SN LG E+I A F + ++++ + + N Sbjct: 495 SNRQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQRGNK-VPCHLVVLTNGMSKDS 553 Query: 130 THMANNRL------------DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 NRL +N T + Y + + L + R + Sbjct: 554 ILEPANRLREEHIRVYAIGVKEANQTQLREIAGEEKRVYYVHDFDALKDIRN-----QVV 608 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + E + I +VD SGS+ + +KM K + Sbjct: 609 QEICAEEACKEMKADIMFLVDSSGSIG-----------------PENFSKM---KTFMKN 648 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + + V +G++ ++ ++ + R M + ++ Sbjct: 649 LVSKSQIGP---DQVQIGVVQFSDINKEEFQL-----------NRFMSQSDIS--NAIDQ 692 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 M Q + SF + +F K +KF+I +TDGE + + Sbjct: 693 MAHIGQTTLTGSALSFVSQYF-SPTKGARPNIRKFLILITDGEAQDIVKEPAVV-----L 746 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 ++ + I ++ + S Q L+ PE + V N + Q I +V Sbjct: 747 RQEGVIIYSVGVFGSNVTQ--LEEISGRPEMVFYVEN----FDILQRIEDDLV 793 >gi|257870337|ref|ZP_05649990.1| von Willebrand factor type A domain-containing protein [Enterococcus gallinarum EG2] gi|257804501|gb|EEV33323.1| von Willebrand factor type A domain-containing protein [Enterococcus gallinarum EG2] Length = 1169 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 59/154 (38%), Gaps = 39/154 (25%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E + LVVD SGSM ++ ++ + F+D++ S Sbjct: 333 EIAPLDLVLVVDWSGSM-------------------NDNNRIGEVQKGVDRFVDTLAE-S 372 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + ++++MG +GY++ KN + G + V+ + S T + A++ A +L Sbjct: 373 GITDNIHMGYVGYSSDGYKNDSVAMGPFDSVKNAIKTITPSSTTGGTFTQKALRDAGNML 432 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + +K I+ LTDG Sbjct: 433 ATPNGH------------------KKVIVLLTDG 448 >gi|238782874|ref|ZP_04626903.1| tight adherance operon protein [Yersinia bercovieri ATCC 43970] gi|238716297|gb|EEQ08280.1| tight adherance operon protein [Yersinia bercovieri ATCC 43970] Length = 530 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 48/281 (17%), Positives = 88/281 (31%), Gaps = 69/281 (24%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 +LLS F +K I ++K + F + + F+ L+ + KK + A A Sbjct: 21 NLLSLFIIISRKFIKNDKGAILLPFIIILPFFIALLFLSFEISQLLQKKAKLSDAIEQAT 80 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 LA + L D + N +D F ++ F EI Sbjct: 81 LA----LTVENDDLPDELQMRKN------VDLVSNFSSAYLPLE------HFSVPEIDIK 124 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP--- 178 N + YN + SY + L++ ++ I Sbjct: 125 NNC-----------------GQLTYNAKITMSY------FANFLSKTAMTNAITTIGTED 161 Query: 179 ---ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 A+ ++ + + V D SGSM+ + + + K+ AL++ Sbjct: 162 NGAAIKQVSTIQDKATDVIFVADYSGSMNEGFHG-----------KVPRGEKINALRDVF 210 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 SI L + + SWGT+++ Sbjct: 211 NRLNGSI-------------LKNSNINLIGFVPFSWGTKRI 238 >gi|297191182|ref|ZP_06908580.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] gi|197718543|gb|EDY62451.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] Length = 424 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 73/227 (32%), Gaps = 32/227 (14%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 E + +ELV+D+SGSM +++MAA K A Sbjct: 24 PTGIAAADEPVAKESPKVELVLDVSGSMRAR--------------DIDGKSRMAAAKQAF 69 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L V E+V +G+ + G + RQ + P D T Sbjct: 70 NEVL------DAVPEEVRLGIRTLGADYPGDDRKR-GCKDTRQ-------LYPVGPLDRT 115 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A K A L + + I+ +TDGE+ + + Sbjct: 116 EA-KAAVATLAPTGWTPIGPALLGAAEDLEGGDATRRIVLITDGEDTCAPLDPCEVAREI 174 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 AK + I T+ + + L+ C++ + +V + D L Sbjct: 175 AAKGIHLVIDTLGLVPDAKTRTQLR-CIAEATGGTYTSVQHTDELSD 220 >gi|262193846|ref|YP_003265055.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262077193|gb|ACY13162.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 344 Score = 44.5 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 68/224 (30%), Gaps = 46/224 (20%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 RL + L + IP I +V+D+S SM Sbjct: 61 RLALMLGCLVLAVLALMQPQIPGRTETVTSSEVSADIMVVLDVSRSMLAD---------- 110 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 D T++A K + ++ +GL+ + R + Sbjct: 111 -----DVAPTRLARAKAEVAELSSALR-------GHRIGLVAFAGRASVLAPLTPDYGFF 158 Query: 277 RQYVTR-DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 R + D S+ T+ A+++A + P K I+ Sbjct: 159 RMILDGVDTKSVSRGGTEIGQALRKAVRSF-------------------DPGPGAKMILL 199 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 +TDGE++ + +A E +++V I + Q L Sbjct: 200 ITDGEDHGGYAEDAAR----EALEAGVRVVAIGFGSEQGSQITL 239 >gi|145527514|ref|XP_001449557.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124417145|emb|CAK82160.1| unnamed protein product [Paramecium tetraurelia] Length = 606 Score = 44.5 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 44/309 (14%), Positives = 94/309 (30%), Gaps = 47/309 (15%) Query: 98 IKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY- 156 +K++ G Y E ++ + + N + V YDY Sbjct: 92 VKDYFSNLFGGDPQYQYKYEPKSQYDDDDDILQQYQKQGGLPVNIEDMIELKVEAQYDYC 151 Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 +L+ E + I I + + V+D SGSM Sbjct: 152 KLKKSEKQIIPAMVSIITKDIEQYAKNNSSIEAGIDLICVIDKSGSMSGQ---------- 201 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 K+ +++ +L+ L+ + + LI + +++ TE Sbjct: 202 ----------KIESVQQSLVQLLNFLSEKD------RLCLIVFDGGAKRHTPLKTLTEGN 245 Query: 277 RQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 ++Y + ++ T+ A+Q + K ++ T+ I Sbjct: 246 KKYFKGAIAAISAGGSTNIAAGTDIAFQQIQQRKMKNQVTS----------------IFL 289 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 L+DG+++ + K + + I + + + K C Y + N Sbjct: 290 LSDGQDSGAAERIQKQK---DRISDVVTIHSFGYGNDHDADLMSKICKVGQGSFYYIENV 346 Query: 396 DSLIHVFQN 404 L F + Sbjct: 347 KLLDEFFAD 355 >gi|209808861|ref|YP_002264399.1| hemolysin-type calcium-binding protein [Aliivibrio salmonicida LFI1238] gi|208010423|emb|CAQ80774.1| hemolysin-type calcium-binding protein [Aliivibrio salmonicida LFI1238] Length = 2890 Score = 44.5 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 39/190 (20%), Positives = 72/190 (37%), Gaps = 19/190 (10%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 S + + GE P LI LV+D SGSM D + S D+ +T+M Sbjct: 2380 TTTVNASPSQDIYADQGGETPTTLISLVLDSSGSM------DHKPFKSDNSNPDQDKTRM 2433 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI-----EPSWGTEKVRQYVTRD 283 + A + LD++ + E+V + L+ + + + W T V+ + Sbjct: 2434 ELVLEASIAMLDNVK-VQEGSEEVKVQLVDFDDQKHSSQDKDVESLGWFT--VQSAIDAL 2490 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR-QGVKIPSL--PFQKFIIFLTDGE 340 +L+ + Y +D + + Q +I ++ + FL+DG+ Sbjct: 2491 NAALVDIAEKDKD--EHFYPKGGTDYEEGIYAVMSGYQDTQITNITGETNDVVYFLSDGD 2548 Query: 341 NNNFKSNVNT 350 NN N+ Sbjct: 2549 NNGGWHGGNS 2558 >gi|149176499|ref|ZP_01855112.1| hypothetical protein PM8797T_29982 [Planctomyces maris DSM 8797] gi|148844612|gb|EDL58962.1| hypothetical protein PM8797T_29982 [Planctomyces maris DSM 8797] Length = 598 Score = 44.5 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 53/340 (15%), Positives = 101/340 (29%), Gaps = 36/340 (10%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 S + F ++ A +++ + + F I + K M +A AA LA A ++ + Sbjct: 19 SRRGAFMVMAAPFLVATMGFMAFGIDIAVITMTKTRMRNAVEAAALAAAQQITDAVQTTA 78 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 D N + ++ Y ++ R + + Sbjct: 79 DGIGGSDNVSGDVQDANSIAIDTARAVAEKVARLNGVYIDPETDVEFGKRYQDSGGTFHM 138 Query: 137 LDSSNNTIFYNMDVMTSYDYRLQ---------FIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + N + + V D + F ++++ S I + Sbjct: 139 VWGENAKPYNVVKVTARKDNATEGQPDSRLQLFFAGFMSEKTAAVTTSAIAFI------- 191 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI--DLL 245 I LV+D SGSM D K+A+ LD I L+ Sbjct: 192 -EARDIVLVLDYSGSMSYDSEFDAMSSYRLG-------------KSAVEANLDDIWETLV 237 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 G + + I GT +L L DS+ +K + Sbjct: 238 DSGATYSDSGKLKFPATGYGRINSEVGTYISSTNDDYIYRALDLDEEDSSGNLKYPFPQ- 296 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL-TDGENNNF 344 + K + ++ +I ++ +DG NN+ Sbjct: 297 --EGKNYYGNLNGEPSGNTNKNLWKNYIKWVRSDGTVNNY 334 >gi|91082533|ref|XP_973629.1| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] gi|270007557|gb|EFA04005.1| hypothetical protein TcasGA2_TC014154 [Tribolium castaneum] Length = 824 Score = 44.5 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 41/247 (16%), Positives = 78/247 (31%), Gaps = 50/247 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I V+D SGSM K+ L A+ L + V Sbjct: 304 IVFVLDHSGSMSGR--------------------KIDQLIEAMQNILTDLKETDLFN-IV 342 Query: 253 YMG--LIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 G + + + + + ++ + T+ ++ A +I+ Sbjct: 343 RFGDLAMVWDVSQNQFTQLPNFNEYGNLEPHLREINLPRAVNGTEEN--IEAAKKIIEDK 400 Query: 309 KKRSFFTNFF---------RQGVKIPSLPFQKFIIFLTDGENNNFKSN----VNTIKICD 355 + + ++ + +Q I+FLTDG N S NT+ + Sbjct: 401 SRLGMTNMMYGLEVGLFLIKRTQEETPDKYQPMIVFLTDGHPNAGMSGRDEITNTVTSLN 460 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE----YHYNVVNAD-SLIHVFQNISQLM- 409 K+ I ++S +R L+ S + Y +A L ++ IS + Sbjct: 461 SGKKK-ASIFSLSF-GDFADKRFLRKISSKNSGFSRHIYESSDASLQLQDFYRAISAPLL 518 Query: 410 --VHRKY 414 V+ KY Sbjct: 519 SNVNFKY 525 >gi|194221585|ref|XP_001495200.2| PREDICTED: collagen, type XXIX, alpha 1 [Equus caballus] Length = 2617 Score = 44.5 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 44/340 (12%), Positives = 110/340 (32%), Gaps = 54/340 (15%) Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNH----------IKESLSGYSAVFYNTEIQNIVN 123 + D + +S + A F+ + +K+ L + +++ Sbjct: 691 EISDAIDRMSLINQNTYTGHALEFVDQYFSHLKGARPGVKKFLILITDGEARDGVRDPAR 750 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + R + + + +N T + S ++++ + L + K++ + AL Sbjct: 751 ALREKEVVIFSVGVYGANRTQLEEISGDGSLVFQVEKFDDL--KAIESKLIFRVCAL--H 806 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + I V+D SGS++ ++ + L + ++ Sbjct: 807 DCKRINLLDIVFVLDHSGSINSQQQ-----------------------ESMMNLTIHLVE 843 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V G + Y+ E + ++ T + A++ A Sbjct: 844 KANVGRGQVQFGALKYSKEPEDLFYLNTFSKGAAITENLRRRRDTYGETYTAKALEHA-- 901 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + F + ++ +I +TDG +++ + +T K + I Sbjct: 902 ----------NSQFTEEHGSRIKQNVKQMLIVITDGVSHDREQLSDTAL---KLRNKGII 948 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 I + + Q L+ +V N D L ++Q Sbjct: 949 IYAVGVG--EADQYELEAIAGDKNNTRHVDNFDKLKDIYQ 986 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 36/230 (15%), Positives = 80/230 (34%), Gaps = 48/230 (20%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + ++ + E I +VD SGS+ + KM K Sbjct: 611 SEVVHSICTEKGCEDMKADIMFLVDSSGSIG-----------------PENFGKM---KT 650 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKP 291 + L I + + +G++ ++T ++ + ++ ++ +D SLI + Sbjct: 651 FMKNLLAKIQIGP---DKTQIGVVQFSTDAKEEFQL--NKYFTQKEISDAIDRMSLINQN 705 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A++ F + + +K +KF+I +TDGE + + Sbjct: 706 TYTGHALE--------------FVDQYFSHLKGARPGVKKFLILITDGEARDGVRDPA-- 749 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +E + I ++ + + Q L+ + V D L + Sbjct: 750 ---RALREKEVVIFSVGVYGANRTQ--LEEISGDGSLVFQVEKFDDLKAI 794 >gi|163758683|ref|ZP_02165770.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] gi|162283973|gb|EDQ34257.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] Length = 587 Score = 44.5 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 72/218 (33%), Gaps = 47/218 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + +V+D S SM ++ + K+ K+ + DL+S+ + V Sbjct: 36 VMIVLDGSNSMWGQVDGE---------------AKITIAKDVMT------DLISNWDDAV 74 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ Y R + + D++ + L + PA+ Q ++ K Sbjct: 75 DLGLMVYGHRRKGD--------------CSDIEVVALPGKVNRPALIDKVQSISPRGKTP 120 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK-----IVTI 367 + + ++ ++DG C +AK I + I Sbjct: 121 ISKTLLLAATSVGYFSGKSSVVLVSDGLETCDADP------CAQAKALGIINPGFDVHVI 174 Query: 368 SINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQN 404 + + + L+ + + + NA+ L + Sbjct: 175 GFDVTEEEFKSLQCIATETGGKFFRANNAEELKDALRR 212 >gi|118355467|ref|XP_001010993.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89292760|gb|EAR90748.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 2033 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 78/218 (35%), Gaps = 45/218 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP + V+D SGSM K+ +KN + + L+ Sbjct: 1601 RPNLDLICVIDNSGSMSGQ--------------------KIENVKNTI------LQLIDM 1634 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILT 306 + ++ + +I + + ++ + ++ + + S+ T+ T ++ A+ IL Sbjct: 1635 LNDNDRLSIITFNSHAQQLCGLRKVNKDNKENLQKITKSIYANGGTNITSGLQTAFSILQ 1694 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI--KICDKAKENFIKI 364 S K+R+ ++ I L+DG++NN S + + + +E I Sbjct: 1695 SRKQRNSVSS----------------IFLLSDGQDNNSDSRIRNLLQTTYQQLQEECFTI 1738 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + +G + + Y V D + F Sbjct: 1739 HSFGFGNDHDGPLMQRIAQIKDGSFYYVERNDQVDEFF 1776 >gi|326326039|ref|YP_004250848.1| hypothetical protein VIBNI_0107 [Vibrio nigripulchritudo] gi|323669090|emb|CBJ93137.1| Protein of unknown function (exported) [Vibrio nigripulchritudo] Length = 1081 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 73/219 (33%), Gaps = 53/219 (24%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D SGSM SDP+++ + S L+ +KE Sbjct: 312 LELAFVLDSSGSMRW---SDPDNIR----------------------IVGSKHLVDRLKE 346 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +I + + + + ++ + S TD + +A + + + Sbjct: 347 VDRGAVIDFDSTAQLLQSLTDNKAVIKSALDLIDAS---GGTDIGDGVSKALEEFANARS 403 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 S + ++ LTDG + ++ + T + + I Sbjct: 404 ASDWA-----------------VVLLTDG-SGSYNHALTTELVQKNIRVLGIT------M 439 Query: 371 ASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNISQL 408 S Q L++ S + +V AD LI VF+ S + Sbjct: 440 GSGANQSLIRGISDSTYGIYQHVNTADELIEVFERFSTI 478 >gi|159899109|ref|YP_001545356.1| FHA domain-containing protein [Herpetosiphon aurantiacus ATCC 23779] gi|159892148|gb|ABX05228.1| FHA domain containing protein [Herpetosiphon aurantiacus ATCC 23779] Length = 785 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 46/304 (15%), Positives = 98/304 (32%), Gaps = 55/304 (18%) Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH-LLNQRYNQKIVSFIPALL 181 + ++ +TH ++ + ++++ + I +L+ + + V +P Sbjct: 30 DKPKVVITHQVVTEKNALSVEAYFSVRYNDGRAVPVNEISSTVLDLKNGEPPVQAVPQDP 89 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 I LV+D SGSM+ + N A K A+ Sbjct: 90 TTP------IKIALVMDQSGSMNPFIEEVKRAANQAIDQAPAN------AKIAV-FTFTR 136 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ++ + VY+ ++ V+ Y+ ++ S T A QA Sbjct: 137 MNSVD-----VYLPAFDFSDDR----------NAVKDYINQNYRSEPGGETCLYTAAHQA 181 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-NTIKICDKAKEN 360 L + K ++ II TDG++ + N + + D + Sbjct: 182 TDFLLNTLK----------------PEERRAIILFTDGKDEDINGNQCSDKTVIDVTTKA 225 Query: 361 FIK------IVTISINASPNGQ---RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I TI + ++ + L+ + + V D + F I + + Sbjct: 226 GPTQGTKTPIYTIGLCSADCARIQPESLRQISENTQAISLVGPRDQMSAAFITIMDSIKN 285 Query: 412 RKYS 415 +KY+ Sbjct: 286 QKYT 289 >gi|149919617|ref|ZP_01908096.1| putative outer membrane adhesin like protein [Plesiocystis pacifica SIR-1] gi|149819560|gb|EDM78988.1| putative outer membrane adhesin like protein [Plesiocystis pacifica SIR-1] Length = 1168 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 63/225 (28%), Gaps = 47/225 (20%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL----IELVVDLSGSMHCAMNSD 210 D L F + N S + + F + LV+D S SM S Sbjct: 544 DLDLSFF--MPAGTPNSDPRSPLTDCVTHTATVHHDFDAYDTVALVLDRSKSMDQDQGS- 600 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 R ++ + A+L + D + V L + + Sbjct: 601 --------------RKRIEWAQRAILKWTDHV--ADGGS--VQASLRKFN---QDAPPAV 639 Query: 271 WGTEKVRQYVTRDMDSLILKPT-------DSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 +G + V V + + T D P A D + Sbjct: 640 FGFKTVLDAVVGGETATEIDSTAIEEYLEDIEPDGSTA----IGDAIDAAVAALMAHDDL 695 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICD---KAKENFIKIV 365 P+ I +TDGE + + +CD A ++ + + Sbjct: 696 DPNSSNNNAIFLITDGEQTSGDKD-----VCDALEDAAKDDVPVY 735 >gi|254787807|ref|YP_003075236.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237686979|gb|ACR14243.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 689 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 74/239 (30%), Gaps = 51/239 (21%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIP--------ALLRIEMGERPIFLIELVVDLSGSMHC 205 YDY L + I + + + + P + ++D+SGSM Sbjct: 282 YDYPLPSAATAPFKPTITVIPAPWNQAKRLVHIGIKALPLAHPPKANLVFLLDVSGSMG- 340 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 K+ +K ++ L L + V VY G G V + Sbjct: 341 ------------------SPDKLPLVKQSMELLLSGLQPTDTVSIVVYAGAAG---TVLE 379 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + ++ + T ++ AYQ+ ++ +R Sbjct: 380 PTP----VAEQQKILAALDRLNAGGSTAGAQGIELAYQLAEANYQRDAVNR--------- 426 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLLKTCV 383 II TDG+ N ++ +K ++ + N I++ + + L++ Sbjct: 427 -------IILATDGDFNVGIADPEQLKGYVERKRANGIELSILGFGSGNYNDALMQQLA 478 >gi|162454787|ref|YP_001617154.1| hypothetical protein sce6505 [Sorangium cellulosum 'So ce 56'] gi|161165369|emb|CAN96674.1| putative membrane protein [Sorangium cellulosum 'So ce 56'] Length = 384 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 51/149 (34%), Gaps = 37/149 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +V+D S SM+ +D + +++ K + + + E Sbjct: 90 DVVVVLDYSKSMYA---------------RDVEPSRIFRAKVEVARLIKDL-------EG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 G + + + G + + D + + + T A+ QA ++L D K Sbjct: 128 ARFGAVAFAGEPMGFPLTADGAAIAQFFRQLDPNDMPIGGTAIARALDQANELLKRDPK- 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 S ++ I+ +TDGE Sbjct: 187 --------------SAEHKRIILLVTDGE 201 >gi|156741667|ref|YP_001431796.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156232995|gb|ABU57778.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 826 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 40/282 (14%), Positives = 88/282 (31%), Gaps = 74/282 (26%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKR--------------TKMAALKNALLLFLD 240 L++D+SGSM + C + + ++ K L F+ Sbjct: 408 LILDVSGSMSWTFDGRGVQNGQTVFCTNPSQGCVSVQTAWPNAQERRIYTAKQVLRSFVA 467 Query: 241 SI--DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 I D S ++ + L+ ++ R+ + S + + + L T+ + Sbjct: 468 QIDQDRQSGLRPYDTVRLVTFSGRLGSFVNSSGAVGDNNRALNDLTEVLPAGWTNDRATL 527 Query: 299 KQAYQI--------LTSDKKRSFFTNFFRQGVKIPSLP--------FQKFIIFLTDG--- 339 + A + F R + P +++ +IF+TDG Sbjct: 528 EAAINSAGMVDGDPYMTAGATPSAVAFARASQVFANAPERAPNGMKYRRVVIFVTDGVAN 587 Query: 340 ------ENNN-------------------FKSNVNTIKI----CDKAKENFIK-----IV 365 +NN ++ + KE +I+ + Sbjct: 588 VLRNGMQNNYGEGCQLGAENVGCQMGDPLPDGSLRPLNAMVAEAQALKEAYIRPSDGSVY 647 Query: 366 TISINAS--PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 ++++ + G L+ S P+Y ++ L +F +I Sbjct: 648 VVALSGTFEATGLNLV---ASQPDYVKRADRSEELQQIFDDI 686 >gi|332833576|ref|XP_003312497.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Pan troglodytes] Length = 728 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 82 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 115 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 116 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 170 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 171 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 222 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 223 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 264 >gi|312434033|ref|NP_116206.4| inter-alpha-trypsin inhibitor heavy chain H5 isoform 2 [Homo sapiens] Length = 728 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 82 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 115 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 116 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 170 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 171 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 222 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 223 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 264 >gi|310703621|ref|NP_085046.5| inter-alpha-trypsin inhibitor heavy chain H5 isoform 1 precursor [Homo sapiens] Length = 942 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 296 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 329 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 384 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 385 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|291229807|ref|XP_002734862.1| PREDICTED: polydom-like [Saccoglossus kowalevskii] Length = 1730 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 73/229 (31%), Gaps = 56/229 (24%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + ++D S S+ A A K + L +S Sbjct: 31 QGKSDLVFLLDRSASVGSA--------------------NFEAEKGFVESLLGQF-SISP 69 Query: 248 VKEDVYMGLIGYTTRVEK---NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + ++ Y+ V + I Q + R + T++ A+++A I Sbjct: 70 AS--TRVDVVSYSEDVVRHIDYIREPKNKCHFSQDI-RHVTYRNSGKTNTNGALQEARNI 126 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 ++ K ++ L+DG++N T ++ +++ ++I Sbjct: 127 FVGSRQDVH-----------------KVVVLLSDGQSNTGGDPTTTA---EELRQDGVEI 166 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 TI+I + L + +S + + F+ +L + Sbjct: 167 FTIAIGLFNKDE--LNSIATSDHHTFE-------YSSFREFKKLASRIR 206 >gi|187609608|sp|Q86UX2|ITIH5_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor Length = 942 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 296 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 329 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 384 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 385 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|119606788|gb|EAW86382.1| inter-alpha (globulin) inhibitor H5, isoform CRA_d [Homo sapiens] Length = 735 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 296 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 329 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 384 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 385 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|119606787|gb|EAW86381.1| inter-alpha (globulin) inhibitor H5, isoform CRA_c [Homo sapiens] Length = 748 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 102 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 135 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 136 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 190 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 191 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 242 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 243 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 284 >gi|119606785|gb|EAW86379.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Homo sapiens] gi|168275576|dbj|BAG10508.1| inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1 [synthetic construct] Length = 942 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 296 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 329 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 384 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 385 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|55958059|emb|CAI12954.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958529|emb|CAI16361.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] Length = 577 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 171 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 204 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 205 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 259 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 260 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 311 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 312 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 353 >gi|55958058|emb|CAI12953.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958528|emb|CAI16360.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] Length = 742 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 82 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 115 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 116 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 170 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 171 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 222 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 223 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 264 >gi|37181977|gb|AAQ88792.1| LLLL311 [Homo sapiens] Length = 694 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 296 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 329 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 384 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 385 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|30314037|gb|AAO49812.1| inter-alpha trypsin inhibitor heavy chain precursor 5 [Homo sapiens] Length = 942 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 296 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 329 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 384 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 385 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|55958060|emb|CAI12955.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958531|emb|CAI16363.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|189442558|gb|AAI67770.1| Inter-alpha (globulin) inhibitor H5 [synthetic construct] Length = 956 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 296 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 329 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 384 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 385 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|49355778|ref|NP_001001851.1| inter-alpha-trypsin inhibitor heavy chain H5 isoform 3 precursor [Homo sapiens] gi|119606789|gb|EAW86383.1| inter-alpha (globulin) inhibitor H5, isoform CRA_e [Homo sapiens] Length = 702 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 296 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 329 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 384 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 385 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|50949741|emb|CAH10363.1| hypothetical protein [Homo sapiens] Length = 460 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 79 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 112 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 113 RFSIIGFSNRIKVRKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 167 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 168 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 219 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 220 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 261 >gi|18916771|dbj|BAB85539.1| KIAA1953 protein [Homo sapiens] Length = 824 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 178 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 211 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 212 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 266 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 267 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 318 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 319 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 360 >gi|310643461|ref|YP_003948219.1| protein [Paenibacillus polymyxa SC2] gi|309248411|gb|ADO57978.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2] Length = 696 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 41/266 (15%), Positives = 83/266 (31%), Gaps = 60/266 (22%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE-----LVVDLSGSMHCAMNSDPE 212 + F+ +L + + I + + IE V+D S SM ++DPE Sbjct: 10 ITFMTFVLFFQTG--LSGVICTVNQANAASLGTASIEGYDAVFVLDTSYSMR---DTDPE 64 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SW 271 + + ++ + +G + Y V + S Sbjct: 65 GIAAE-----------------VISMFMDLSDADR----TRVGFVAYNHHVVASKPLTSI 103 Query: 272 GTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 G + + +++ L TD +++ + G P Sbjct: 104 GVAAQKSQIQQEIRMLNRSGYTDLGLGLRK-------------GSELLAAGASQGRQP-- 148 Query: 331 KFIIFLTDGEN---------NNFKSNVNTIKICDKAKENFIKIVTISIN--ASPNGQRLL 379 F+I L+DGE + SN + + A+ + TI +N + N Q L Sbjct: 149 -FMILLSDGETDFGVSSGSRSKGDSNNDVSSVIKSAQTKGYPVYTIGLNHDGTVNRQELE 207 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + +A+ L + I Sbjct: 208 RIASQTGGASFITSSAEDLPEILNRI 233 >gi|194223903|ref|XP_001494710.2| PREDICTED: integrin, alpha 1 [Equus caballus] Length = 1208 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 76/199 (38%), Gaps = 25/199 (12%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + K+ D ++ + + +G++ Y V + V + + Sbjct: 212 IYPWKSVTDFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL--NKYSSTEEVLVAANKI 269 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T A I T+ K+ R G +K ++ +TDGE+++ Sbjct: 270 VQRGGRQT---MTALGIDTARKEAFTEARGARHG-------VKKVMVIVTDGESHDNHQL 319 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADS 397 I+ C+K I+ +I+I N L +K+ S P ++ +NV + + Sbjct: 320 NQVIQDCEK---QNIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELA 376 Query: 398 LIHVFQNISQLMVHRKYSV 416 L+ + + + + + + +V Sbjct: 377 LVTIVEALGERIFALEATV 395 >gi|145502983|ref|XP_001437469.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404619|emb|CAK70072.1| unnamed protein product [Paramecium tetraurelia] Length = 562 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 62/177 (35%), Gaps = 36/177 (20%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + +I + P I ++ D+SGSM D K+ ++M A+ Sbjct: 103 VTESPQIIQQQDPAEAIVVLYDISGSMSSQFFGD------------KELSRMGAVNAFFS 150 Query: 237 LFLDS--IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 F D +H+ + V+ G + T +++ D+ T Sbjct: 151 AFADKTLAFEFNHIVKLVWFGSTLFDKCEF--------TSDFNKFIKLVDDANPGGSTKC 202 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 A+ A L K+ K P + + I+ LTDGE+N S NT+ Sbjct: 203 YDAIDYAINKLLEVKQ------------KYPDIVLR--ILALTDGEDNASASKPNTL 245 >gi|302143246|emb|CBI20541.3| unnamed protein product [Vitis vinifera] Length = 630 Score = 44.5 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 30/224 (13%), Positives = 77/224 (34%), Gaps = 50/224 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +R + V+D+SGSM +K++ LK A+ + ++ Sbjct: 201 DRAPIDLVAVLDVSGSMAG--------------------SKLSLLKRAVCFLIQNLGPSD 240 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQIL 305 + ++ +++ + ++ R+ ++SL T+ +K+ ++L Sbjct: 241 ------RLSIVSFSSTARRIFPLRRMSDNGREAAGLAINSLTSSGGTNIVEGLKKGVRVL 294 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI--CDKA-----K 358 +++ + II L+DG++ NVN + C + + Sbjct: 295 EERSEQNPVAS----------------IILLSDGKDTYNCDNVNRRQTSHCASSNPRQGR 338 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + I + T + + + S + + ++ F Sbjct: 339 QAIIPVHTFGFGSDHDSTAMHAISDESGGTFSFIESVATVQDAF 382 >gi|195941051|ref|ZP_03086433.1| von Willebrand factor, type A [Escherichia coli O157:H7 str. EC4024] Length = 325 Score = 44.5 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 78/227 (34%), Gaps = 46/227 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+P+ I L++D+SGSM T++ A++ ++ F+ + Sbjct: 92 EKPMRNIMLILDVSGSME-------------KNDVAGGLTRLQAVQQSVKKFVAARKSD- 137 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +GL+ + S + + +++ + + T A+ ++L Sbjct: 138 ------RIGLVIFANSAWPFAPVSEDKQALETRISQLTPGMAGQQTAIGDALGVTVKLLD 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S K I LTDG + + A + +++ T Sbjct: 192 S----------------TGDKEASKLAILLTDGNDTASQLTPRLAAQ--LAVSHHVQLHT 233 Query: 367 ISI-NASPNG-----QRLLKTCVS-SPEYHYNVVNA-DSLIHVFQNI 405 I+ + + +G LL+ + + N+ SL V++ I Sbjct: 234 IAFGDVNSSGDDKVDLNLLQDLARMTGGRSWTAENSGASLDAVWKEI 280 >gi|134093165|gb|ABO53025.1| matrilin 4 isoform 1 precursor, 5 prime [Chlorocebus aethiops] Length = 214 Score = 44.5 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 63/169 (37%), Gaps = 22/169 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + L +G+I Y+++V+ ++ + + RD+ L T + Sbjct: 57 LVGLLRGLKVGANATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQG-TMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ + S + G + P + + +TDG + ++ + Sbjct: 116 AIQYVMNVAFSVAE----------GARPPEERVPRVAVIVTDG---RPQD--RVAEVAAQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 A+ I+I + + + G L+ S P E+ + V + LI F Sbjct: 161 ARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESF-DLIQEFG 206 >gi|118464548|ref|YP_883428.1| hypothetical protein MAV_4290 [Mycobacterium avium 104] gi|118165835|gb|ABK66732.1| conserved hypothetical protein [Mycobacterium avium 104] Length = 335 Score = 44.5 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 75/234 (32%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D ++AA K A F D + + Sbjct: 99 VMLVIDVSESMAS---------------TDVPPNRLAAAKEAGKQFADQLTPA------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + P+ V+ + + T + + A Q + + Sbjct: 138 NLGLVEFAANATLLVPPTTNRAAVKAGIDSLQPAPK---TATGEGIFTALQAIAT----- 189 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNTIKICDKAKENFIKIVTISI 369 + P I+ +DG N + + AK ++I TIS Sbjct: 190 -----VGSVMGGGEGPPPARIVLESDGAENVPLDPNAPQGAFTAARAAKAEGVQISTISF 244 Query: 370 N--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + Q L K C + ++ + DSL +V+ + + + Sbjct: 245 GTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADSLDSLKNVYSTLQRQI 298 >gi|41409533|ref|NP_962369.1| hypothetical protein MAP3435c [Mycobacterium avium subsp. paratuberculosis K-10] gi|81570936|sp|Q73UD3|Y3435_MYCPA RecName: Full=UPF0353 protein MAP_3435c gi|41398364|gb|AAS05985.1| hypothetical protein MAP_3435c [Mycobacterium avium subsp. paratuberculosis K-10] Length = 335 Score = 44.5 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 75/234 (32%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D ++AA K A F D + + Sbjct: 99 VMLVIDVSESMAS---------------TDVPPNRLAAAKEAGKQFADQLTPA------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + P+ V+ + + T + + A Q + + Sbjct: 138 NLGLVEFAANATLLVPPTTNRAAVKAGIDSLQPAPK---TATGEGIFTALQAIAT----- 189 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNTIKICDKAKENFIKIVTISI 369 + P I+ +DG N + + AK ++I TIS Sbjct: 190 -----VGSVMGGGEGPPPARIVLESDGAENVPLDPNAPQGAFTAARAAKAEGVQISTISF 244 Query: 370 N--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + Q L K C + ++ + DSL +V+ + + + Sbjct: 245 GTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADSLDSLKNVYSTLQRQI 298 >gi|126153367|gb|AAI31711.1| MATN1 protein [Homo sapiens] Length = 480 Score = 44.1 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 51/145 (35%), Gaps = 17/145 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I+ L +G++ Y + V++ K + T + A++ A Sbjct: 52 IESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFA 111 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 D + G + S K +I +TDG + +V+ +A+ + Sbjct: 112 ITKAFGDAE----------GGRSRSPDISKVVIVVTDGRPQDSVQDVS-----ARARASG 156 Query: 362 IKIVTISINASPNGQRLLKTCVSSP 386 +++ I + + + L+ S P Sbjct: 157 VELFAIGVGSVD--KATLRQIASEP 179 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 82/223 (36%), Gaps = 40/223 (17%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKED 251 +NSD + N T + L + + F+ I D L + Sbjct: 236 GFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFISQIVDTLDVSDKL 295 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDK 309 +GL+ Y++ V + G ++ + + S + K T + A+K D Sbjct: 296 AQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----LIDN 348 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + K+ I+F TDG + ++ ++ KAK+ K+ + + Sbjct: 349 SFTVSSGARPGAQKVG-------IVF-TDGRSQDYINDAA-----KKAKDLGFKMFAVGV 395 Query: 370 -NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 NA + L+ S P E+++ + ++ + + + + + Sbjct: 396 GNAVEDE---LREIASEPVAEHYFYTADFKTINQIGKKLQKKI 435 >gi|323345325|ref|ZP_08085548.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] gi|323093439|gb|EFZ36017.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] Length = 332 Score = 44.1 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 44/246 (17%), Positives = 69/246 (28%), Gaps = 72/246 (29%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D ++ A K F+ Sbjct: 88 IDIMLAMDVSTSMLAE---------------DLHPNRIEAAKAVAAEFI----AGRPNDN 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVR-----QYVTRDMDS--LILKPTDSTPAMKQAYQ 303 +GL + + T+ Q V D+ + LI T + A Sbjct: 129 ---IGLTIFAGEAFTQCPMT--TDHASLLNLLQNVRTDIAARGLIQDGTAVGMGLANAVS 183 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K +S K +I LTDG NN + T AK I+ Sbjct: 184 RLKDSKAKS------------------KVVILLTDGSNNMGDLSPMTSAQ--IAKSLGIR 223 Query: 364 IVTISINA--------SPNG-------------QRLLKTCVSSPEYHYNVVNADSLIHVF 402 + TI + G + L ++ Y N L ++ Sbjct: 224 VYTIGVGTNKVARYPMPVTGGIQYVNIPVEIDTKTLSDIAATTDGNFYRATNNRELKQIY 283 Query: 403 QNISQL 408 +I +L Sbjct: 284 NDIDKL 289 >gi|315181668|gb|ADT88581.1| Large exoprotein [Vibrio furnissii NCTC 11218] Length = 3149 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 59/162 (36%), Gaps = 20/162 (12%) Query: 191 FLIELVVDLSGSMHCAM-NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I L++D SGSM + +D N ++M + +AL + V Sbjct: 2537 YNIALIIDTSGSMKFDLAGNDNGFSNRYQSQSQYNASRMKLVIDALTNLATDLVNHDGV- 2595 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + + LIG+ + + ++ ++Q +T D T+ A A Sbjct: 2596 --ININLIGFESSAHSALTLQLTSDNLQQLLTEIQDMDAEGGTNYEAAFDLA-------- 2645 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 +N+F P+ ++ FLTDG+ S N Sbjct: 2646 -----SNWFSHQ---PTEGYENLTYFLTDGDPTFSNSGDNGA 2679 >gi|242066912|ref|XP_002454745.1| hypothetical protein SORBIDRAFT_04g036560 [Sorghum bicolor] gi|241934576|gb|EES07721.1| hypothetical protein SORBIDRAFT_04g036560 [Sorghum bicolor] Length = 737 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 39/313 (12%), Positives = 87/313 (27%), Gaps = 54/313 (17%) Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 + F + E +++ ++ F + TS + + HL Sbjct: 212 DPSTFNDDEPLDLLCEEANDTQQGCLRTVEIKTYPEFTEVPENTS-ERNFTVLIHLKAPL 270 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 S R + V+D+SGSM TK+ Sbjct: 271 AQHLQPSSNLGDGNGLSTTRAPVDLITVLDVSGSMAG--------------------TKL 310 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSL 287 A LK A+ + ++ + +I +++ + T +Q + Sbjct: 311 ALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRQQSLLAVNSLT 364 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ +++ +++ + ++ + II L+DG++ S Sbjct: 365 SNGGTNIAEGLRKGSKVIEERQAKNPVCS----------------IILLSDGQDTYTVSP 408 Query: 348 VNTIK-----ICD----KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + C I + A + L +S + ++ Sbjct: 409 TAGVHKGAPEYCALLPSTNGNQQIPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAI 468 Query: 399 IHVFQN-ISQLMV 410 F I L+ Sbjct: 469 QDAFAQCIGGLLS 481 >gi|162456414|ref|YP_001618781.1| hypothetical protein sce8131 [Sorangium cellulosum 'So ce 56'] gi|161166996|emb|CAN98301.1| hypothetical protein sce8131 [Sorangium cellulosum 'So ce 56'] Length = 507 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 78/230 (33%), Gaps = 44/230 (19%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 +PA R + L+VD SGSM KM + A Sbjct: 108 VPAARAARGQPRAPAAVVLLVDASGSMQGP--------------------KMENARAAAQ 147 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDST 295 F+D + V + + + + P+ R V R + +L T+ Sbjct: 148 AFVDRLPDGDLVS------VASFADTAQARVAPTVLGRSTRPAVARAIAALGPDGSTNLF 201 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KIC 354 +K A Q + PS + ++ ++DG+ N S+ + + + Sbjct: 202 AGLKLAEQ----------------HALAAPSTHAVRRVVLISDGQANIGPSSPDILGALA 245 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + ++I +I + A + + L V S Y++ A + V + Sbjct: 246 QRGAAHGVQITSIGVGADYDERTLNALAVGSSGRLYHLTEAREMSSVLER 295 >gi|328951307|ref|YP_004368642.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451631|gb|AEB12532.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 744 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 38/252 (15%), Positives = 85/252 (33%), Gaps = 49/252 (19%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 S + + + R + LV+D+SGSM D +K L A Sbjct: 318 SPLADEIPVRPAGRSGVALVLVLDVSGSMA-----------------DGNPSK---LALA 357 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TD 293 + L ++ +G++ +++ P T + + +D L T Sbjct: 358 VAGALSLVETARPED---RLGIVTFSSGPRWLFPPRPMTARGKLEAKTLLDRLRPGGSTR 414 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A +QA + L + + + K I+ LTDG+ + + + Sbjct: 415 MLEAYRQAIEALEALELET------------------KQILVLTDGQVEEDPAALVALAE 456 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-----QNISQL 408 A+ I+ ++++ + L + ++V + L +F + + Sbjct: 457 --AARAQGIRTNSVALGGDADRALLARMSRVGEGRFWDVPTPEDLPRLFLEEAERTFGRE 514 Query: 409 MVHRKYSVILKG 420 + ++ V L+ Sbjct: 515 ALEGRFPVRLEA 526 >gi|116622066|ref|YP_824222.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225228|gb|ABJ83937.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 309 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 38/245 (15%), Positives = 87/245 (35%), Gaps = 60/245 (24%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + +R+ E + LV+D SGSM R KMA++ A Sbjct: 64 VRQSIRLFSHEDIPVTVGLVIDHSGSM---------------------RPKMASVIAAAR 102 Query: 237 LFLDSID-----LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 F+ S + + EDV +GL + + ++ S Sbjct: 103 TFIQSSSPEDQMFVVNFNEDVTLGLSTEIPFTNRPEDLTYAISH----------SPPTGK 152 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T A+ +A + +G + +K ++ ++DG +N ++ Sbjct: 153 TALYDAVWKA-------------REWVARGSRD-----KKVLVVVSDGGDNASTHTLS-- 192 Query: 352 KICDKAKENFIKIVTISI---NASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQ 407 +I + A ++ I++ TI I + +L+ ++ + ++ + ++I++ Sbjct: 193 EILEAANKSNIQVFTIGIFDPDDPDKNPGVLRQLARATGGEAFVPDELSEVVAICESIAK 252 Query: 408 LMVHR 412 + + Sbjct: 253 DIRSQ 257 >gi|318604213|emb|CBY25711.1| protein TadG, associated with Flp pilus assembly [Yersinia enterocolitica subsp. palearctica Y11] Length = 457 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 51/282 (18%), Positives = 93/282 (32%), Gaps = 72/282 (25%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 +H + F + K +E+ I F + F+ LI + + +K + A A Sbjct: 8 IHKFNHFTLFKK----NEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQA 63 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 LA + + + D + I N+A + +++ L + I N Sbjct: 64 TLA----LTIENNEIPDEPQQIKNNA----------LVLSYVNAYLPSKK---FLVPIIN 106 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI------V 174 I +++ + YN V +Y +F+ + V Sbjct: 107 INDNTHY----------------LEYNAAVTMAY--PAKFLSQSPFTNTISDMNITDNGV 148 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + + IF V D SGSM N + + ++ AL++A Sbjct: 149 AIKNKAIEASEPTDVIF----VADYSGSMLYNFN----------ENKPRDHERIDALRSA 194 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 D I S++ IGY I SWGT+++ Sbjct: 195 FRKLHDIIMDNSNIN------AIGY-------IPFSWGTKRI 223 >gi|221068121|ref|ZP_03544226.1| outer membrane adhesin like proteiin [Comamonas testosteroni KF-1] gi|220713144|gb|EED68512.1| outer membrane adhesin like proteiin [Comamonas testosteroni KF-1] Length = 1268 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 19/109 (17%), Positives = 37/109 (33%), Gaps = 12/109 (11%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + +V+DLSGSM N P ++++ K AL ++ + DV Sbjct: 851 VMIVLDLSGSMAWDSNG-----KVLPGGGSNANSRLSLAKKALEALINKYE----EYGDV 901 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + L+ + +W + + + T A+ A Sbjct: 902 AVKLVTFNGSTANAHA-TWMSAATAIAIINGL--TATGGTPYKAALNAA 947 >gi|322436225|ref|YP_004218437.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321163952|gb|ADW69657.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 304 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 58/194 (29%), Gaps = 41/194 (21%) Query: 229 AALKNALLLFLDSIDLLS--HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 A K + L D + V ++ +T G ++R+ Sbjct: 98 EAGKKFVRALLREQDEFDLMDFSDTVRE-VVSFTND---KKRIENGLNELRK-------- 145 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T A+ A Q L ++ ++ +TDG+N Sbjct: 146 --GDATAVYDAVYLASQRLGETNAGGGR---------------RRVLVLITDGDN--TVH 186 Query: 347 NVNTIKICDKAKENFIKIVT---ISINASP-----NGQRLLKTCVSSPEYHYNVVNADSL 398 V + ++A+ + + + I A L++ + +Y V + L Sbjct: 187 GVGYDQAVEQAQRAGVMVYALIVVPIEADAGRNTGGEHALIQMATDTGGNYYYVNDPRDL 246 Query: 399 IHVFQNISQLMVHR 412 V+ +S + + Sbjct: 247 AKVYAKVSDDLRTQ 260 >gi|119946440|ref|YP_944120.1| von Willebrand factor, type A [Psychromonas ingrahamii 37] gi|119865044|gb|ABM04521.1| von Willebrand factor, type A [Psychromonas ingrahamii 37] Length = 327 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 35/240 (14%), Positives = 77/240 (32%), Gaps = 55/240 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ++ + + +DLSGSM F+ Sbjct: 82 QQQSRDMIISLDLSGSMQEVDMPLNGQTVDRLTLLKDLL----------KTFIKQ----- 126 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + ++ + ++Q V L T ++ A + Sbjct: 127 --RQGDRLGLILFADHAYLQTPLTFDLKTIQQMVDESEIGLAGTRTAIGESIAMAIKRFV 184 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 +K Q+ +I ++DG NN+ + I+ +A +N I I T Sbjct: 185 ENKNE------------------QRVLILVSDGANNSGS--IEPIQAAKQAAKNNITIYT 224 Query: 367 ISI------------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I + +A + + L + + ++ N L +++Q +++L Sbjct: 225 IGMGAEQMIKRGLFGNQRINPSADLDEKTLTEIANLTGGKYFRARNQTELQNIYQTLNKL 284 >gi|121637412|ref|YP_977635.1| hypothetical protein BCG_1543 [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224989887|ref|YP_002644574.1| hypothetical protein JTY_1518 [Mycobacterium bovis BCG str. Tokyo 172] gi|166979775|sp|A1KIS1|Y1543_MYCBP RecName: Full=UPF0353 protein BCG_1543 gi|254800546|sp|C1ANC7|Y1518_MYCBT RecName: Full=UPF0353 protein JTY_1518 gi|121493059|emb|CAL71530.1| Probable membrane protein [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224773000|dbj|BAH25806.1| hypothetical protein JTY_1518 [Mycobacterium bovis BCG str. Tokyo 172] Length = 335 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 72/234 (30%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D + ++M A + A F D + + Sbjct: 99 VMLVIDVSQSM---------------RATDVEPSRMVAAQEAAKQFADELTPG------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ E + + + + T + A+ A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR---TATGEAIFTALQAIATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICD-KAKENFIKIVTISI 369 + P I+ +DG+ N AK+ + I TIS Sbjct: 194 ---------IGGGDTPPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISF 244 Query: 370 NAS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P +K S YN L V+ ++ Q + Sbjct: 245 GTPYGFVEIDDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELRAVYSSLQQQI 298 >gi|298372684|ref|ZP_06982674.1| BatA protein [Bacteroidetes oral taxon 274 str. F0058] gi|298275588|gb|EFI17139.1| BatA protein [Bacteroidetes oral taxon 274 str. F0058] Length = 326 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 37/252 (14%), Positives = 70/252 (27%), Gaps = 64/252 (25%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 I + +D+SGSM D ++ A K+ + F Sbjct: 74 TSDSYSNSTTEGINIVIAMDISGSMLAR---------------DLSPNRLEAAKDVGIEF 118 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + L GL+ + + + + +I T + Sbjct: 119 I----LSRPNDNF---GLVVFAGESFTQCPITSNHASLVNLFKQVDFGIIQDGTAIGLGL 171 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A + + +S K II LTDG NN ++ I A+ Sbjct: 172 ATAINRIKDAEGKS------------------KVIILLTDGTNNT--GDIAPISAAQIAQ 211 Query: 359 ENFIKIVTISI------NASPNGQ----------------RLLKTCVSSPEYHYNVVNAD 396 I++ TI + Q L + ++ ++ N Sbjct: 212 SYGIRVYTIGVGTQGIAEVPMLDQFGNIHYTEAEVVIDETTLQQIASTTGGKYFRATNVS 271 Query: 397 SLIHVFQNISQL 408 SL ++ I ++ Sbjct: 272 SLKQIYSEIDKM 283 >gi|119628047|gb|EAX07642.1| matrilin 1, cartilage matrix protein, isoform CRA_a [Homo sapiens] Length = 496 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 51/145 (35%), Gaps = 17/145 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I+ L +G++ Y + V++ K + T + A++ A Sbjct: 68 IESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 D + G + S K +I +TDG + +V+ +A+ + Sbjct: 128 ITKAFGDAE----------GGRSRSPDISKVVIVVTDGRPQDSVQDVS-----ARARASG 172 Query: 362 IKIVTISINASPNGQRLLKTCVSSP 386 +++ I + + + L+ S P Sbjct: 173 VELFAIGVGSVD--KATLRQIASEP 195 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 82/223 (36%), Gaps = 40/223 (17%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKED 251 +NSD + N T + L + + F+ I D L + Sbjct: 252 GFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFISQIVDTLDVSDKL 311 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDK 309 +GL+ Y++ V + G ++ + + S + K T + A+K D Sbjct: 312 AQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----LIDN 364 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + K+ I+F TDG + ++ ++ KAK+ K+ + + Sbjct: 365 SFTVSSGARPGAQKVG-------IVF-TDGRSQDYINDAA-----KKAKDLGFKMFAVGV 411 Query: 370 -NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 NA + L+ S P E+++ + ++ + + + + + Sbjct: 412 GNAVEDE---LREIASEPVAEHYFYTADFKTINQIGKKLQKKI 451 >gi|330862285|emb|CBX72446.1| hypothetical protein YEW_HH31780 [Yersinia enterocolitica W22703] Length = 457 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 52/282 (18%), Positives = 93/282 (32%), Gaps = 72/282 (25%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 +H + F + K +E+ I F + F+ LI + + +K + A A Sbjct: 8 IHKFNHFTLFKK----NEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQA 63 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 LA A + + D + I N+A + +++ L + I N Sbjct: 64 TLALAI----ENNEIPDEPQQIKNNA----------LVLSYVNAYLPSKK---FLVPIIN 106 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI------V 174 I +++ + YN V +Y +F+ + V Sbjct: 107 INDNTHY----------------LEYNAAVTMAY--PAKFLSQSPFTNTISDMNITDNGV 148 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + + IF V D SGSM N + + ++ AL++A Sbjct: 149 AIKNKAIEASEPTDVIF----VADYSGSMLYNFN----------ENKPRDHERIDALRSA 194 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 D I S++ IGY I SWGT+++ Sbjct: 195 FRKLHDIIMDNSNIN------AIGY-------IPFSWGTKRI 223 >gi|148645283|gb|ABR01165.1| complement factor B [Ovis aries] Length = 761 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 49/270 (18%), Positives = 91/270 (33%), Gaps = 49/270 (18%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 YD + E L + + + A GE+ ++V+D SGSM+ + D D Sbjct: 224 YDTPAEVAEAFL--SSLTETIEGVDAEDGHSPGEQQK--RKIVLDPSGSMNIYLVLDGSD 279 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE--------K 265 A KN L F++ + + GL+ Y T + K Sbjct: 280 SVGAHNFTG--------AKNCLRDFIEKVASYGVKPKY---GLVTYATEPKVLIKVFDPK 328 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT-SDKKRSFFTNFFRQGVKI 324 + E W TE++ + D + T++ A+ + Y +++ N R Sbjct: 329 SSEADWVTEQLNR--INYADHKLKAGTNTKRALLEVYNMMSRDINNLKETWNRTRH---- 382 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTI----KICDKAKEN--------FIKIVTISINAS 372 II +TDG +N V I + D + I + + + Sbjct: 383 -------VIIIMTDGLHNMGGDPVTVIHDIRYLLDIGRNRKNPREDYLDIYVFGVGPLVN 435 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 L + ++ + + ++L VF Sbjct: 436 QENINALASKKDKEQHVFKLQGMENLEDVF 465 >gi|13529371|gb|AAH05429.1| Matn2 protein [Mus musculus] Length = 956 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 61/164 (37%), Gaps = 21/164 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 ID L+ + +GL+ Y+T+V G ++ + + + Sbjct: 678 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLR-GFSSAKEMKKAVAHMKYMGKGSMTGL 736 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K ++ + + G + PS + I TDG + +V+ + K Sbjct: 737 ALKHMFERSFTQVE----------GARPPSTQVPRVAIVFTDG---RAQDDVS--EWASK 781 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 AK N I + + + + + L+ S P ++ + + ++ Sbjct: 782 AKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTM 823 >gi|4505111|ref|NP_002370.1| cartilage matrix protein precursor [Homo sapiens] gi|115556|sp|P21941|MATN1_HUMAN RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|1732121|gb|AAB38702.1| cartilage matrix protein [Homo sapiens] gi|56205026|emb|CAI19322.1| matrilin 1, cartilage matrix protein [Homo sapiens] gi|182887817|gb|AAI60064.1| Matrilin 1, cartilage matrix protein [synthetic construct] gi|189066540|dbj|BAG35790.1| unnamed protein product [Homo sapiens] Length = 496 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 51/145 (35%), Gaps = 17/145 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I+ L +G++ Y + V++ K + T + A++ A Sbjct: 68 IESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 D + G + S K +I +TDG + +V+ +A+ + Sbjct: 128 ITKAFGDAE----------GGRSRSPDISKVVIVVTDGRPQDSVQDVS-----ARARASG 172 Query: 362 IKIVTISINASPNGQRLLKTCVSSP 386 +++ I + + + L+ S P Sbjct: 173 VELFAIGVGSVD--KATLRQIASEP 195 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 82/223 (36%), Gaps = 40/223 (17%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKED 251 +NSD + N T + L + + F+ I D L + Sbjct: 252 GFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFISQIVDTLDVSDKL 311 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDK 309 +GL+ Y++ V + G ++ + + S + K T + A+K D Sbjct: 312 AQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----LIDN 364 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + K+ I+F TDG + ++ ++ KAK+ K+ + + Sbjct: 365 SFTVSSGARPGAQKVG-------IVF-TDGRSQDYINDAA-----KKAKDLGFKMFAVGV 411 Query: 370 -NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 NA + L+ S P E+++ + ++ + + + + + Sbjct: 412 GNAVEDE---LREIASEPVAEHYFYTADFKTINQIGKKLQKKI 451 >gi|301753369|ref|XP_002912539.1| PREDICTED: anthrax toxin receptor 2-like [Ailuropoda melanoleuca] Length = 611 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 32/244 (13%), Positives = 66/244 (27%), Gaps = 52/244 (21%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P + + R F + V+D SGS+ V + Sbjct: 153 PVSAQEQPSCRGAFDLYFVLDKSGSVANNWIEIYNFVQQLTERFVSPQ------------ 200 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + I ++++ + + K+ + + + T Sbjct: 201 --------------MRLSFIVFSSQATIILPLTGDRSKISKGLEDLKRVSPVGETYIHEG 246 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICD 355 +K A Q K II LTDG + KI Sbjct: 247 LKLA----------------NEQIQKAGGFKASSIIIALTDGKLDGLVPSYAEKEAKI-- 288 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++ ++ + + Q L+ S + + V + I ++ R + Sbjct: 289 -SRSFGARVYCVGVL--DFEQAQLERIADSKDQVFPVKGGFQ---ALKGIINSILDRSCT 342 Query: 416 VILK 419 IL+ Sbjct: 343 EILE 346 >gi|281346820|gb|EFB22404.1| hypothetical protein PANDA_000280 [Ailuropoda melanoleuca] Length = 482 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 32/244 (13%), Positives = 66/244 (27%), Gaps = 52/244 (21%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P + + R F + V+D SGS+ V + Sbjct: 23 PVSAQEQPSCRGAFDLYFVLDKSGSVANNWIEIYNFVQQLTERFVSPQ------------ 70 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + I ++++ + + K+ + + + T Sbjct: 71 --------------MRLSFIVFSSQATIILPLTGDRSKISKGLEDLKRVSPVGETYIHEG 116 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICD 355 +K A Q K II LTDG + KI Sbjct: 117 LKLA----------------NEQIQKAGGFKASSIIIALTDGKLDGLVPSYAEKEAKI-- 158 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++ ++ + + Q L+ S + + V + I ++ R + Sbjct: 159 -SRSFGARVYCVGVL--DFEQAQLERIADSKDQVFPVKGGFQ---ALKGIINSILDRSCT 212 Query: 416 VILK 419 IL+ Sbjct: 213 EILE 216 >gi|332162963|ref|YP_004299540.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325667193|gb|ADZ43837.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] Length = 457 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 51/282 (18%), Positives = 93/282 (32%), Gaps = 72/282 (25%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 +H + F + K +E+ I F + F+ LI + + +K + A A Sbjct: 8 IHKFNHFTLFKK----NEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQA 63 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 LA + + + D + I N+A + +++ L + I N Sbjct: 64 TLA----LTIENNEIPDEPQQIKNNA----------LVLSYVNAYLPSKK---FLVPIIN 106 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI------V 174 I +++ + YN V +Y +F+ + V Sbjct: 107 INDNTHY----------------LEYNAAVTMAY--PAKFLSQSPFTNTISDMNITDNGV 148 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + + IF V D SGSM N + + ++ AL++A Sbjct: 149 AIKNKAIEASEPTDVIF----VADYSGSMLYNFN----------ENKPRDHERIDALRSA 194 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 D I S++ IGY I SWGT+++ Sbjct: 195 FRKLHDIIMDNSNIN------AIGY-------IPFSWGTKRI 223 >gi|282601472|ref|ZP_05981787.2| von Willebrand factor type A domain protein [Subdoligranulum variabile DSM 15176] gi|282569002|gb|EFB74537.1| von Willebrand factor type A domain protein [Subdoligranulum variabile DSM 15176] Length = 246 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 73/207 (35%), Gaps = 24/207 (11%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR-------TKMAALKNALLLFLDS 241 P I L +D SGSM A+ D D ++ +++ L+ + LF +S Sbjct: 10 PRVPICLCLDTSGSMG-AVQGDCVDTGKTLFEDGRQWNLVTGGTSRLDELQKGIKLFYNS 68 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + ++ + + + ++ + + + + T + A Sbjct: 69 VREDEVARYAAEICIVTFDSEAKCRMDFANLDRQ-----SDLPELTATGDTAMGEGVNLA 123 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAK-E 359 +L S K R+ FQ +++ +TDG N N ++ C + + Sbjct: 124 LDLLESRK---------REYQDKGVDYFQPWLVLMTDGVPNGNEGEFERAVQRCRDMEAQ 174 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSP 386 + + I+I + L K P Sbjct: 175 KKLTVFPIAIGDEGDQTALAKFSAKRP 201 >gi|329902233|ref|ZP_08273073.1| Putative MxaC-like protein [Oxalobacteraceae bacterium IMCC9480] gi|327548825|gb|EGF33456.1| Putative MxaC-like protein [Oxalobacteraceae bacterium IMCC9480] Length = 338 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 78/234 (33%), Gaps = 37/234 (15%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + S + + G I +++D S SM A+ + K+ K+ + Sbjct: 74 LASPATTIEKTGSGAE----ILVLLDRSASMDSALQEKGAKTPLTDKYAEPKKRKI--AR 127 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 AL F + + +GL+ ++ + + + + ++ + L T Sbjct: 128 TALAGF-------AAGRPHDAIGLMMFSENQFQVMPFNMRPDMIQAAIQAGGVGSGLGNT 180 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 D AM A + + I+ ++DG + ++ Sbjct: 181 DVGSAMLAALRTFDDRPDSG-----------------SRIIMLVSDG--GAQIAPAVRLQ 221 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 I D K N I + I + + Q L S ++ V ++ F+++S Sbjct: 222 IADGLKRNRIALYWIYLRSYN--QPAL---ADSDSAEFDGVVEVAMHRYFRSLS 270 >gi|315649635|ref|ZP_07902720.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275108|gb|EFU38483.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 595 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 48/235 (20%), Positives = 88/235 (37%), Gaps = 52/235 (22%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + LV+D S SM NSDP+ + S A+K + + + Sbjct: 30 MAASQGSKIDAVLVMDASNSMK---NSDPDRIGS------------EAMKMFIDMLSTTG 74 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQ 300 D + G++ YT R+++ +E + + +D L P TD + + + Sbjct: 75 DKV---------GVVSYTDRIQREKALLEIQSEADKTALKEFIDQLDRGPYTDISVGLDE 125 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-------- 352 A ++L +QG++ P I+ L DG N+ + T + Sbjct: 126 AVKVL-------------KQGMEPAHAPM---IVVLADGNNDLDPNTGKTSQEASDHLNQ 169 Query: 353 ICDKAKENFIKIVTISINASPN-GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 +AK + I I TI +NA + L + + + +AD L + I Sbjct: 170 AVQEAKGSGIPIYTIGLNADGKLNKEALAELANQTGGKSFTTSSADDLPQILSEI 224 >gi|212276002|ref|NP_001130333.1| hypothetical protein LOC100191428 [Zea mays] gi|194688870|gb|ACF78519.1| unknown [Zea mays] Length = 704 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 38/243 (15%), Positives = 78/243 (32%), Gaps = 59/243 (24%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + R I V+D+SGSM TKMA LK A+ + Sbjct: 251 SLRLSRRVPIDIVTVLDVSGSMAG--------------------TKMALLKQAMGFVIQH 290 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQ 300 + + +I +++ + + RQ + ++SL T+ A+K+ Sbjct: 291 LRPSD------RLSVIAFSSTARRLFPLQRMSHHGRQQALQAINSLGAGGGTNIADALKK 344 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A +++ ++ + II L+DG++ N+++ A Sbjct: 345 AVKVIADRSYKNSVCS----------------IILLSDGQDTY---NISSNFQGTSAGRR 385 Query: 361 F------------IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQ 407 + + T A + L +S + + + F I Sbjct: 386 SLVPSANPNELHMVPLHTFGFGADHDSDTLHSISEASGGTFSFIEDEGVMQDAFAQCIGG 445 Query: 408 LMV 410 L+ Sbjct: 446 LLS 448 >gi|297665730|ref|XP_002811194.1| PREDICTED: cartilage matrix protein-like [Pongo abelii] Length = 495 Score = 44.1 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 25/145 (17%), Positives = 51/145 (35%), Gaps = 17/145 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I+ L +G++ Y + V++ K + T + A++ A Sbjct: 68 IESLDVGPNATRVGMVNYASTVKQEFSLRAHISKAALLQAVRRIQPLSTGTMTGLAIQFA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 SD + G + S K +I +TDG + NV + +A+ + Sbjct: 128 ITKAFSDAE----------GGRSRSPDISKVVIVVTDG---RPQDNVQDVS--ARARASG 172 Query: 362 IKIVTISINASPNGQRLLKTCVSSP 386 +++ I + + L+ S P Sbjct: 173 VELFAIGVG--RVDKATLRQIASEP 195 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 70/173 (40%), Gaps = 28/173 (16%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMK 299 +D L + +GL+ Y++ V + G ++ + + S + K T + A+K Sbjct: 301 VDTLDVSDKLAQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALK 358 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 D + + + K+ I+F TDG + ++ ++ KAK+ Sbjct: 359 Y-----LIDNSFTVSSGARPRAQKVG-------IVF-TDGRSQDYINDAA-----KKAKD 400 Query: 360 NFIKIVTISI-NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 K+ + + NA + L+ S P E+++ + ++ + + + + + Sbjct: 401 LGFKMFAVGVGNAVEDE---LREIASEPVAEHYFYTADFKTINQIGKKLQKKI 450 >gi|254781007|ref|YP_003065420.1| hypothetical protein CLIBASIA_04540 [Candidatus Liberibacter asiaticus str. psy62] gi|254040684|gb|ACT57480.1| hypothetical protein CLIBASIA_04540 [Candidatus Liberibacter asiaticus str. psy62] Length = 411 Score = 44.1 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 8/57 (14%), Positives = 31/57 (54%) Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + I I ++ + + + L+ C S P +Y + + ++++ + +++++ ++ +S Sbjct: 343 RDAITIFSVGFSPDQDTRYTLRQCASDPSKYYEINSDENVMPIAKSLARNVITNWFS 399 >gi|193786838|dbj|BAG52161.1| unnamed protein product [Homo sapiens] Length = 496 Score = 44.1 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 51/145 (35%), Gaps = 17/145 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I+ L +G++ Y + V++ K + T + A++ A Sbjct: 68 IESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 D + G + S K +I +TDG + +V+ +A+ + Sbjct: 128 ITKAFGDAE----------GGRSRSPDISKVVIVVTDGRPQDSVQDVS-----ARARASG 172 Query: 362 IKIVTISINASPNGQRLLKTCVSSP 386 +++ I + + + L+ S P Sbjct: 173 VELFAIGVGSVD--KATLRQIASEP 195 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 75/221 (33%), Gaps = 36/221 (16%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKED 251 +NSD + N T + L + + F+ I D L + Sbjct: 252 GFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFISQIVDTLDVSDKL 311 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ Y++ V + G ++ + + + M + + Sbjct: 312 AQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRN-----------MSYMEKSTMTGAAL 358 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-N 370 + + QK I TDG + ++ ++ KAK+ K+ + + N Sbjct: 359 KYLIDNSFTVSSGARPGAQKVGIVFTDGRSQDYINDAA-----KKAKDLGFKMFAVGVGN 413 Query: 371 ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 A + L+ S P E+++ + ++ + + + + + Sbjct: 414 AVEDE---LREIASEPVAEHYFYTADFKTINQIGKKLQKKI 451 >gi|163858556|ref|YP_001632854.1| hypothetical protein Bpet4238 [Bordetella petrii DSM 12804] gi|163262284|emb|CAP44587.1| hypothetical protein Bpet4238 [Bordetella petrii] Length = 244 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 77/229 (33%), Gaps = 40/229 (17%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + + L++D+SGSM K+ + +A+ LD+ Sbjct: 10 KFTAPKAKPLPVVLLLDVSGSMSGE--------------------KIRNVNDAVRDMLDT 49 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + ++++ +I + ++V + + D T A++ A Sbjct: 50 FSDTENGETEIHVAIITFGSQVALHQPLA------SASDIHWQDLSAGGMTPLGTALQMA 103 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 ++ +PS ++ ++ ++DG N+ I D + Sbjct: 104 KAMIEDKDV-------------VPSRAYRPTVVLVSDGGPNDAWEKPLNAFISD-GRSAK 149 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++I A + L K + + NA L F+ ++ + Sbjct: 150 CDRLAMAIGADADEAVLGKFIEGTSNRLFYAENAKQLRDFFKFVTMSVT 198 >gi|170574976|ref|XP_001893043.1| Zona pellucida-like domain containing protein [Brugia malayi] gi|158601129|gb|EDP38122.1| Zona pellucida-like domain containing protein [Brugia malayi] Length = 664 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 59/152 (38%), Gaps = 19/152 (12%) Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAYQILTSDKKR 311 + LI Y+ + + + + V R ++ L T + A+ QAY++LT Sbjct: 1 LALITYSGQAYIHFKFNDPQIGNNTSVIRHLNGLKSIKGTTSTHIALHQAYKLLT----D 56 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 + N R+G +K II TDG + ++ K+ ++I I++ Sbjct: 57 TDNENGVREG-------VKKMIIIFTDGHSQRSPQDMALR-----LKDKGVEIFAITLTP 104 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 A + L + + ++ + VN F Sbjct: 105 APYADEGELLSITQNTDHIFTPVNLKDFEIKF 136 >gi|297290486|ref|XP_001113553.2| PREDICTED: complement factor B isoform 1 [Macaca mulatta] Length = 1266 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 76/228 (33%), Gaps = 43/228 (18%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +++D SGSM+ + D D A K L+ ++ + Sbjct: 763 IILDPSGSMNIYLVLDGSDSIGAGNFTG--------AKKCLVNLIEKVASYGVKPRY--- 811 Query: 255 GLIGYTTR--------VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 L+ Y T +++ W T+K+ + D + T++ A++ Y ++ Sbjct: 812 ALVTYATYPRIWVKVSDQESSNADWVTKKLSE--INYEDHKLKSGTNTKRALQAVYSMM- 868 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKEN 360 ++ + II +TDG +N + + K ++N Sbjct: 869 ---------SWPEDIPPEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKN 919 Query: 361 ------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + L + + ++ + V + ++L VF Sbjct: 920 PREDYLDVYVFGVGPLVDQVNINALASKKDNEQHVFKVKDMENLEDVF 967 >gi|291395817|ref|XP_002714337.1| PREDICTED: complement factor B-like [Oryctolagus cuniculus] Length = 764 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 39/228 (17%), Positives = 79/228 (34%), Gaps = 43/228 (18%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D +GSM+ + D D A A ++ I+ ++ Sbjct: 261 IVLDPAGSMNIYLVLDGSDSIGASNFTG-----------AKRCLVNLIEKVASYGVRPRY 309 Query: 255 GLIGYTTRVE--------KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 GL+ Y T K+ + +W TEK+ Q D + T++ A+ + Y ++ Sbjct: 310 GLVTYATYPNVLVRVSDPKSSDANWVTEKLNQ--ISYEDHKLKTGTNTKRALVEVYNMM- 366 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK----ICDKAKEN-- 360 ++ + II +TDG +N V I + + K+ Sbjct: 367 ---------SWPGDVPPEGWNRTRHVIILMTDGLHNMGGDPVTVINEIRDLLNIGKDRKN 417 Query: 361 ------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + P L + + ++ + V + + L VF Sbjct: 418 PREDYLDVYVFGVGPLVEPANINALASKKENEQHVFRVKDMEHLEDVF 465 >gi|73542573|ref|YP_297093.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72119986|gb|AAZ62249.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 340 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 73/232 (31%), Gaps = 43/232 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +P+ + L +DLS SM DP ++ A++ + F+ Sbjct: 90 TQPVRDLLLALDLSQSMDTRDFRDPSGAL---------IPRVQAVREVVSSFVAR----- 135 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GLI + + V+ + + + T A+ Sbjct: 136 --RPGDRIGLIVFGDAPYPLAPFTLDHALVQTMIRDLLPGMAGPSTALGDAV-------- 185 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 F Q S +K +I LTDG + K D AK+ + + T Sbjct: 186 -----GLGIKMFDQ-----SPAPEKVLIVLTDGNDTASKMPPERAA--DIAKQRHVTVHT 233 Query: 367 ISINASPNGQR-------LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I I L + + ++ + +SL ++ + ++ H Sbjct: 234 IGIGDPSAEGEQRVDLGVLQRMAAQTGGRYFFGADQNSLESIYATLDRITPH 285 >gi|331085807|ref|ZP_08334890.1| hypothetical protein HMPREF0987_01193 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330406730|gb|EGG86235.1| hypothetical protein HMPREF0987_01193 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 1321 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 41/236 (17%), Positives = 81/236 (34%), Gaps = 50/236 (21%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +++ E I L D SGSM K+ LK A+ F+ Sbjct: 504 KVKKLEYDTVNIALCCDNSGSMEGE--------------------KIENLKKAVSTFVGK 543 Query: 242 IDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + ++V +G++ + + V E EP EK+ Q V T+ ++ Sbjct: 544 L------ADEVNIGIVPFGSGVLEGVCEPGSSREKLEQSVESFRSD---SGTNIYSGVEY 594 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 +L +K + ++DG ++ S KI + Sbjct: 595 TLSMLAKEKDALNIA------------------VIMSDG-QDSIPSEEQLQKITSACENG 635 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + ++ + A + L + + V +++SL +Q I Q + +Y + Sbjct: 636 NILLYSMGLGADVESEVLSTYSDAGNGAYVFVSDSNSLYSFYQYIYQ-ISKNRYEI 690 >gi|297286914|ref|XP_001113364.2| PREDICTED: collagen alpha-6(VI) chain-like [Macaca mulatta] Length = 2262 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 55/353 (15%), Positives = 107/353 (30%), Gaps = 66/353 (18%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 SN LG E+I A F + ++++ + + N Sbjct: 495 SNKQDLGKAIENIRQLGGNTNTGAALNFTLSLLQKAKKQRGNK-VPCHLVVLTNGMSKDS 553 Query: 130 THMANNRL------------DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 NRL +N T + Y + + L + R + Sbjct: 554 ILEPANRLREEHIRVYAIGIKEANQTQLREIAGEEKRVYYVHDFDALKDIRN-----QVV 608 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + E + I +VD SGS+ + +KM K + Sbjct: 609 QEICAEEACKEMKADIMFLVDSSGSIG-----------------PENFSKM---KTFMKN 648 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + + V +G++ ++ ++ + R M + ++ Sbjct: 649 LVSKSQIGP---DRVQIGVVQFSDINKEEFQL-----------NRFMSQSDIS--NAIDQ 692 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 M Q + SF + +F K +KF+I +TDGE + Sbjct: 693 MAHIGQTTLTGSALSFVSQYF-SPTKGSRPSVRKFLILITDGEAQDIVKEPAV-----AL 746 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 ++ + I ++ + S Q L+ PE + V N + Q I +V Sbjct: 747 RQEGVIIYSVGVFGSNVTQ--LEEISGRPEMVFYVEN----FDILQRIEDDLV 793 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 81/243 (33%), Gaps = 32/243 (13%) Query: 180 LLRIEMGERPIFLIE-------LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM--AA 230 LL + F +E ++ D++ S+ + D E + T + Sbjct: 957 LLAMAGSSDKYFFVETFGGLKGIISDVTASVCNSSKVDCEIDKVDLVFLMDGSTSIQQTD 1016 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL--I 288 K + + + V +G ++ GT + ++ ++++ I Sbjct: 1017 FKKMKEFMVSVVQDFDVSNKRVRIGAAQFSDAYRPEFPL--GTFIGAKEISIQIENITQI 1074 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A++ K +F + + ++ LTDG +S Sbjct: 1075 FGNTHIGAALR---------KVEHYFRPDMGSRINTGTPQV---LLVLTDG-----QSQD 1117 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + + I I ++ I + Q+L++ ++ + V N D L V + I + Sbjct: 1118 EVAQAAEALRRRGIDIYSVGI-GDVDDQQLMQITGTAEKK-LTVHNFDELKKVNKRIVRN 1175 Query: 409 MVH 411 + Sbjct: 1176 ICT 1178 >gi|160899637|ref|YP_001565219.1| von Willebrand factor type A [Delftia acidovorans SPH-1] gi|160365221|gb|ABX36834.1| von Willebrand factor type A [Delftia acidovorans SPH-1] Length = 244 Score = 44.1 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 32/229 (13%), Positives = 77/229 (33%), Gaps = 40/229 (17%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + + L++D+SGSM K+ + +A+ LD+ Sbjct: 10 KFTAPKAKPLPVVLLLDVSGSMSGE--------------------KIRNVNDAVRDMLDT 49 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + ++++ +I + ++V + + D T A++ A Sbjct: 50 FSDTENGETEIHVAIITFGSQVALHQPLA------SASDIHWQDLSAGGMTPLGTALQMA 103 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 ++ IPS ++ ++ ++DG N+ I D + Sbjct: 104 KAMIEDKDV-------------IPSRAYRPTVVLVSDGGPNDAWEKPLNAFISD-GRSAK 149 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++I A + L K + + NA L F+ ++ + Sbjct: 150 CDRLAMAIGADADEAVLGKFIEGTSNRLFYAENAKQLRDFFKFVTMSVT 198 >gi|104779436|ref|YP_605934.1| surface adhesion protein [Pseudomonas entomophila L48] gi|95108423|emb|CAK13117.1| Surface adhesion protein [Pseudomonas entomophila L48] Length = 5862 Score = 44.1 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 83/240 (34%), Gaps = 43/240 (17%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 +++ N +V+ + L + + + +VD SGSM Sbjct: 5310 DNITGTDGNDVVVADVSGLHVVPGQN---YNLAFIVDTSGSMGS---------------- 5350 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKE--DVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + + A K +L ++ + V + L+ + T+V+ ++ + ++ Sbjct: 5351 ----SGVDAAKKSLESVFKTLAASVKGDQSGTVNILLVDFATQVKSSVAVTLNDAGLQTL 5406 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + T+ A K N+F Q +K F+TDG Sbjct: 5407 LNALNNLRADGGTNYEDAFKTT-------------ANWF-QNLKDGGNTGSNQTFFITDG 5452 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVT-ISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + +++N N+ + I + T ++ +G SS + ++ NA +L Sbjct: 5453 KPTYYQANENSNPTLGT---SGITLDTFLASINYKSGMAYNGYIDSSNTNYVSIDNAGNL 5509 >gi|86148746|ref|ZP_01067019.1| VCBS [Vibrio sp. MED222] gi|85833461|gb|EAQ51646.1| VCBS [Vibrio sp. MED222] Length = 2142 Score = 44.1 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 35/266 (13%), Positives = 73/266 (27%), Gaps = 34/266 (12%) Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 + + I + ++ + + + + V+ L+ + Sbjct: 1433 GTANGKVVIEATLDRNGHYDIQLKAPVDHPNTNGEDNLVIQIPVIAKDTSGLKSSGGQIT 1492 Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + + + I+LV+D+SGSM +T Sbjct: 1493 VSIEDDQPVATAIDVPVTPETKSDTNIQLVIDVSGSMGYDSG-------------VAGKT 1539 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMD 285 ++A LK +L L D L VK V + ++ + SW + + + Sbjct: 1540 RLAILKESLAKMLQQYDTLGDVK--VQIVTFTGNAKLIHDGSKSWFSVSEAITEINKLKP 1597 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 Y + S+ + + ++ + F+TDG N Sbjct: 1598 KN-----------NTDYDDALRKARTSWDHDEDSKLPDANNVSY-----FITDGIPNQDD 1641 Query: 346 --SNVNTIKICDKAKENFIKIVTISI 369 +N I I I Sbjct: 1642 RIDYWEAKTWTKHLDDNGITSQAIGI 1667 >gi|302336993|ref|YP_003802199.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301634178|gb|ADK79605.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 333 Score = 44.1 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 43/267 (16%), Positives = 78/267 (29%), Gaps = 68/267 (25%) Query: 165 LNQRYNQKIVSFIPALL--RIEMGERPIFLIELVVDLSGSMHC-AMNSDPEDVNSAPICQ 221 L IV+ L R ++ I V+D S SM D Sbjct: 63 LWLGVASLIVALAGPRLVEREKIYLHRGVDIVFVLDQSPSMIVRDFGGD----------- 111 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 T+ A K+A+ F++ +E +GL+ + P T +++ Sbjct: 112 ----TRFDAAKHAIRTFVE-------GREHDPLGLVIFGDEAALVTPP---TLDYTSFLS 157 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 R ++ L ++ S +K ++ ++DGEN Sbjct: 158 RMDAVRVM--------------KLGRGSALGLGMAVATVHLEKSSAE-RKVMVIVSDGEN 202 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTIS----------INASPNGQ------------RLL 379 N + + A I+I + G+ LL Sbjct: 203 NAGEITPESAAR--VAASLGIRIYAVGVGGEGSVATEFTDPETGKSYRGTYEGKIDMELL 260 Query: 380 KTCV-SSPEYHYNVVNADSLIHVFQNI 405 K S+ + + +L VF+ I Sbjct: 261 KAVTESTRGQAFLAGSPGALSQVFREI 287 >gi|156616290|ref|NP_001096078.1| collagen alpha-6(VI) chain precursor [Homo sapiens] gi|189082902|sp|A6NMZ7|CO6A6_HUMAN RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2263 Score = 44.1 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 55/353 (15%), Positives = 108/353 (30%), Gaps = 66/353 (18%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 SN LG E+I A F + ++++ + + N Sbjct: 494 SNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQRGNK-VPCHLVVLTNGMSKDS 552 Query: 130 THMANNRL------------DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 NRL +N T + Y + + L + R + Sbjct: 553 ILEPANRLREEHIRVYAIGIKEANQTQLREIAGEEKRVYYVHDFDALKDIRN-----QVV 607 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + E + I +VD SGS+ + +KM K + Sbjct: 608 QEICTEEACKEMKADIMFLVDSSGSIG-----------------PENFSKM---KTFMKN 647 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + + V +G++ ++ ++ + R M + ++ Sbjct: 648 LVSKSQIGP---DRVQIGVVQFSDINKEEFQL-----------NRFMSQSDIS--NAIDQ 691 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 M Q + SF + +F K +KF+I +TDGE + + Sbjct: 692 MAHIGQTTLTGSALSFVSQYF-SPTKGARPNIRKFLILITDGEAQDIVKEPAVV-----L 745 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 ++ + I ++ + S Q L+ PE + V N + Q I +V Sbjct: 746 RQEGVIIYSVGVFGSNVTQ--LEEISGRPEMVFYVEN----FDILQRIEDDLV 792 >gi|90411204|ref|ZP_01219217.1| hypothetical protein P3TCK_06547 [Photobacterium profundum 3TCK] gi|90328050|gb|EAS44371.1| hypothetical protein P3TCK_06547 [Photobacterium profundum 3TCK] Length = 436 Score = 44.1 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 38/376 (10%), Positives = 107/376 (28%), Gaps = 45/376 (11%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++K +I L+++ + + V + K +++ ++A L+ A + Sbjct: 12 RAQKGVVAIFATLAMVVLIGAGALALDVGNLVLSKGKLQNIVDSAALSAAKAI------- 64 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + A++ I +L+ F + I N S + ++ Sbjct: 65 -----DLGGDQAEAIVAG-----NEAINNNLTLD--GFGSMTIDNTDIHYEFSDSLPFDS 112 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 S+N + V ++ + N + + + I Sbjct: 113 ---STNTATSPYVRVRIEDVDVADYLVAIFNIDMSARSSAVAGPSSSITTTCN------- 162 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VV LS ++ + + K + + + ++ G Sbjct: 163 VVPLSICEGSESSTTLSGYSEGSLHVLKASSTKDSAIGSGNFMPMALKDADGN---TIPG 219 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 Y + + + + + ++ PT Y+ + + + Sbjct: 220 ASSYGDALAGSFDTC--LTVIEDEMITSEPGNMVGPTRGIDTRFGIYEGTFKNDENKYPA 277 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS-INASPN 374 + + Q + + DG+ N + T+ I + + +A+ + Sbjct: 278 DKDTYYDSNNLVEVQT--VQIDDGDGNTIEEYQTTLSH--------DAIYSFADYSANQD 327 Query: 375 GQRLLKTCVSSPEYHY 390 + L + ++ Sbjct: 328 FESCLSDSACQSQGYF 343 >gi|15608619|ref|NP_215997.1| hypothetical protein Rv1481 [Mycobacterium tuberculosis H37Rv] gi|31792676|ref|NP_855169.1| hypothetical protein Mb1517 [Mycobacterium bovis AF2122/97] gi|148661274|ref|YP_001282797.1| hypothetical protein MRA_1491 [Mycobacterium tuberculosis H37Ra] gi|148822701|ref|YP_001287455.1| hypothetical protein TBFG_11510 [Mycobacterium tuberculosis F11] gi|167968028|ref|ZP_02550305.1| hypothetical membrane protein [Mycobacterium tuberculosis H37Ra] gi|215403336|ref|ZP_03415517.1| hypothetical protein Mtub0_06533 [Mycobacterium tuberculosis 02_1987] gi|215411140|ref|ZP_03419948.1| hypothetical protein Mtub9_07385 [Mycobacterium tuberculosis 94_M4241A] gi|215426820|ref|ZP_03424739.1| hypothetical protein MtubT9_10680 [Mycobacterium tuberculosis T92] gi|215430374|ref|ZP_03428293.1| hypothetical protein MtubE_06801 [Mycobacterium tuberculosis EAS054] gi|215445676|ref|ZP_03432428.1| hypothetical protein MtubT_06934 [Mycobacterium tuberculosis T85] gi|218753198|ref|ZP_03531994.1| hypothetical protein MtubG1_07054 [Mycobacterium tuberculosis GM 1503] gi|219557390|ref|ZP_03536466.1| hypothetical protein MtubT1_08827 [Mycobacterium tuberculosis T17] gi|253799469|ref|YP_003032470.1| hypothetical protein TBMG_02500 [Mycobacterium tuberculosis KZN 1435] gi|254231712|ref|ZP_04925039.1| hypothetical protein TBCG_01457 [Mycobacterium tuberculosis C] gi|254364352|ref|ZP_04980398.1| hypothetical membrane protein [Mycobacterium tuberculosis str. Haarlem] gi|254550498|ref|ZP_05140945.1| hypothetical protein Mtube_08557 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] gi|260186427|ref|ZP_05763901.1| hypothetical protein MtubCP_10429 [Mycobacterium tuberculosis CPHL_A] gi|260204765|ref|ZP_05772256.1| hypothetical protein MtubK8_10713 [Mycobacterium tuberculosis K85] gi|289447084|ref|ZP_06436828.1| membrane protein [Mycobacterium tuberculosis CPHL_A] gi|289554729|ref|ZP_06443939.1| membrane protein [Mycobacterium tuberculosis KZN 605] gi|289569506|ref|ZP_06449733.1| membrane protein [Mycobacterium tuberculosis T17] gi|289574162|ref|ZP_06454389.1| membrane protein [Mycobacterium tuberculosis K85] gi|289745232|ref|ZP_06504610.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289750042|ref|ZP_06509420.1| membrane protein [Mycobacterium tuberculosis T92] gi|289753564|ref|ZP_06512942.1| hypothetical protein TBGG_00680 [Mycobacterium tuberculosis EAS054] gi|289757593|ref|ZP_06516971.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|289761639|ref|ZP_06521017.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|294993225|ref|ZP_06798916.1| hypothetical protein Mtub2_01637 [Mycobacterium tuberculosis 210] gi|297634047|ref|ZP_06951827.1| hypothetical protein MtubK4_07987 [Mycobacterium tuberculosis KZN 4207] gi|297731033|ref|ZP_06960151.1| hypothetical protein MtubKR_08072 [Mycobacterium tuberculosis KZN R506] gi|298524990|ref|ZP_07012399.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|306775670|ref|ZP_07414007.1| membrane protein [Mycobacterium tuberculosis SUMu001] gi|306779490|ref|ZP_07417827.1| membrane protein [Mycobacterium tuberculosis SUMu002] gi|306784220|ref|ZP_07422542.1| membrane protein [Mycobacterium tuberculosis SUMu003] gi|306788587|ref|ZP_07426909.1| membrane protein [Mycobacterium tuberculosis SUMu004] gi|306792930|ref|ZP_07431232.1| membrane protein [Mycobacterium tuberculosis SUMu005] gi|306797308|ref|ZP_07435610.1| membrane protein [Mycobacterium tuberculosis SUMu006] gi|306803189|ref|ZP_07439857.1| membrane protein [Mycobacterium tuberculosis SUMu008] gi|306967588|ref|ZP_07480249.1| membrane protein [Mycobacterium tuberculosis SUMu009] gi|306971779|ref|ZP_07484440.1| membrane protein [Mycobacterium tuberculosis SUMu010] gi|307079498|ref|ZP_07488668.1| membrane protein [Mycobacterium tuberculosis SUMu011] gi|307084057|ref|ZP_07493170.1| membrane protein [Mycobacterium tuberculosis SUMu012] gi|313658366|ref|ZP_07815246.1| hypothetical protein MtubKV_08092 [Mycobacterium tuberculosis KZN V2475] gi|54040185|sp|P64856|Y1517_MYCBO RecName: Full=UPF0353 protein Mb1517 gi|54042534|sp|P64855|Y1481_MYCTU RecName: Full=UPF0353 protein Rv1481/MT1528 gi|166979870|sp|A5U2I5|Y1491_MYCTA RecName: Full=UPF0353 protein MRA_1491 gi|3261503|emb|CAA16011.1| PROBABLE MEMBRANE PROTEIN [Mycobacterium tuberculosis H37Rv] gi|31618266|emb|CAD96184.1| PROBABLE MEMBRANE PROTEIN [Mycobacterium bovis AF2122/97] gi|124600771|gb|EAY59781.1| hypothetical protein TBCG_01457 [Mycobacterium tuberculosis C] gi|134149866|gb|EBA41911.1| hypothetical membrane protein [Mycobacterium tuberculosis str. Haarlem] gi|148505426|gb|ABQ73235.1| putative membrane protein [Mycobacterium tuberculosis H37Ra] gi|148721228|gb|ABR05853.1| hypothetical membrane protein [Mycobacterium tuberculosis F11] gi|253320972|gb|ACT25575.1| membrane protein [Mycobacterium tuberculosis KZN 1435] gi|289420042|gb|EFD17243.1| membrane protein [Mycobacterium tuberculosis CPHL_A] gi|289439361|gb|EFD21854.1| membrane protein [Mycobacterium tuberculosis KZN 605] gi|289538593|gb|EFD43171.1| membrane protein [Mycobacterium tuberculosis K85] gi|289543260|gb|EFD46908.1| membrane protein [Mycobacterium tuberculosis T17] gi|289685760|gb|EFD53248.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289690629|gb|EFD58058.1| membrane protein [Mycobacterium tuberculosis T92] gi|289694151|gb|EFD61580.1| hypothetical protein TBGG_00680 [Mycobacterium tuberculosis EAS054] gi|289709145|gb|EFD73161.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|289713157|gb|EFD77169.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|298494784|gb|EFI30078.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|308215767|gb|EFO75166.1| membrane protein [Mycobacterium tuberculosis SUMu001] gi|308327531|gb|EFP16382.1| membrane protein [Mycobacterium tuberculosis SUMu002] gi|308330994|gb|EFP19845.1| membrane protein [Mycobacterium tuberculosis SUMu003] gi|308334816|gb|EFP23667.1| membrane protein [Mycobacterium tuberculosis SUMu004] gi|308338604|gb|EFP27455.1| membrane protein [Mycobacterium tuberculosis SUMu005] gi|308342306|gb|EFP31157.1| membrane protein [Mycobacterium tuberculosis SUMu006] gi|308350100|gb|EFP38951.1| membrane protein [Mycobacterium tuberculosis SUMu008] gi|308354737|gb|EFP43588.1| membrane protein [Mycobacterium tuberculosis SUMu009] gi|308358644|gb|EFP47495.1| membrane protein [Mycobacterium tuberculosis SUMu010] gi|308362622|gb|EFP51473.1| membrane protein [Mycobacterium tuberculosis SUMu011] gi|308366304|gb|EFP55155.1| membrane protein [Mycobacterium tuberculosis SUMu012] gi|323719929|gb|EGB29041.1| membrane protein [Mycobacterium tuberculosis CDC1551A] gi|326903107|gb|EGE50040.1| membrane protein [Mycobacterium tuberculosis W-148] gi|328459217|gb|AEB04640.1| membrane protein [Mycobacterium tuberculosis KZN 4207] Length = 335 Score = 44.1 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 72/234 (30%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D + ++M A + A F D + + Sbjct: 99 VMLVIDVSQSM---------------RATDVEPSRMVAAQEAAKQFADELTPG------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ E + + + + T + A+ A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR---TATGEAIFTALQAIATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICD-KAKENFIKIVTISI 369 + P I+ +DG+ N AK+ + I TIS Sbjct: 194 ---------IGGGDTPPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISF 244 Query: 370 NAS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P +K S YN L V+ ++ Q + Sbjct: 245 GTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELRAVYSSLQQQI 298 >gi|327299330|ref|XP_003234358.1| hypothetical protein TERG_04951 [Trichophyton rubrum CBS 118892] gi|326463252|gb|EGD88705.1| hypothetical protein TERG_04951 [Trichophyton rubrum CBS 118892] Length = 741 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 30/147 (20%), Positives = 55/147 (37%), Gaps = 11/147 (7%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + IVS P L I LV+D+SGSM NS +D + Sbjct: 46 PNRDSMIVSIQPPLKPENDVPHVPCDIVLVIDISGSM----NSAAPIPTGERGGEDTGLS 101 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + K+A +++++ + ++ + T V E ++ + V +D Sbjct: 102 ILDLTKHAAKTIIETLNEKD------RLAVVTFCTEVNVAFELDSMNKENKSTVLGAIDK 155 Query: 287 LILK-PTDSTPAMKQAYQILTSDKKRS 312 L K T+ MK+ IL ++ + Sbjct: 156 LYGKSSTNLWHGMKKGLNILATNPAQG 182 >gi|332254526|ref|XP_003276380.1| PREDICTED: cartilage matrix protein [Nomascus leucogenys] Length = 496 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 50/145 (34%), Gaps = 17/145 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I L +G++ Y + V++ K + T + A++ A Sbjct: 68 IQSLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 SD + G + S K +I +TDG + +V+ +A+ + Sbjct: 128 ITKAFSDAE----------GGRSRSPDISKVVIVVTDGRPQDSVQDVS-----ARARASG 172 Query: 362 IKIVTISINASPNGQRLLKTCVSSP 386 +++ I + + L+ S P Sbjct: 173 VELFAIGVG--RVDKATLRQIASEP 195 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 83/223 (37%), Gaps = 40/223 (17%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKED 251 +NSD + N T + L + + F++ I D L + Sbjct: 252 GFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKL 311 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDK 309 +GL+ Y++ V + G ++ + + S + K T + A+K D Sbjct: 312 AQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----LIDN 364 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + K+ I+F TDG + ++ ++ KAK+ K+ + + Sbjct: 365 SFTVSSGARPGAQKVG-------IVF-TDGRSQDYINDAA-----KKAKDLGFKMFAVGV 411 Query: 370 -NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 NA + L+ S P E+++ + ++ + + + + + Sbjct: 412 GNAVEDE---LREIASEPVAEHYFYTADFKTINQIGKKLQKKI 451 >gi|326789712|ref|YP_004307533.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540476|gb|ADZ82335.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 404 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 73/225 (32%), Gaps = 49/225 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ I + +D SGSM SDP L+ Sbjct: 109 EKAFKDIVIAIDTSGSME---QSDPNGERFKATSS----------------------LID 143 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS----LILKPTDSTPAMKQAY 302 +++ + + + + + T K ++ V + + T + +AY Sbjct: 144 NLEGNRRIAFMTFDDSPILQFDFMEATTKEQKEVVKAKIASYQQNDDGQTGVRDMINEAY 203 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 +++ ++ K + +I ++DG ++ ++ + + + N I Sbjct: 204 ELIQNNSKNHSGS-----------------LIMISDGAPSDDSASNIPALVSNYVQNN-I 245 Query: 363 KIVTISINASPNG--QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I TI + N Q L+ + HY+ + + F I Sbjct: 246 PIYTIGMMYGDNSAEQYLIDIANLTGGQHYSTSDTTMIAGAFGQI 290 >gi|290995572|ref|XP_002680357.1| predicted protein [Naegleria gruberi] gi|284093977|gb|EFC47613.1| predicted protein [Naegleria gruberi] Length = 269 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 75/226 (33%), Gaps = 50/226 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ER + VVD SGSM +K+ +K+ L D + Sbjct: 45 ERKGIDLICVVDKSGSMAG--------------------SKIEMVKSTLAFMFDQLKPTD 84 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQIL 305 + L+ + + + +++ + E R + + ++ T+ + A+ ++ L Sbjct: 85 ------RIALVEFDSNISTSLQFTNMNESGRSKAKQVVSNIRAGSCTNLSGAL---FEGL 135 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN------TIKICDKAKE 359 +R+ ++ TDG N +N N T I ++ + Sbjct: 136 RLIGQRTNANEVTS-------------LLLFTDGLANEGITNTNEIVKKMTTMIHEEIRT 182 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 N + + T + L + +Y + D + F N+ Sbjct: 183 N-LTVFTFGFGTDTDANMLTSISQAGNGLYYFLQTTDDIPKAFGNV 227 >gi|281349285|gb|EFB24869.1| hypothetical protein PANDA_021744 [Ailuropoda melanoleuca] Length = 493 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 21/124 (16%), Positives = 41/124 (33%), Gaps = 15/124 (12%) Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + I Y+T+ +E + K+ ++R + T+ K+A + + + Sbjct: 90 RVSFITYSTQGHTLMELTSDRNKIHNSLSRLKNIKPTGATNMHEGFKKANEQIEQENAGG 149 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 II LT G K+ T +KA+E K+ + + Sbjct: 150 NNAASL--------------IIALTTGP-LTPKALQETKSEAEKAREMGAKVYCVGVKDY 194 Query: 373 PNGQ 376 Q Sbjct: 195 RKDQ 198 >gi|269954956|ref|YP_003324745.1| LPXTG-motif cell wall anchor domain-containing protein [Xylanimonas cellulosilytica DSM 15894] gi|269303637|gb|ACZ29187.1| LPXTG-motif cell wall anchor domain protein [Xylanimonas cellulosilytica DSM 15894] Length = 2724 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 33/246 (13%), Positives = 62/246 (25%), Gaps = 27/246 (10%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + +S + + + E + LV+D S SM + SD + Sbjct: 130 VGPDELAVALSALGSTRHVMSTEPVPIDVVLVLDNSYSMTQCVESD----AFCRGPNVWQ 185 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKED----------VYMGLIGYTTRVEKNIEPSWGTE 274 ++ AA+ +A+ + I V GL+ W T+ Sbjct: 186 DSRAAAMADAVNSAIQIIHEDDPDNRVAIVLFGTNSEVVTGLV---NPQPTGSRYVWLTQ 242 Query: 275 KVRQYVTRDMDSLILKPTDS--TPAMKQAYQILTSDKKRSFF-TNFFRQGVKIPSLPFQK 331 + + T + T Y + + S Sbjct: 243 SGNSNTNGVLTLHVGNQTRTIGTSGSDTQYTNIQRGLTTGLGLLASQAKAAVTGSEQRVP 302 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKA----KENFIKIVTISINASPNGQRLLKTCVSSPE 387 +I TDGE ++ + A A LLK Sbjct: 303 NLILFTDGEPTRSSTSQSWWNPSGSAVQGPSSTGSAFYGNGFLA-ALSASLLK--AKVEA 359 Query: 388 YHYNVV 393 ++ + Sbjct: 360 HYNDAD 365 >gi|225028889|ref|ZP_03718081.1| hypothetical protein EUBHAL_03177 [Eubacterium hallii DSM 3353] gi|224953773|gb|EEG34982.1| hypothetical protein EUBHAL_03177 [Eubacterium hallii DSM 3353] Length = 1070 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 50/380 (13%), Positives = 114/380 (30%), Gaps = 86/380 (22%) Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +L + + + K ++ + + + S S +A F + + + I T Sbjct: 277 DLQMDTLQMQHLKEEKKDRTVETVAKAADHKLSVSKSKVTAKFEVKSFSDFILTWNIDAT 336 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + L++ +N + L+ ++ A + + Sbjct: 337 --PADPLETGDNAASIEKQINHEKYATLR--------DDGTYDLTLTVAGKKGTETNKAK 386 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + ++D SGSM +K A NA+ S+ + Sbjct: 387 LDVIYILDKSGSMKEDFGGT---------------SKRIAASNAITALTKSLKQ--NANI 429 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQY---------------VTRDMDSLILKPTDST 295 D ++ ++ + WG + + + R T+ Sbjct: 430 DARFSMVTFSGNKTTGM---WGQGDTKTWDDAEVAVSWTTDAGTIERGSKPTSNGGTNYQ 486 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNT---- 350 ++ A ++LTS + + +IF++DG+ + + T Sbjct: 487 AGIRTAKELLTSKRAGAMTA-----------------VIFISDGDPTFYYNPDGYTRGDG 529 Query: 351 ----------IKICDKAKENFIK------IVTISINASPNGQRLLKTCVS---SPEYHYN 391 +K+C A +N I T+ + + + L C + S +++ Sbjct: 530 NNDGNGGADNLKVCLDAAKNEIANLGVNYFYTVGVGKANDYVNLSDLCSASGVSGAKNFD 589 Query: 392 VVNADSLIHVFQNISQLMVH 411 N D L F I ++ Sbjct: 590 GTNTDELTKAFSTIESDILT 609 >gi|224823582|ref|ZP_03696691.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Lutiella nitroferrum 2002] gi|224604037|gb|EEG10211.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Lutiella nitroferrum 2002] Length = 995 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 36/295 (12%), Positives = 69/295 (23%), Gaps = 85/295 (28%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR-----------TKMAALKNALLLFLDS 241 I L D SGSM + + T+++ K+ +D+ Sbjct: 50 ILLFFDNSGSMAWDVYGNTGQKRERHCNWRGCWYGDVLDPLQGETRLSIAKDVTKSMIDA 109 Query: 242 IDLLSHVKEDVYMGLIGY-------------TTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 D+ GL + R + + ++ R ++ Sbjct: 110 Y-------PDMRWGLFSFDPSSSKDYVNDREDLRYNRAGRLVAEVGGGKDHLDRVKSAID 162 Query: 289 LKPTDSTPAMKQAYQILT------------SDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 D+ + +AY +T S R + F++ + Sbjct: 163 SLGADTNTPLAEAYYEMTRYYRGMSSAYYKPVNPASGSRGSGRYASPVQYRCQANFVLMM 222 Query: 337 TDGENNNFK-------------------------------------SNVNTIKI-CDKAK 358 TDGE S+ + Sbjct: 223 TDGEPTEDNEFPLSGDPLFAGLSLGKDNPLPSFAGRAAAGDLFTSGSDGEGQGWDAETFA 282 Query: 359 ENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVHR 412 IK TI LL+ ++ N + L F + + + Sbjct: 283 RQSIKTYTIGFTVDN---TLLQKTAMQGGGRYFTASNREQLKASFTSAMDDIYRQ 334 >gi|125719088|ref|YP_001036221.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain-containing protein [Streptococcus sanguinis SK36] gi|125499005|gb|ABN45671.1| Nitric oxide reductase NorD / Von Willebrand factor type A (vWA) domain protein, putative [Streptococcus sanguinis SK36] gi|324989618|gb|EGC21563.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK353] Length = 444 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 57/315 (18%), Positives = 101/315 (32%), Gaps = 39/315 (12%) Query: 49 KKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 + SM+ + I S V + S+ G L D K+ I +S Sbjct: 52 MRTSMQ-YVDRTIGKATSVFVLDESKYGKDVRKTEGWNYIGLSPDGKKVINYIWNKSTKS 110 Query: 109 -YSAVFYNTEIQNIVNSSRISMTHMAN-NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 +V + ++ NRL + N T Y + + LN Sbjct: 111 WDESVLGTNSLYDMQLDLEFKADESYQDNRLINYNLTGQYK---NSKNKLSIDTAISALN 167 Query: 167 QRYNQKIVSFIPALLRIEMGERPI-----FLIELVVDLSGSMHCAMNSD---PEDVNSAP 218 + V+ + + PI + V D SGSM + ++ P Sbjct: 168 TKQVISKVAKGKKGVALAYRNDPIEGQVNTAVSFVFDTSGSMAYGLRNEGKRNSQGKWGP 227 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVK-EDVY-MGLIGYTTRVEKNIEPSWGTEKV 276 + D R +M LK L +D + + +V V G Y E +E T K+ Sbjct: 228 LDADNPRARMNILKKKANLLVDDLKEIGNVSVNLVRFSGSASY--IQEDFVELDKDTGKI 285 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 ++ + S I T+ ++ L + + K+++ L Sbjct: 286 KEKIKSLPTSWI---TNPGDGLRYGLVSLQRNPAQL------------------KYVVLL 324 Query: 337 TDGENNNFKSNVNTI 351 TDG N + + + I Sbjct: 325 TDGIPNAYTGSPDGI 339 >gi|254780914|ref|YP_003065327.1| hypothetical protein CLIBASIA_04065 [Candidatus Liberibacter asiaticus str. psy62] gi|254780929|ref|YP_003065342.1| hypothetical protein CLIBASIA_04140 [Candidatus Liberibacter asiaticus str. psy62] gi|254040591|gb|ACT57387.1| hypothetical protein CLIBASIA_04065 [Candidatus Liberibacter asiaticus str. psy62] gi|254040606|gb|ACT57402.1| hypothetical protein CLIBASIA_04140 [Candidatus Liberibacter asiaticus str. psy62] Length = 408 Score = 44.1 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 8/57 (14%), Positives = 31/57 (54%) Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + I I ++ + + + L+ C S P +Y + + ++++ + +++++ ++ +S Sbjct: 343 RDAITIFSVGFSPDQDTRYTLRQCASDPSKYYEINSDENVMPIAKSLARNVITNWFS 399 >gi|89098949|ref|ZP_01171829.1| hypothetical protein B14911_06266 [Bacillus sp. NRRL B-14911] gi|89086353|gb|EAR65474.1| hypothetical protein B14911_06266 [Bacillus sp. NRRL B-14911] Length = 940 Score = 44.1 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 50/180 (27%), Gaps = 40/180 (22%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + R + V D SGSM K K + K+A+ Sbjct: 64 LTPKGKATNANRDPIDVVFVFDKSGSM---------------NDSGKNPQKFQSAKDAMT 108 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEK----NIEPSWGTEKVRQYVTRDMDSLILKPT 292 ++ + + G + + VE N P + + L T Sbjct: 109 AAVNFFKENAGPND--RFGFVPFDDDVETGKVVNFAPENNMASLNLINSNSNSLSALGGT 166 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + T ++ A + K+++F+TDGE K T Sbjct: 167 NYTQSLDAAL-------------------GMFGNSTNNKYVLFMTDGEPTFSKVIERTTY 207 >gi|296228118|ref|XP_002759733.1| PREDICTED: collagen alpha-6(VI) chain [Callithrix jacchus] Length = 2267 Score = 44.1 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 57/353 (16%), Positives = 107/353 (30%), Gaps = 66/353 (18%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 SN LG E+I A F + ++++ + + N Sbjct: 493 SNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKQQRGNK-VPCHLVVLTNGMSKDS 551 Query: 130 THMANNRL------------DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 NRL +N T + Y + + L + R + Sbjct: 552 ILEPANRLREEHIRVYAIGVKEANKTQLREIAGEEKRVYYVHDFDALKDIRN-----QVV 606 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + E + I +VD SGS+ + +KM K + Sbjct: 607 QEICAEEACKEMKADIMFLVDSSGSIG-----------------PENFSKM---KTFMKN 646 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + V +G++ ++ G K + R M + ++ Sbjct: 647 LVSKSQI---GANRVQIGVVQFS-----------GVNKEEFQLNRFMSQSDIS--NAIDQ 690 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 M + + SF + +F K +KF+I +TDGE + Sbjct: 691 MVHIGETTLTGSALSFVSQYF-SPTKGARPNVRKFLILITDGEAQDVVKEPAV-----AL 744 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 ++ I I ++ + S Q L+ PE + V N + Q+I +V Sbjct: 745 RQEGIIIYSVGVFGSNVTQ--LEEISGRPEMVFYVEN----FDILQHIEDDLV 791 >gi|308175402|ref|YP_003922107.1| hypothetical protein BAMF_3511 [Bacillus amyloliquefaciens DSM 7] gi|307608266|emb|CBI44637.1| conserved hypothetical protein YwmC [Bacillus amyloliquefaciens DSM 7] gi|328555380|gb|AEB25872.1| hypothetical protein BAMTA208_18610 [Bacillus amyloliquefaciens TA208] gi|328913751|gb|AEB65347.1| hypothetical protein LL3_03821 [Bacillus amyloliquefaciens LL3] Length = 229 Score = 43.7 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 38/243 (15%), Positives = 76/243 (31%), Gaps = 46/243 (18%) Query: 172 KIVSFIPALLRIEMGERPIF----LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + + PA E F +++D SGSM ++ +K Sbjct: 14 LLTAASPAFAAENANENSFFKPKNNTAILLDASGSMAKRIDGV---------------SK 58 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVY--MGLIGYTTRVEKNIEPS--WGTEKV--RQYVT 281 K+ ++ F D I S V+ V+ G + +V+ +G ++ + ++ Sbjct: 59 YNMAKDEIVRFADQIKSKSQVRMTVFGSEGNNKNSGKVQSCESIRGVYGFQRFDKQSFLN 118 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T A++ A K K + LTDGE Sbjct: 119 SLNGIGPTGWTPIAKALEDAKASFNGVHKLG-----------------SKSVYLLTDGEE 161 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIH 400 + T K + ++ IK+ I + + L + +Y + + Sbjct: 162 TCGGDPIKTAK---ELRKQHIKVNVIGFDFNEGFNGQLHAIAGAGGGKYYEAHSQKDMNR 218 Query: 401 VFQ 403 +F+ Sbjct: 219 IFK 221 >gi|221128149|ref|XP_002161198.1| PREDICTED: similar to inter-alpha trypsin inhibitor, heavy chain 3, partial [Hydra magnipapillata] Length = 464 Score = 43.7 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 44/268 (16%), Positives = 79/268 (29%), Gaps = 52/268 (19%) Query: 147 NMDVMTSY-DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 + T Y DY + + + K S L E +R + +V+D SGSM Sbjct: 7 TVACSTEYKDYPFKEKLDI-WTLISLKAPSLGMTLDEKEHRKRAPIDLVVVIDKSGSMAG 65 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 K+A +K L + ++ + LI + T V Sbjct: 66 E--------------------KLALVKKTLEFVVSQLNEKD------RLCLITFDTSVYL 99 Query: 266 NIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + + + T + Q + D T+ + + + Sbjct: 100 DFKLTPMTPMNKYQTLKIIKDISPGSMTNLCGGLMKGLCEVIDRADEEKNEVAS------ 153 Query: 325 PSLPFQKFIIFLTDG------ENNNFKSNVNTIKICDKA---KENFIKIVTISINASPNG 375 ++ TDG N + S+ T K K I T ++ N Sbjct: 154 --------VLLFTDGFANKGGLTNIYCSSSQTAKYTIGIVGPKTADASIYTFGFGSNHNA 205 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 Q L + + +Y + N D + F Sbjct: 206 QMLKEISDAGSGMYYYIENVDMIAEAFG 233 >gi|238789155|ref|ZP_04632943.1| hypothetical protein yfred0001_29950 [Yersinia frederiksenii ATCC 33641] gi|238722687|gb|EEQ14339.1| hypothetical protein yfred0001_29950 [Yersinia frederiksenii ATCC 33641] Length = 522 Score = 43.7 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 42/262 (16%), Positives = 75/262 (28%), Gaps = 55/262 (20%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I +E + F + F+ L+ + + KK + A A LA + Sbjct: 18 IKNENGAILLSFIFILPIFIGLVFLSFEISHFIQKKARLSDAIEQATLALTVDNDESPDD 77 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + E S +FI N+ K L + NI + S Sbjct: 78 DNIKKEKNS------------KFIINYAKAYLPNEKFS---KPVINITSHSDY------- 115 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 I Y +D+ Y + + + + +S L+ + + Sbjct: 116 ---------INYQVDMTIYY--PTKILNKIFQTVSPEVSISDNARALKYTTTDSKPTDVV 164 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 V D SGSM+ + E K+ AL+ + I + Sbjct: 165 FVADYSGSMNEYFDESDE----------SDEKKIVALRRIFKDIQNEIKYNNV------- 207 Query: 255 GLIGYTTRVEKNIEPSWGTEKV 276 + + SWGT+ Sbjct: 208 -----NIDIIGFVPFSWGTKNF 224 >gi|261338458|ref|ZP_05966342.1| putative von Willebrand factor type A domain protein [Bifidobacterium gallicum DSM 20093] gi|270276443|gb|EFA22297.1| putative von Willebrand factor type A domain protein [Bifidobacterium gallicum DSM 20093] Length = 493 Score = 43.7 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 38/260 (14%), Positives = 82/260 (31%), Gaps = 62/260 (23%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + + ++ + +++D+SGSM T M KNA+ Sbjct: 58 SVSSTDMDTAQQQTESDVVVLMDVSGSM--------------------TTTDMKVAKNAV 97 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + L+ + V M ++ +++ + S + T S Sbjct: 98 NGLANQL--LNDENDTVRMSIVRFSSEAKT-----------------LEFSNGSEWTHSP 138 Query: 296 PAMKQAYQILTSDKKRSFF-TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + QA LTS ++ + + +++ ++DG N S + C Sbjct: 139 ALVAQALNTLTSRGNTNWDGALQNASALVQGDSARKSYVVLMSDGYPNTINSCYPAVANC 198 Query: 355 D-------KAKENFIK---------IVTIS--INASPNGQRLLKTCVSSPEYH----YNV 392 A I+ I +S +AS + + L+ + + Sbjct: 199 TDTSWSEPNAVPKAIEAANTMPNTQIYAVSTRTSASESMKELVDGINAKAPKYPAQIMYG 258 Query: 393 VNADSLIHVFQNISQLMVHR 412 + SL + F I+ + R Sbjct: 259 TDQQSLNNAFDTIADAIRKR 278 >gi|162138242|gb|ABX82825.1| complement factor B [Sus scrofa] Length = 765 Score = 43.7 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 76/227 (33%), Gaps = 41/227 (18%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A KN L F++ + + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGARNFTG--------AKNCLKDFIEKVASYGVKPKY--- 309 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK------PTDSTPAMKQAYQILT-S 307 GL+ Y T + I S +VT +D + T++ A+ + Y +++ Sbjct: 310 GLVTYATDPKVLIRVSNPKSADADWVTEQLDKISYDDHKLKAGTNTKKALLEVYNMMSWG 369 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-------NTIKICDKAKEN 360 N R I+ LTDG +N V + + I K Sbjct: 370 VNNFPDNWNRTRH-----------VIVLLTDGLHNMGGDPVTVIHDIRDLLNIGRNRKNP 418 Query: 361 -----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I + + + L + ++ + + + D+L VF Sbjct: 419 REDYLDIYVFGVGPLVNQENINALASKKDKEQHVFKLKDVDNLEDVF 465 >gi|156120152|ref|NP_001095294.1| complement factor B [Sus scrofa] gi|148724909|emb|CAN87697.1| B-factor, properdin [Sus scrofa] Length = 765 Score = 43.7 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 76/227 (33%), Gaps = 41/227 (18%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A KN L F++ + + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGARNFTG--------AKNCLKDFIEKVASYGVKPKY--- 309 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK------PTDSTPAMKQAYQILT-S 307 GL+ Y T + I S +VT +D + T++ A+ + Y +++ Sbjct: 310 GLVTYATDPKVLIRVSNPKSADADWVTEQLDKISYDDHKLKAGTNTKKALLEVYNMMSWG 369 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-------NTIKICDKAKEN 360 N R I+ LTDG +N V + + I K Sbjct: 370 VNNFPDNWNRTRH-----------VIVLLTDGLHNMGGDPVTVIHDIRDLLNIGRNRKNP 418 Query: 361 -----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I + + + L + ++ + + + D+L VF Sbjct: 419 REDYLDIYVFGVGPLVNQENINALASKKDKEQHVFKLKDVDNLEDVF 465 >gi|148724910|emb|CAN87698.1| B-factor, properdin [Sus scrofa] Length = 549 Score = 43.7 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 76/227 (33%), Gaps = 41/227 (18%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM+ + D D A KN L F++ + + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGARNFTG--------AKNCLKDFIEKVASYGVKPKY--- 309 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK------PTDSTPAMKQAYQILT-S 307 GL+ Y T + I S +VT +D + T++ A+ + Y +++ Sbjct: 310 GLVTYATDPKVLIRVSNPKSADADWVTEQLDKISYDDHKLKAGTNTKKALLEVYNMMSWG 369 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-------NTIKICDKAKEN 360 N R I+ LTDG +N V + + I K Sbjct: 370 VNNFPDNWNRTRH-----------VIVLLTDGLHNMGGDPVTVIHDIRDLLNIGRNRKNP 418 Query: 361 -----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I + + + L + ++ + + + D+L VF Sbjct: 419 REDYLDIYVFGVGPLVNQENINALASKKDKEQHVFKLKDVDNLEDVF 465 >gi|326789709|ref|YP_004307530.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540473|gb|ADZ82332.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 593 Score = 43.7 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 44/250 (17%), Positives = 77/250 (30%), Gaps = 61/250 (24%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + LV+D SGSM ++ K+ + LF+D + Sbjct: 25 MAATSDAQLDAILVIDASGSM-----------------KETDPNKLGL--EGVKLFVDML 65 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSW-----GTEKVRQYVTRDMDSLILKPTDSTPA 297 L + +G++ Y + V + S E ++ +V L TD T Sbjct: 66 GLTDN-----QVGVVTYGSDVSQTYPMSLVKNQSDKENIKNFVDGITRDLEY--TDITSG 118 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN--------NNFKSNVN 349 +K+A ++L I+ TDG N + + Sbjct: 119 LKEAVKMLNQRNASGNSP----------------LIVVFTDGNNAIGGVANRTPADIDKD 162 Query: 350 TIKICDKAKENFIKIVTISINASPNGQ----RLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I +A+ I TI +N NG+ L K V + + + L + I Sbjct: 163 LAAIISQAQSEGYPIYTIGLN--DNGKLNEAYLEKISVDTKAKAFATKDPAELPDILTEI 220 Query: 406 SQLMVHRKYS 415 + K Sbjct: 221 FAAHSNLKVQ 230 >gi|226310161|ref|YP_002770055.1| hypothetical protein BBR47_05740 [Brevibacillus brevis NBRC 100599] gi|226093109|dbj|BAH41551.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 477 Score = 43.7 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 41/232 (17%), Positives = 84/232 (36%), Gaps = 40/232 (17%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + +E+++D SGSM N +T+M A K A+ F +S+ ++ Sbjct: 169 KESYNVEIILDASGSMAAKSNG---------------KTRMDAAKEAIQAFAESLPEQAN 213 Query: 248 VKEDVY----MGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDMDSL-ILKPTDSTPAMKQA 301 V VY G T + E +G + ++ +T+ ++ T ++++A Sbjct: 214 VALRVYGHKGSGKESDKTLSCGSSELVYGMQTYNKEKLTQSLNQFQPTGYTPIAYSLQEA 273 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + L K+P I ++DG V K +++ Sbjct: 274 KKDL----------------SKLPGDKNTNMIFLVSDGIETCDGDPVEAAKQLAQSEITP 317 Query: 362 IKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I I I GQ+ LK ++ + + + L F + + ++ Sbjct: 318 I-INVIGFGVDGPGQQQLKEVAKAAGGRYVLIQDQKELQDEFNR-GKEIANK 367 >gi|326672754|ref|XP_002664126.2| PREDICTED: collagen alpha-1(XXI) chain-like [Danio rerio] Length = 572 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 67/178 (37%), Gaps = 25/178 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL--ILKPTDST 295 ++ + ++ +G++ Y+ +I G + + M+S+ + T++ Sbjct: 64 LVNITMSFNIGQKFTQVGVVQYSDDPFLHIPL--GKHFSSSDLIKAMESIEYMGGNTNTG 121 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+K A L + +R K + LTDG KS + + Sbjct: 122 RAIKFANDKLFALSER-------------GPNGIAKIAVVLTDG-----KSQDEVLAAAE 163 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 A++ I + I + S + L+ + P Y ++V + ++ + + I Q + Sbjct: 164 AARKKGIILFAIGV-GSETEEAQLRAIANKPSSTYVFSVKDYKAIAKIREVIRQKLCE 220 >gi|307565272|ref|ZP_07627765.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307345941|gb|EFN91285.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 318 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 70/232 (30%), Gaps = 44/232 (18%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + I L +D+S SM + D ++ K F+ Sbjct: 81 DNKDSEGIDIMLAMDISASM---------------LTNDVIPNRLEVAKEVASDFI---- 121 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 +GL + + + + L++K Sbjct: 122 SGRPNDN---IGLTIFAGEAFTQCPLTTDHASLINLLNSVRTDLVVKG------------ 166 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L D G S K +I LTDG NN + T + AK+ I+ Sbjct: 167 -LIQDGTAIGMGLINAVGRLKSSKAKSKVVILLTDGSNNVGSISPMTAA--EIAKKFNIR 223 Query: 364 IVTISINASPN------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + TI + N L+ + + Y+ + L ++++I++L Sbjct: 224 VYTIGLGTEQNNGYSDIDYTTLRQIANVTNGKFYSAQSQTELSQIYKDINKL 275 >gi|288802180|ref|ZP_06407620.1| BatA protein [Prevotella melaninogenica D18] gi|288335147|gb|EFC73582.1| BatA protein [Prevotella melaninogenica D18] Length = 318 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 71/233 (30%), Gaps = 46/233 (19%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + I L +D+S SM + +D +M K F+ S Sbjct: 81 DNKDTEGIDIMLTMDISASM---------------LTEDVFPNRMEVAKEVASEFISS-R 124 Query: 244 LLSHVKEDVYMG-LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 ++ ++ G + VR + L+ T + + Sbjct: 125 PSDNIGLTIFAGEAFTQCPMTLDHAALLNLLHNVRT--DLVTNGLMQDGTAIGLGLANSV 182 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 L K +S K +I LTDG NN + T AK+ I Sbjct: 183 SRLKDSKAKS------------------KVVILLTDGSNNVGSISPMTAAT--IAKKFGI 222 Query: 363 KIVTISINASPNG-------QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 ++ TI + + L V + Y + L ++Q+I +L Sbjct: 223 RVYTIGLGRETGEDIGAIDYKTLQDIAVLTNGEFYRAQSQAELSKIYQDIDKL 275 >gi|148689164|gb|EDL21111.1| RIKEN cDNA E330026B02 [Mus musculus] Length = 1482 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 51/352 (14%), Positives = 101/352 (28%), Gaps = 64/352 (18%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRF---IKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 SN LG E+I A F + K+ + S Sbjct: 493 SNKPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAKKERGSKVPCHLVVLTNGMSRDSV 552 Query: 127 ISMTHMANNR--------LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + H + +N + Y + + L N R + Sbjct: 553 LGPAHKLREENIRVHAIGVKEANQMQLREIAGEEKRVYYVHDFDALRNIRN-----QVVQ 607 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + E I +VD SGS+ + +KM K + Sbjct: 608 EICAEEACRDMKADIMFLVDSSGSIG-----------------PENFSKM---KMFMKNL 647 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + V +G++ ++ ++ + T + + +D + Sbjct: 648 VSKSQI---GADRVQIGVVQFSHENKEEFQL--NTFMSQSDIANAIDRMTHIG------- 695 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 T F + + K +KF+I +TDGE + + + Sbjct: 696 -----ETTLTGSALTFVSQYFSPDKGARPNVRKFLILITDGEAQDIVRDPAL-----ALR 745 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + I ++ + S Q L+ PE + V N + Q+I +V Sbjct: 746 KEGVIIYSVGVFGSNVTQ--LEEISGKPEMVFYVEN----FDILQHIEDDLV 791 >gi|328880283|emb|CCA53522.1| putative secreted protein [Streptomyces venezuelae ATCC 10712] Length = 424 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 72/223 (32%), Gaps = 32/223 (14%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + E + +ELV+D+SGSM +++M+A K A Sbjct: 25 TSATADEPVTKESPKVELVLDVSGSMRAK--------------DIDGKSRMSAAKQAFNE 70 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 L V E+V +G+ + +Q + P D T A Sbjct: 71 VL------DAVPEEVRLGIRTLGADYPG--------QDRKQGCKDTRQLYPVGPLDRTEA 116 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 K A L + + I+ +TDGE+ + + A Sbjct: 117 -KTAVASLAPTGWTPIGPALLGAAKDLEGGDATRRIVLITDGEDTCAPLDPCQVAREIAA 175 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 K + I T+ + ++ L TC++ + +V + L Sbjct: 176 KGIHLVIDTLGLVPDAKTRQQL-TCIAEATGGTYTSVHRTEDL 217 >gi|33600172|ref|NP_887732.1| putative hemolysin [Bordetella bronchiseptica RB50] gi|33567770|emb|CAE31684.1| putative hemolysin [Bordetella bronchiseptica RB50] Length = 3346 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 60/169 (35%), Gaps = 24/169 (14%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 ++ + + I LV+DLS SM+ S K T++ K+AL Sbjct: 2715 GGIKQNVTAGTSYNIALVLDLSDSMNDKWGSG-----------SNKPTRLQTAKDALKAL 2763 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPA 297 L++ + + + + LI + + + + V +D L+ ST Sbjct: 2764 LENQLAVHDGE--INVSLITFNG---SSSALKKSITGLTPENVDEMVDILMGLKASSTTP 2818 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 A+ T + + + P++ FLTDGE + S Sbjct: 2819 YGAAFDRTTQWFEGQPTVDSEGK-------PYKNLTFFLTDGEPSTEWS 2860 >gi|302346571|ref|YP_003814869.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] gi|302150280|gb|ADK96541.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] Length = 318 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 71/233 (30%), Gaps = 46/233 (19%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + I L +D+S SM + +D +M K F+ S Sbjct: 81 DNKDTEGIDIMLTMDISASM---------------LTEDVFPNRMEVAKEVASEFISS-R 124 Query: 244 LLSHVKEDVYMG-LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 ++ ++ G + VR + L+ T + + Sbjct: 125 PSDNIGLTIFAGEAFTQCPMTLDHAALLNLLHNVRT--DLVTNGLMQDGTAIGLGLANSV 182 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 L K +S K +I LTDG NN + T AK+ I Sbjct: 183 SRLKDSKAKS------------------KVVILLTDGSNNVGSISPMTAAT--IAKKFGI 222 Query: 363 KIVTISINASPNG-------QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 ++ TI + + L V + Y + L ++Q+I +L Sbjct: 223 RVYTIGLGRETGEDIGAIDYKTLQDIAVLTNGEFYRAQSQAELSKIYQDIDKL 275 >gi|325661940|ref|ZP_08150560.1| hypothetical protein HMPREF0490_01298 [Lachnospiraceae bacterium 4_1_37FAA] gi|325471792|gb|EGC75010.1| hypothetical protein HMPREF0490_01298 [Lachnospiraceae bacterium 4_1_37FAA] Length = 1321 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 41/236 (17%), Positives = 81/236 (34%), Gaps = 50/236 (21%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +++ E I L D SGSM K+ LK A+ F+ Sbjct: 504 KVKKLEYDTVNIALCCDNSGSMEGE--------------------KIENLKKAVSTFVGK 543 Query: 242 IDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + ++V +G++ + + V E EP EK+ Q V T+ ++ Sbjct: 544 L------ADEVNIGIVPFGSGVLEGVCEPGSSREKLEQSVESFRSD---SGTNIYSGVEY 594 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 +L +K + ++DG ++ S KI + Sbjct: 595 TLSMLAKEKDALNIA------------------VIMSDG-QDSIPSEEQLQKITSACENG 635 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + ++ + A + L + + V +++SL +Q I Q + +Y + Sbjct: 636 NILLYSMGLGADVESEVLSTYSDAGNGAYVFVSDSNSLYSFYQYIYQ-ISKNRYEI 690 >gi|325286052|ref|YP_004261842.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321506|gb|ADY28971.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 348 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 73/219 (33%), Gaps = 44/219 (20%) Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 +++ L+F +L ++ ++E +R I VD+S SM Sbjct: 51 STFKSILKFCVFILGLTLLIVGLANPKIGTKLETVKREGVDIVFAVDVSKSMLAE----- 105 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 D +++A K + + + +G+I Y + + + Sbjct: 106 ----------DIAPSRLAKAKRIVSEIIAQLGSD-------RIGIIAYAGQAYPQLPITT 148 Query: 272 GTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + ++ + L + T A+ A D++ + Sbjct: 149 DYGAAKMFLQGLNTNMLSSQGTAINQALDLASTYYDDDEQTN------------------ 190 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + ++DGE S +T +KA + IKI TI + Sbjct: 191 RVLFIISDGE---DHSEGSTEGAVEKAVDQGIKIFTIGV 226 >gi|256787646|ref|ZP_05526077.1| lipoprotein [Streptomyces lividans TK24] gi|289771539|ref|ZP_06530917.1| lipoprotein [Streptomyces lividans TK24] gi|289701738|gb|EFD69167.1| lipoprotein [Streptomyces lividans TK24] Length = 532 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 72/226 (31%), Gaps = 47/226 (20%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 ERP + V+D+SGSM + ++ + AL D + Sbjct: 176 ERQSERPPAALTFVIDISGSMG-------------------EPGRLDLAQEALGTMTDRL 216 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQA 301 V L+ ++ E + + R V +D L T+ M+ Sbjct: 217 RDDDSV------ALVTFSDEAETVLPMT-RLGDHRGRVHDAIDGLEPTDSTNLGAGMETG 269 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI--KICDKAKE 359 Y+ ++ ++ ++D N ++ +TI +I + +E Sbjct: 270 YETAVEGRREGATNR----------------VVLVSDALANTGDTDADTILERIATERRE 313 Query: 360 NFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLIHVFQN 404 + I + + + S G L++ + V + VF Sbjct: 314 HGITLFGVGV-GSDYGDALMERLADKGDGHTTYVSTTEDAREVFSE 358 >gi|21221175|ref|NP_626954.1| lipoprotein [Streptomyces coelicolor A3(2)] gi|6969217|emb|CAB75310.1| putative lipoprotein [Streptomyces coelicolor A3(2)] Length = 532 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 72/226 (31%), Gaps = 47/226 (20%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 ERP + V+D+SGSM + ++ + AL D + Sbjct: 176 ERQSERPPAALTFVIDISGSMG-------------------EPGRLDLAQEALGTMTDRL 216 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQA 301 V L+ ++ E + + R V +D L T+ M+ Sbjct: 217 RDDDSV------ALVTFSDEAETVLPMT-RLGDHRGRVHDAIDGLEPTDSTNLGAGMETG 269 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI--KICDKAKE 359 Y+ ++ ++ ++D N ++ +TI +I + +E Sbjct: 270 YETAVEGRREGATNR----------------VVLVSDALANTGDTDADTILERIATERRE 313 Query: 360 NFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLIHVFQN 404 + I + + + S G L++ + V + VF Sbjct: 314 HGITLFGVGV-GSDYGDALMERLADKGDGHTTYVSTTEDAREVFSE 358 >gi|220675931|emb|CAX12090.1| matrilin 3b [Danio rerio] Length = 434 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 63/178 (35%), Gaps = 34/178 (19%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDST 295 + ++ L + + L+ Y + V + +V+Q +R + T + Sbjct: 227 LSEMVNSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDP--LSTGTMTG 284 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI----IFLTDGENNNFKSNVNTI 351 A+K A + + ++ + P +K I I +TDG + V + Sbjct: 285 MAIKTAMEQVFTENA--------------GARPLKKGIGKVAIIVTDG---RPQDKVEEV 327 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 A+ + I+I + + R LK S P ++ + V + L F+ Sbjct: 328 S--AAARASGIEIYAVGV--DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRE 381 >gi|59939916|ref|NP_001012385.1| matrilin 3b [Danio rerio] gi|56797875|emb|CAG30518.1| matrilin-3b precursor [Danio rerio] gi|220675929|emb|CAX12088.1| matrilin 3b [Danio rerio] Length = 478 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 63/178 (35%), Gaps = 34/178 (19%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDST 295 + ++ L + + L+ Y + V + +V+Q +R + T + Sbjct: 227 LSEMVNSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDP--LSTGTMTG 284 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI----IFLTDGENNNFKSNVNTI 351 A+K A + + ++ + P +K I I +TDG + V + Sbjct: 285 MAIKTAMEQVFTENA--------------GARPLKKGIGKVAIIVTDG---RPQDKVEEV 327 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 A+ + I+I + + R LK S P ++ + V + L F+ Sbjct: 328 S--AAARASGIEIYAVGV--DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRE 381 >gi|295399398|ref|ZP_06809380.1| Ig domain protein group 2 domain protein [Geobacillus thermoglucosidasius C56-YS93] gi|294978864|gb|EFG54460.1| Ig domain protein group 2 domain protein [Geobacillus thermoglucosidasius C56-YS93] Length = 929 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 39/186 (20%), Positives = 67/186 (36%), Gaps = 46/186 (24%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + R++ RP + V D+SGSM K+ + K AL Sbjct: 66 LTPQGRVDNIIRPPIDVVFVFDVSGSM------------------VMPSLKLDSAKYALQ 107 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI--EPSWGTEKVRQYVTRDMDSL----ILK 290 +D ++ + L+ ++ V+ + GT V+Q++ Sbjct: 108 SAVDYFKANANPND--RFALVPFSDDVQYSKVVPFPSGTYDVKQHLNWIATVANSLRANG 165 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVN 349 T+ T A++QA Q +D R +K+IIFLTDG + Sbjct: 166 GTNYTQALQQA-QSFFNDPAR------------------KKYIIFLTDGMPTVSIAKEPI 206 Query: 350 TIKICD 355 T K+C+ Sbjct: 207 TYKVCE 212 >gi|327269503|ref|XP_003219533.1| PREDICTED: collagen alpha-1(XXII) chain-like [Anolis carolinensis] Length = 1601 Score = 43.7 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 59/161 (36%), Gaps = 23/161 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMK 299 +D + +G++ Y+ R + G + R+ + ++ T++ A++ Sbjct: 65 VDTFEIGPDKTRVGVVRYSDRPTTEFDL--GRYQTREQIKEAAKNIKYYGGNTNTGDALR 122 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 S++ G + +K I LTDG + + + T A + Sbjct: 123 YINTYSFSEEA----------GGRPTDSAIKKVAILLTDGRSQDHVLDPAT-----AAHK 167 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 I+I + + + L S P + ++V + +++ Sbjct: 168 AGIRIFAVGVG--EALKEELDEIASEPKSAHVFHVSDYNAI 206 >gi|219852403|ref|YP_002466835.1| hypothetical protein Mpal_1806 [Methanosphaerula palustris E1-9c] gi|219546662|gb|ACL17112.1| conserved hypothetical protein [Methanosphaerula palustris E1-9c] Length = 316 Score = 43.7 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 35/245 (14%), Positives = 77/245 (31%), Gaps = 60/245 (24%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 R + L +D SGSM D + T++ A K + + + S+D Sbjct: 83 QTRDGVSVVLAIDDSGSMAA---------------NDYQPTRLEAAKESASVLIKSLDPK 127 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 Y G++ + + S ++V + T + A + Sbjct: 128 D------YAGVVIFESGATTAAYLSPDKDRVMEKTAAIEQKN--GQTALGDGLALAVDMA 179 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S + +K ++ L+DG N + A +N +++ Sbjct: 180 DSIPNQ------------------KKVVVLLSDGVGNAGVISPEDAT--AFAAQNKVQVF 219 Query: 366 TISI-----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 T+ + A+ + L + +Y V+ +L ++ +++ Sbjct: 220 TVGLGSKSPVLLGTDPTGTPQYATLDEAALQSIAEKTGGTYYTSVDEQTLHQIYAGLNKE 279 Query: 409 MVHRK 413 +V K Sbjct: 280 IVREK 284 >gi|38505728|ref|NP_942348.1| hypothetical protein slr7060 [Synechocystis sp. PCC 6803] gi|38423752|dbj|BAD01962.1| slr7060 [Synechocystis sp. PCC 6803] Length = 588 Score = 43.7 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 70/237 (29%), Gaps = 50/237 (21%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 I+ P RP + V+D SGSM K+ Sbjct: 25 LIIRITPPSPPAMDQPRPSLNLGFVIDRSGSMEGH-------------------NKITYA 65 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK--NIEPSWGTEKVRQYVTRDMDSLIL 289 + A+ +D + H+ + + +V+ + ++ V Sbjct: 66 RQAVCYAIDQLSPGDHLSVTI------FDDQVQTLIPSTLVKDKAQFKRLVQGI---NPG 116 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD + Q + S + II L+DG N ++N + Sbjct: 117 GCTDLHGG----------------WLQGGIQVSQNLSAELNR-IILLSDGLANRGETNPD 159 Query: 350 TIKI-CDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQN 404 I + T+ + + LL+ S +Y V +A+ L +F+ Sbjct: 160 IIATDVHGLAQRGASTTTLGL-GDDYNEDLLEAMARSGDGNYYYVADAEQLPTIFER 215 >gi|254293317|ref|YP_003059340.1| von Willebrand factor A [Hirschia baltica ATCC 49814] gi|254041848|gb|ACT58643.1| von Willebrand factor type A [Hirschia baltica ATCC 49814] Length = 563 Score = 43.7 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 67/189 (35%), Gaps = 45/189 (23%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 ++ E+P + L+VD+SGSM K+ K AL L +D + Sbjct: 181 LDRTEQPPLNLTLLVDVSGSM-------------------NHEDKLPLAKKALKLLIDKM 221 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQA 301 D H+ VY G G K E + + +D+L T ++ A Sbjct: 222 DEDDHISVVVYAGAAGTVLEPTKGSE--------KSKIFAALDNLSAGGSTAGGEGLRLA 273 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKEN 360 Y S ++++ + ++ LTDG+ N S+ + +E+ Sbjct: 274 Y----SLAEQNYDAASVNR------------VMLLTDGDFNVGVTSDERLEDFVARKRES 317 Query: 361 FIKIVTISI 369 + + + Sbjct: 318 GVYLSVLGF 326 >gi|237667682|ref|ZP_04527666.1| von Willebrand factor type A domain protein [Clostridium butyricum E4 str. BoNT E BL5262] gi|237656030|gb|EEP53586.1| von Willebrand factor type A domain protein [Clostridium butyricum E4 str. BoNT E BL5262] Length = 1336 Score = 43.7 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 69/177 (38%), Gaps = 28/177 (15%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 ++ E I L++D SGSM+ +D I ++K ++ +LK + F++ Sbjct: 498 EDINEEVKKDIVLILDTSGSMNFNFYND-------SIPYNEKDKRIYSLKQSAKQFINKF 550 Query: 243 DLLSHVKEDV--YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + +++ + Y GY +++ E + Y + + T+ +++ Sbjct: 551 NNKDNIRIGIIPYSYYSGYANNIKQLTEI--NDNNKKSYENYIDNIKVEGATNQGDGIRE 608 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 A ++L + S +K++I +TDGE + I D A Sbjct: 609 AGKMLLNTDGNS-----------------KKYVILITDGEATAITIEKPNLIINDSA 648 >gi|227818462|ref|YP_002822433.1| hypothetical protein NGR_b02140 [Sinorhizobium fredii NGR234] gi|227337461|gb|ACP21680.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 440 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 13/78 (16%), Positives = 29/78 (37%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 + ++ + + NF ++ AL V L + + + + + M+ A +AA + Sbjct: 3 KVLTAMQRLLKDQSGNFGLMTALLVPVLFLSGSVALNIANATREASKMQDALDAAAIKAV 62 Query: 66 SKMVSNLSRLGDRFESIS 83 S R E+ Sbjct: 63 RSYGEGESENAVRTEANR 80 >gi|293335787|ref|NP_001168683.1| hypothetical protein LOC100382472 [Zea mays] gi|223948855|gb|ACN28511.1| unknown [Zea mays] gi|223949305|gb|ACN28736.1| unknown [Zea mays] gi|223949981|gb|ACN29074.1| unknown [Zea mays] gi|223950189|gb|ACN29178.1| unknown [Zea mays] gi|224028553|gb|ACN33352.1| unknown [Zea mays] Length = 731 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 39/313 (12%), Positives = 88/313 (28%), Gaps = 54/313 (17%) Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 + F + E ++ + ++ F + TS + + HL Sbjct: 206 DPSTFNDDEPLDLSSEEANDNHQGCLRTVEIKTYPEFTEVPENTS-ERNFTVLIHLKAPV 264 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 S R + V+D+SGSM TK+ Sbjct: 265 AQHLQPSSNLGDGNGLSTARAPVDLITVLDVSGSMAG--------------------TKL 304 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSL 287 A LK A+ + ++ + +I +++ + T +Q + Sbjct: 305 ALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRQQSLLAVNSLT 358 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ A+++ +++ + ++ + II L+DG++ S Sbjct: 359 SNGGTNIAEALRKGSKVIEERQAKNPVCS----------------IILLSDGQDTYTVSP 402 Query: 348 VNTIK-----ICD----KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + C + + A + L +S + ++ Sbjct: 403 TAGVHKGAPEYCALLPSTNGNQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAI 462 Query: 399 IHVFQN-ISQLMV 410 F I L+ Sbjct: 463 QDAFAQCIGGLLS 475 >gi|87310828|ref|ZP_01092954.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Blastopirellula marina DSM 3645] gi|87286343|gb|EAQ78251.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Blastopirellula marina DSM 3645] Length = 788 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 45/249 (18%), Positives = 88/249 (35%), Gaps = 53/249 (21%) Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 +R +T + + +++ +F++ D + + ++ + + + Sbjct: 230 KNRAVVTIEQKDCVPTNDFRLFFDTA---KTDLSASVLTYRPDKSEDGYFLLLASPPVEE 286 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + VVD SGSM K+ K A L++++ Sbjct: 287 VGDVKTKKTVIFVVDRSGSMSGE--------------------KIEQAKEAAKFVLNNLN 326 Query: 244 LLSHVKEDVYMGLIGYTTRVEKN-IEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQA 301 +I Y + VE E +K R+ +D+L T+ A+ +A Sbjct: 327 EGDLFN------IIAYDSDVESFEPELQKLDDKTREKALGFVDNLYAGGSTNIDGALAKA 380 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-EN 360 +L DK+ S +++FLTDG + + N KI D AK +N Sbjct: 381 MGMLKDDKRPS-------------------YMLFLTDGLPTHGEQNE--AKIVDNAKQKN 419 Query: 361 FIKIVTISI 369 ++ IS Sbjct: 420 DVRARVISF 428 >gi|194211145|ref|XP_001494734.2| PREDICTED: chloride channel, calcium activated, family member 4 [Equus caballus] Length = 1022 Score = 43.7 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 74/218 (33%), Gaps = 53/218 (24%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D SGSM + ++ + A FL ++ Sbjct: 307 MCLVLDKSGSMAGS-------------------NRLNRMNQAAKHFLMQ-----TIENGS 342 Query: 253 YMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++G++ + + + T R + + + T +K A+Q+LT + Sbjct: 343 WVGMVHFDSTAYIKSKLIQITSSNERNKLLESLPTAASGGTSICRGIKSAFQVLTGTYPQ 402 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 + I+ LTDGE+N T C D+ +++ I I++ Sbjct: 403 IDGSE----------------IVLLTDGEDN-------TAGSCVDEVRQSGAIIHFIAL- 438 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADS---LIHVFQNI 405 Q +++ + H + + LI F + Sbjct: 439 GPSADQAVIEMSTITGGKHKYASDEAANNGLIDAFAAL 476 >gi|3929911|dbj|BAA34707.1| complement factor B/C2B [Cyprinus carpio] Length = 833 Score = 43.7 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 69/204 (33%), Gaps = 31/204 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + A + + L + + G+I Y + ++ + + + YV R + Sbjct: 355 ESFHLAKNATIQLVQKLDSYEVTMRFGIISYASEAKEIVSITNDLSQDVHYVMRKLHEFS 414 Query: 289 LK------PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T+ A+ + Y+ L ++ Q V II TDG +N Sbjct: 415 DKSHGNKRGTNLHDALNKVYEELALLRENKRSHFNETQNV----------IIIATDGYSN 464 Query: 343 NFKSNVNT----------IKICDKAKENFIKIVTISINASPNGQRLLKTCVS---SPEYH 389 S +N D KE + + ++ N Q L++ S + Sbjct: 465 MGPSPINILPKIRNLFGYKSSVDHTKEELLDVYVFAVGQQVNKQE-LQSIASIKKDERHV 523 Query: 390 YNVVNADSLIHVF-QNISQLMVHR 412 + + + L VF Q IS V + Sbjct: 524 FVLKDYRQLGLVFNQMISDSAVTK 547 >gi|260823583|ref|XP_002606160.1| hypothetical protein BRAFLDRAFT_126487 [Branchiostoma floridae] gi|229291499|gb|EEN62170.1| hypothetical protein BRAFLDRAFT_126487 [Branchiostoma floridae] Length = 515 Score = 43.7 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 32/225 (14%), Positives = 75/225 (33%), Gaps = 46/225 (20%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + + L + + P + V+D SGS+ +K+ Sbjct: 159 IDAVNELEQESECKIPGMDMVFVLDGSGSVGAD--------------------NFETVKD 198 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKP 291 ++ +D ++ + +G++ Y+ V+ + V+ ++ + Sbjct: 199 FVVSVVDGFEI---GQSRTRIGVVQYSDEVQNEFNLTEYGNKADVQSAISNIT--YLQGR 253 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A++ + S++ G + P K I +TDGE + + Sbjct: 254 TYTGAALRYMTDVSFSEEA----------GARPPYQAIPKVGIVVTDGEATDNVQGPAS- 302 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVN 394 A E + + I I + L+ + P+ + + V N Sbjct: 303 ----SAHEAGVNVFAIGIGGYDVRE--LRQIATDPDATHVFAVDN 341 >gi|167757049|ref|ZP_02429176.1| hypothetical protein CLORAM_02598 [Clostridium ramosum DSM 1402] gi|167703224|gb|EDS17803.1| hypothetical protein CLORAM_02598 [Clostridium ramosum DSM 1402] Length = 965 Score = 43.7 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 47/278 (16%), Positives = 92/278 (33%), Gaps = 62/278 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV D+S SM + E NS K+ TK KNA + FL++ + + K Sbjct: 87 DIVLVFDISNSMA-----EDEHGNSTSSNDKKRLTK---AKNAAIEFLNNSKISGNKKN- 137 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++ + + E +Q + ++ + T+ + +A +L + K Sbjct: 138 -RYSIVTFNYYGTVEQNLTSNLETAKQAIRDVELGNNSDGGTNIQAGLYKARTVLKNAKS 196 Query: 311 RSFFTNFFRQGVKIPSLPFQK-----FII-----------------FLTDGE-------- 340 + G S +++ T GE Sbjct: 197 ENGIIILLSDGGATGSYKLNNERNNGYLVNDYSEATATDKALGYSGRYTFGENAINYDSV 256 Query: 341 ---------------NNNFKSNVNTIKICDK----AKENFIKIVTISINASPNGQRLLKT 381 N+++ N + + AK++ I TI + LK Sbjct: 257 IKGGRNDFTLDLYLNNSHYSLNNAAATLAENEALLAKKSGNTIFTIGYTTGSSVNSFLKN 316 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + E + ++ L +++NI+ +V +Y I+K Sbjct: 317 VATQGEGY-AYSSSSDLSGIYENIANEIVT-RYETIIK 352 >gi|326670350|ref|XP_001332841.4| PREDICTED: anthrax toxin receptor 1-like [Danio rerio] Length = 609 Score = 43.7 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 71/232 (30%), Gaps = 48/232 (20%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 P F + V+D SGS+ N V Sbjct: 26 TPEKGTAASSCYGGFDLYFVLDKSGSVQHHWNEIYNFVEHLAQ----------------- 68 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 +S + M I ++ + + ++ + E++R+ + D T Sbjct: 69 ------KFISP---QLRMSFIVFSDQGKILMQLTEDREQIRKGLKELQDVRPGGDTFMHE 119 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE--NNNFKSNVNTIKIC 354 +++A + + + T II LTDGE N+F Sbjct: 120 GIQRASEQIYYGNTEGYRTAS--------------VIIALTDGELHENHFYYAEREA--- 162 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA-DSLIHVFQNI 405 ++++ + + + Q L S ++ + V + ++L V +I Sbjct: 163 NRSRSLGASVYCVGVKDFNETQ--LAKIADSKDHVFPVNDGFEALQGVIDSI 212 >gi|156404155|ref|XP_001640273.1| predicted protein [Nematostella vectensis] gi|156227406|gb|EDO48210.1| predicted protein [Nematostella vectensis] Length = 1128 Score = 43.7 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 86/229 (37%), Gaps = 31/229 (13%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 +P + LVVD SGSM + ++ + R ++A L A + + Sbjct: 204 EAASPQPK-DVILVVDYSGSMGGSRLPIAKEAAKTVLDTLNPRDRVAFL--AFESGVRRV 260 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + S +D + + + K + +++++ + S T A A+ Sbjct: 261 KVTSGDAKDEKC----FESSLAKASPV--NIDILKKFLDGEYAS---GGTMYAIAFNAAF 311 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTIKICDK--AKE 359 IL + K + + I+F+TDG N++ + +NT+K ++ + + Sbjct: 312 DIL-------------DKYYKEKNTTRRPVILFMTDGAPNDDPGTILNTVKTRNQGLSTK 358 Query: 360 NFIKIVTISINASPNGQRLLKTCVS---SPEYHYNVVNADSLIHVFQNI 405 I + SP G LL++ + V +L V +++ Sbjct: 359 ADILTFGMGGGISPAGVDLLQSLAEQTLDGGARFEVSLTTALRDVSRHL 407 >gi|197099226|ref|NP_001126843.1| inter-alpha-trypsin inhibitor heavy chain H4 [Pongo abelii] gi|55732844|emb|CAH93116.1| hypothetical protein [Pongo abelii] Length = 896 Score = 43.7 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGK--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + K R + L T+ AM A Q+L S Sbjct: 311 --NLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDS 365 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 366 SNQ--------EERLPDGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|114048546|ref|YP_739096.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113889988|gb|ABI44039.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 625 Score = 43.7 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 71/237 (29%), Gaps = 54/237 (22%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 E + ++D+SGSM K+ L+ AL + + Sbjct: 228 SKAELGASNLVFLLDVSGSMA-------------------SPDKLPLLQTALKMLTQQLG 268 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAY 302 V VY G G + Q + ++ L T+ ++ AY Sbjct: 269 AQDKVSIVVYAGAAG--------VVLDGAAGNDSQTLNYALEQLSAGGSTNGAQGIQLAY 320 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-NTIKICDKAKENF 361 Q ++ + + +I TDG+ N +N+ I + K+ Sbjct: 321 Q-------------LAKKHLVEGGINR---VILATDGDFNVGTTNLDELIDLVSAQKQLG 364 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---------DSLIHVFQNISQLM 409 I + T+ L++ Y +++ + L I++ + Sbjct: 365 IGLTTLGFGMGDYNDHLMEQLADKGNGQYAYIDSLNEARKVLVEQLSATLLTIAKEV 421 >gi|239817564|ref|YP_002946474.1| outer membrane adhesin like proteiin [Variovorax paradoxus S110] gi|239804141|gb|ACS21208.1| outer membrane adhesin like proteiin [Variovorax paradoxus S110] Length = 1867 Score = 43.7 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 57/160 (35%), Gaps = 40/160 (25%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + +++DLSGSM + T+++ K A+ +D DL DV Sbjct: 1424 LMVILDLSGSMG-----------------QETPTRLSRAKEAIQNLIDGYDLY----GDV 1462 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + L+ ++T + +W T + + +++ + T+ A+ A ++ K Sbjct: 1463 RVQLVTFST--TGASQQAWMTAAEAKALVQNLQA--AGSTNYDAALAAAMNGFSATGKLD 1518 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 N FLTDGE N + Sbjct: 1519 GAQNVSY---------------FLTDGEPTLGDGNTAQLA 1543 >gi|237735881|ref|ZP_04566362.1| predicted protein [Mollicutes bacterium D7] gi|229381626|gb|EEO31717.1| predicted protein [Coprobacillus sp. D7] Length = 965 Score = 43.7 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 47/278 (16%), Positives = 92/278 (33%), Gaps = 62/278 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV D+S SM + E NS K+ TK KNA + FL++ + + K Sbjct: 87 DIVLVFDISNSMA-----EDEHGNSTSSNDKKRLTK---AKNAAIEFLNNSKISGNKKN- 137 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++ + + E +Q + ++ + T+ + +A +L + K Sbjct: 138 -RYSIVTFNYYGTVEQNLTSNLETAKQAIRDVELGNNSDGGTNIQAGLYKARTVLKNAKS 196 Query: 311 RSFFTNFFRQGVKIPSLPFQK-----FII-----------------FLTDGE-------- 340 + G S +++ T GE Sbjct: 197 ENGIIILLSDGGATGSYKLNNERNNGYLVNDYSEATATDKALGYSGRYTFGENAINYDSV 256 Query: 341 ---------------NNNFKSNVNTIKICDK----AKENFIKIVTISINASPNGQRLLKT 381 N+++ N + + AK++ I TI + LK Sbjct: 257 IKGGRNDFTLDLYLNNSHYSLNNAAATLAENEALLAKKSGNTIFTIGYTTGSSVNSFLKN 316 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + E + ++ L +++NI+ +V +Y I+K Sbjct: 317 VATQGEGY-AYSSSSDLSGIYENIANEIVT-RYETIIK 352 >gi|162454179|ref|YP_001616546.1| hypothetical protein sce5902 [Sorangium cellulosum 'So ce 56'] gi|161164761|emb|CAN96066.1| hypothetical protein sce5902 [Sorangium cellulosum 'So ce 56'] Length = 940 Score = 43.7 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 40/225 (17%), Positives = 78/225 (34%), Gaps = 53/225 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + P + LV+D SGSM P+ K K A + ++ I S Sbjct: 470 DMPSVAMALVMDRSGSM-----------TGLPLEMAKAAAKATAGVLSSDDLIEVIAFDS 518 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V M +P+ ++ + R T+ A+ AYQ +T Sbjct: 519 APTRYVKM-------------QPARNRSRIAGEIARIQ---PGGGTEIFSALDAAYQDMT 562 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENFIKIV 365 + R +K +I LTDG+ + ++ + I I + Sbjct: 563 VTQAR------------------KKHVILLTDGKASTGGIRDLVSAMI-----AESITVT 599 Query: 366 TISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 T+ + + ++LLK + V + ++L +F ++++ Sbjct: 600 TVGL-GNDLDEQLLKMIADVGGGRFHAVPDPNNLPRIFTKETEMV 643 >gi|310825891|ref|YP_003958248.1| hypothetical protein ELI_0266 [Eubacterium limosum KIST612] gi|308737625|gb|ADO35285.1| predicted protein [Eubacterium limosum KIST612] Length = 838 Score = 43.7 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 50/279 (17%), Positives = 83/279 (29%), Gaps = 79/279 (28%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV+D SGSM + P D ALK FL + + + Sbjct: 106 SIVLVLDNSGSMGWGSSPTPADYAR------------DALKEFANEFLKNGNSGN----- 148 Query: 252 VYMGLIGYTT-------------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 +GL+ Y++ + + + ++ Q V TD + Sbjct: 149 -KLGLVTYSSGSGVPIYDAYDEIKYVQGYSMTENSDIFGQVVDGLQ--TPSGETDVQMGI 205 Query: 299 KQAYQILTSDKK-----------------RSFFTNFFRQGVKIPSLPFQKFIIFLT---- 337 K A IL +D + G I F I +T Sbjct: 206 KTARDILAADTSGNPQFILVFSDGATNRSARPTSAGELGGANISPCTFGDKIYDMTFKFD 265 Query: 338 -------------DG-ENNNFKSNVNTIKICDKA---KENFIKIVTISINASP------- 373 DG N N +T+ +A K+ I I ++ + Sbjct: 266 GFDYGAQGSAVYNDGVYTTNGNVNAHTVAAVSEALLAKDQGIDIYSVFYHNPALNDLEYG 325 Query: 374 NGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVH 411 G ++K SS +Y + NA + +F I + + Sbjct: 326 AGVFVMKNSASSGQYTEISPGNAGAFAEIFTEIEKQIQE 364 >gi|302555134|ref|ZP_07307476.1| secreted protein [Streptomyces viridochromogenes DSM 40736] gi|302472752|gb|EFL35845.1| secreted protein [Streptomyces viridochromogenes DSM 40736] Length = 415 Score = 43.7 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 77/229 (33%), Gaps = 32/229 (13%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 ++F E + ++LV+D+SGSM +++MAA K Sbjct: 13 LAFPAGAAAGEPTGQDAPKVDLVLDVSGSMRAR--------------DIDGQSRMAAAKQ 58 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 A L E+V +G+ + + + + Y + P D Sbjct: 59 AFNEVL------DATPEEVELGIRTLGADYPGDDRKTGCKDTAQLY--------PVGPLD 104 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 T A K A LT + + K I+ ++DGE+ + + Sbjct: 105 RTEA-KTAVATLTPTGWTPIGPALLKAAGDLDGGNGSKRIVLISDGEDTCAPLDPCEVAR 163 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 AK + I T+ + + R L C++ + +V + D L Sbjct: 164 EIAAKGIGLTIDTLGLVPTAKLSRQLS-CIAEATGGTYTSVEHQDELTD 211 >gi|260785923|ref|XP_002588009.1| hypothetical protein BRAFLDRAFT_125403 [Branchiostoma floridae] gi|229273165|gb|EEN44020.1| hypothetical protein BRAFLDRAFT_125403 [Branchiostoma floridae] Length = 948 Score = 43.7 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 48/281 (17%), Positives = 94/281 (33%), Gaps = 29/281 (10%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 D + D + + A + G + + + L VD+SGSM Sbjct: 378 DGIEEKDIPTDGTVEYVLAPADTPTDDPSGASPGVIPGAQGV--VVLCVDVSGSMGVTTR 435 Query: 209 SDPEDVNSAPICQDKKR-----TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 P+ +DKK T++ + A+ L+ ++L + ++ ++ + + V Sbjct: 436 -VPQLQAEWKAMRDKKAGKEYVTRLECMTAAIARHLERLELDTPDRQV---AIVTFQSHV 491 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKP------------TDSTPAMKQAYQILTSDKKR 311 + + + V D LI DS +K+ L++ Sbjct: 492 QLFGDGRYPPTDVTGDTLNSYDGLISTGRHLAEKFELQPIKDSLATLKEKVCDLSAGGTT 551 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT-----IKICDKAKENFIKIVT 366 + + P + II TDG N +V KI D A+ + I I Sbjct: 552 ALGPALAVCAGVVADKPRSE-IILCTDGAANVGVGDVKKDPGFYKKIGDFARSHKIVISI 610 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I I + ++ N+++ L+ + I+Q Sbjct: 611 IGIEGENVALEQVSAAAATSGGTVNILHPLELVRQIRQIAQ 651 >gi|149181776|ref|ZP_01860267.1| hypothetical protein BSG1_01140 [Bacillus sp. SG-1] gi|148850517|gb|EDL64676.1| hypothetical protein BSG1_01140 [Bacillus sp. SG-1] Length = 949 Score = 43.7 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 44/284 (15%), Positives = 86/284 (30%), Gaps = 81/284 (28%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + ER + V D SGSM + + ++ ALK +L Sbjct: 59 VTPKGQATNEERKPIDVVFVHDTSGSMKDSFGGVKKATSAE-----------NALKESLR 107 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-----------RQYVT---- 281 F + + + Y +K I+P+ G + +V Sbjct: 108 FFNQNQQSKDKYFFVPFDSDVSYKNYGDKRIQPAEGLSDILPMAEHLDFSEAYWVKKYSW 167 Query: 282 -------RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 +++ + T+ T +++ A +++II Sbjct: 168 YYGYYWSQEIFDFSVGGTNYTQSLEYALSKF------------------SGMRDSKRYII 209 Query: 335 FLTDGENNNFKSNVN--TIKICDKAKE---------------------------NFIKIV 365 FLTDGE + + T+ A+ N +K+ Sbjct: 210 FLTDGEPTSLNHDNKQYTLYTNGTARAGNVNANYNDVQKFIHEKAVASAEKLGVNDVKMY 269 Query: 366 TISINASPN-GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +I+ +LL+ + N +SL +VF +IS+ Sbjct: 270 SIAFAEPGEVNYQLLENMSNKTGGRAIQANPNSLSNVFTDISKE 313 >gi|149187720|ref|ZP_01866017.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] gi|148838600|gb|EDL55540.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] Length = 340 Score = 43.7 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 78/222 (35%), Gaps = 40/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM K++ ++ A+K+ L F +E Sbjct: 95 DLMLVVDLSGSMS---------ERDFLDNSGKEQDRLTAVKSVLETFAAK-------REG 138 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + E + + + + T A+ + + + Sbjct: 139 DRLGLILFGDSAYLQSPFTADHEAWLALLDQAQVGMAGESTHLGDAVGLTIKTYIDNPEN 198 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +K I LTDG N+ S V I A+ I++ +++ + Sbjct: 199 Q---------------TVEKVAIILTDG--NDTDSLVPPIDAAKVAQAYGIRLYIVAMGS 241 Query: 372 S--PNGQRL----LKTCVS-SPEYHYNVVNADSLIHVFQNIS 406 Q + ++T + + + ++ + L V+Q IS Sbjct: 242 PNTTGDQAIDFSTIETMATVTGGQAFLAMSQEDLDAVYQTIS 283 >gi|296168868|ref|ZP_06850540.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896485|gb|EFG76135.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 335 Score = 43.7 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 74/234 (31%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D ++ A K A F D + + Sbjct: 99 VMLVIDVSESMAA---------------TDVPPDRLTAAKEAGKQFADELTPA------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + P+ V+ + + T + + A Q + + Sbjct: 138 NLGLVEFAANASLLVSPTTNRAAVKAAIDSLKPAPK---TATGEGLFTALQAIAT----- 189 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNTIKICDKAKENFIKIVTISI 369 + P I+ +DG N + + AK ++I TIS Sbjct: 190 -----VGSVMGGGDGPPPARIVLESDGAENVPLDPNAPQGAFTAARAAKAEGVQISTISF 244 Query: 370 N--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + Q L K C + ++ + +SL +V+ + + + Sbjct: 245 GTPYGTVEYEGATIPVPVDDQTLQKICEITDGQAFHADSLESLKNVYSTLQRQI 298 >gi|313200528|ref|YP_004039186.1| von willebrand factor type a [Methylovorus sp. MP688] gi|312439844|gb|ADQ83950.1| von Willebrand factor type A [Methylovorus sp. MP688] Length = 333 Score = 43.7 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 76/238 (31%), Gaps = 56/238 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I +V+D S SM N D + +K+ A + L F++S ++ V Sbjct: 86 IMVVLDRSASM----NDDFAGKYIQSTDAVQNPSKLKAAREVLAKFIESRQ-----QDLV 136 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 GLI ++T + + ++ + ++ + T + A Sbjct: 137 --GLITFSTSPVFVLPLTHDQVALKAALDSA-EAQGMGFTAVARGLGMALDYFKGKPVTG 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGE---NNNFKSNVNTIKICDKAKENFIKIV---- 365 + I+ ++DG ++ + + + + A +I I Sbjct: 194 A-----------------RLILLVSDGGAHIDSQTQDRIRSWFASEHAALAWIYIRGTNS 236 Query: 366 -TISINASPNGQRLLKTCVSSPEYH--------------YNVVNADSLIHVFQNISQL 408 +I + G S+PEY Y N +L +++ ++ Sbjct: 237 PSIFVQ-PGEGDDA----ASAPEYFLHEYFKTLGVPYQAYEAENPRALEAALRDVGKM 289 >gi|307102430|gb|EFN50705.1| hypothetical protein CHLNCDRAFT_28788 [Chlorella variabilis] Length = 344 Score = 43.7 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 43/241 (17%), Positives = 75/241 (31%), Gaps = 38/241 (15%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNS---DPEDVNSAPICQDKKRTKMAALKNAL 235 A L+ RP + L++D+SGSM + S D ++ TK+ K L Sbjct: 74 ASLQASDFRRPRLNLVLLLDVSGSMGESFMSYYYDAAGTQQNLTAEELNTTKIDVAKEVL 133 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTRDMDSLILKP 291 LD + V ++ ++TR S ++ + +DM + Sbjct: 134 SGVLDLLAPNDSV------AIVLFSTRACTPQPLSRVSCLDIPALQAQIDKDMHAT--SS 185 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A L +G+ + I+ +TD + N+ + Sbjct: 186 TSLSAGLDLAIAEL----------KKCSEGMSASLTDTENRIMVITDQQPNSGDYTTGGL 235 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSS-----PEYHYNVVNADS----LIHVF 402 K+ I T I G L S +Y+V L F Sbjct: 236 AA-RLRKDADDGIFTTIIGV---GLDLNSELAESISKVRGANYYSVHRPGEFRRRLTDEF 291 Query: 403 Q 403 Sbjct: 292 D 292 >gi|253998453|ref|YP_003050516.1| von Willebrand factor type A [Methylovorus sp. SIP3-4] gi|253985132|gb|ACT49989.1| von Willebrand factor type A [Methylovorus sp. SIP3-4] Length = 333 Score = 43.7 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 76/238 (31%), Gaps = 56/238 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I +V+D S SM N D + +K+ A + L F++S ++ V Sbjct: 86 IMVVLDRSASM----NDDFAGKYIQSTDAVQNPSKLKAAREVLAKFIESRQ-----QDLV 136 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 GLI ++T + + ++ + ++ + T + A Sbjct: 137 --GLITFSTSPVFVLPLTHDQVALKAALDSA-EAQGMGFTAVARGLGMALDYFKGKPVTG 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGE---NNNFKSNVNTIKICDKAKENFIKIV---- 365 + I+ ++DG ++ + + + + A +I I Sbjct: 194 A-----------------RLILLVSDGGAHIDSQTQDRIRSWFASEHAALAWIYIRGTNS 236 Query: 366 -TISINASPNGQRLLKTCVSSPEYH--------------YNVVNADSLIHVFQNISQL 408 +I + G S+PEY Y N +L +++ ++ Sbjct: 237 PSIFVQ-PGEGDDA----ASAPEYFLHEYFKTLGVPYQAYEAENPRALEAALRDVGKM 289 >gi|57524519|ref|NP_001004007.1| matrilin 3a [Danio rerio] gi|51330145|gb|AAH80220.1| Matrilin 3a [Danio rerio] gi|123233072|emb|CAM15633.1| novel protein similar to vertebrate matrilin 3 (MATN3) (zgc:101120) [Danio rerio] gi|158254363|gb|AAI54374.1| Matn3a protein [Danio rerio] Length = 337 Score = 43.7 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 65/175 (37%), Gaps = 28/175 (16%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDST 295 D +D L + + ++ Y + V+ K ++Q +TR + T + Sbjct: 87 LADMVDTLDVGPDATRVAVVNYASTVKIEFLLKSHLTKDTIKQAITRIEP--LAAGTMTG 144 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+K+A ++K G + S K I +TDG + V + Sbjct: 145 MAIKKAMDEAFTEK----------SGARPKSKNISKVAIIVTDG---RPQDQVEEVS--A 189 Query: 356 KAKENFIKIVTISI-NASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 A+ + I+I + + A +L+ S+P ++ + V + L F+ Sbjct: 190 AARASGIEIYAVGVDRADMRSLKLM---ASNPLEDHVFYVETYGVIEKLTSKFRE 241 >gi|95147674|ref|NP_001035616.1| complement factor B precursor [Bos taurus] gi|146345391|sp|P81187|CFAB_BOVIN RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; AltName: Full=EC-VMFB; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|86438491|gb|AAI12505.1| Complement factor B [Bos taurus] gi|296474252|gb|DAA16367.1| complement factor B precursor [Bos taurus] Length = 761 Score = 43.7 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 47/267 (17%), Positives = 92/267 (34%), Gaps = 43/267 (16%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 YD + E L + + + A GE+ ++V+D SGSM+ + D D Sbjct: 224 YDTPAEVAEAFL--SSLTETIEGVDAEDGHSPGEQQK--RKIVLDPSGSMNIYLVLDGSD 279 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 A KN L F++ + + GL+ Y T + I S Sbjct: 280 SVGAHNFTG--------AKNCLRDFIEKVASYGVKPKY---GLVTYATEPKVLIRVSDPK 328 Query: 274 EKVRQYVTRDMDSL------ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 +VT ++ + + T++ A+ + Y +++ + + T + V Sbjct: 329 SSEADWVTDQLNQINYADHKLKAGTNTKRALLEVYNMMSREVNQFKETWNRTRHV----- 383 Query: 328 PFQKFIIFLTDGENNNFKSNVNTI----KICDKAKEN--------FIKIVTISINASPNG 375 II +TDG +N V I + D + I + + + Sbjct: 384 -----IIIMTDGLHNMGGDPVTVIHDIRYLLDIGRNRKNPREDYLDIYVFGVGPLVNQEN 438 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVF 402 L + ++ + + ++L VF Sbjct: 439 INALASKKDKEKHVFKLQGMENLEDVF 465 >gi|169629808|ref|YP_001703457.1| hypothetical protein MAB_2724c [Mycobacterium abscessus ATCC 19977] gi|169241775|emb|CAM62803.1| Conserved hypothetical protein [Mycobacterium abscessus] Length = 336 Score = 43.3 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 53/180 (29%), Gaps = 37/180 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D ++ A K A F ++ + Sbjct: 100 VMLVIDVSRSMES---------------TDVAPNRLGAAKEAGKEF------ARNLTPGI 138 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y T V + + T + + A Q + + Sbjct: 139 NLGLIAY---AGTATVLVSPTTNRDATVNALDNLQLADRTATGEGIFTALQAIATVGAV- 194 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICD-KAKENFIKIVTISI 369 + P I+ ++DG+ N AK+ + I TI+ Sbjct: 195 ---------IGGGDKPPPARIVLMSDGKETVPSNPDNPKGAYTAARTAKDQQVPISTIAF 245 >gi|156616288|ref|NP_766515.2| collagen alpha-6(VI) chain isoform 2 [Mus musculus] Length = 1182 Score = 43.3 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 52/352 (14%), Positives = 102/352 (28%), Gaps = 64/352 (18%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRF---IKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 SN LG E+I A F + K+ + S Sbjct: 493 SNKPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAKKERGSKVPCHLVVLTNGMSRDSV 552 Query: 127 ISMTHMANNR--------LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + H + +N T + Y + + L N R + Sbjct: 553 LGPAHKLREENIRVHAIGVKEANQTQLREIAGEEKRVYYVHEFDALRNIRN-----QVVQ 607 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + E I +VD SGS+ + +KM K + Sbjct: 608 EICAEEACRDMKADIMFLVDSSGSIG-----------------PENFSKM---KMFMKNL 647 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + V +G++ ++ ++ + T + + +D + Sbjct: 648 VSKSQI---GADRVQIGVVQFSHENKEEFQL--NTFMSQSDIANAIDRMTHIG------- 695 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 T F + + K +KF+I +TDGE + + + Sbjct: 696 -----ETTLTGSALTFVSQYFSPDKGARPNVRKFLILITDGEAQDIVRDPAI-----ALR 745 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + I ++ + S Q L+ PE + V N + Q+I +V Sbjct: 746 KEGVIIYSVGVFGSNVTQ--LEEISGKPEMVFYVEN----FDILQHIEDDLV 791 >gi|117921993|ref|YP_871185.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117614325|gb|ABK49779.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 335 Score = 43.3 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 51/264 (19%), Positives = 79/264 (29%), Gaps = 46/264 (17%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + + L + L ++ E + ++VDLSGSM D Sbjct: 65 RGMLILSWMLIVSALAKP-SILGEVQTREDFGRDVLMLVDLSGSM---------DEADFT 114 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 T++ A KN L F+ GLI + T + Sbjct: 115 TADGSTLTRLNAAKNVLKTFIAK-RSGD------RFGLILFGDAAFIQTPF---TADQQV 164 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +++ ++ A Q F Q P Q+ +I LTD Sbjct: 165 WLSLLEEAQTG----------MAGQSTHLGDAIGLGIKVFEQN---PQPSEQQVMIVLTD 211 Query: 339 G-ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 G + +F V+ KI A IKI TI++ P V Sbjct: 212 GNDTGSFVEPVDAAKI---AAARGIKIYTIAMGDPT-------HVGEQPMDMEVVQRVSQ 261 Query: 398 LI--HVFQNISQLMVHRKYSVILK 419 L F I Q + + Y +I K Sbjct: 262 LTQARAFIAIDQAELDKAYQLIDK 285 >gi|239827908|ref|YP_002950532.1| hypothetical protein GWCH70_2571 [Geobacillus sp. WCH70] gi|239808201|gb|ACS25266.1| Ig domain protein group 2 domain protein [Geobacillus sp. WCH70] Length = 942 Score = 43.3 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 49/235 (20%), Positives = 83/235 (35%), Gaps = 56/235 (23%) Query: 129 MTHMANNRLDSSNNTIFYNM-DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + DS+N T+ + + Y + V+ IP R++ Sbjct: 24 TNIVFGESNDSNNATLDFTITSSQLEYAKPPNGDAQ------GRLDVTLIPK-GRVDNIV 76 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP + V D+SGSM K+ + K AL +D ++ Sbjct: 77 RPPIDVVFVFDVSGSM--------------------TPLKLQSAKYALQSAVDYFKANAN 116 Query: 248 VKEDVYMGLIGYTTRVEKNI--EPSWGTEKVRQYVTRDM----DSLILKPTDSTPAMKQA 301 + LI +++ V+ N G V+Q++ R D T+ T +++QA Sbjct: 117 PND--RFALIPFSSDVQYNKVVPFPTGAYDVKQHLERIANVANDLRAYGGTNYTQSLQQA 174 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICD 355 Q +D R +K+IIFLTDG + T K+C+ Sbjct: 175 -QSFFNDPTR------------------KKYIIFLTDGMPTVSIAKEPITYKVCE 210 >gi|332217050|ref|XP_003257666.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like [Nomascus leucogenys] Length = 941 Score = 43.3 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 44/222 (19%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 296 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 329 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 330 RFSIIGFSNRIKVWKDYLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 384 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I ++FLTDG+ +T+KI + +E + I TI Sbjct: 385 -----NKYVAHSDIGDRSVS-LVVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|327262912|ref|XP_003216267.1| PREDICTED: integrin alpha-1-like [Anolis carolinensis] Length = 1166 Score = 43.3 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 33/187 (17%), Positives = 74/187 (39%), Gaps = 26/187 (13%) Query: 234 ALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 ++ FL+S+ ++ + +G++ Y V V + + T Sbjct: 235 SVTEFLNSLLQNMNIGPQQTQVGIVQYGENVTHEFNL-NTYTTVEEVLVAAKKIGQRGGT 293 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A I T+ K+ R+G QK ++ +TDGE+++ I+ Sbjct: 294 RT----NTALGIDTARKEAFTEARGARRG-------VQKVMVVVTDGESHDNYRLGEVIQ 342 Query: 353 ICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVF 402 C+ + I+ +I+I N L +K+ S P ++ +NV + +L+ + Sbjct: 343 DCE---DENIQRFSIAILGHYNRGNLSTEKLVEEIKSIASEPTEKHFFNVSDELALLTIV 399 Query: 403 QNISQLM 409 + + + + Sbjct: 400 EALGERI 406 >gi|149200035|ref|ZP_01877061.1| hypothetical protein LNTAR_03619 [Lentisphaera araneosa HTCC2155] gi|149136908|gb|EDM25335.1| hypothetical protein LNTAR_03619 [Lentisphaera araneosa HTCC2155] Length = 307 Score = 43.3 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 72/232 (31%), Gaps = 47/232 (20%) Query: 181 LRIEMGERPIFL-IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 ++ + + I I++ +D SGSM + A+ F Sbjct: 77 IKGKPEDEKIISNIQICLDSSGSMRADFGGKN---------------RYEVAMQAVKEFT 121 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV---RQYVTRDMDSLILKPTDSTP 296 + +E GL + T + + T + ++ D S T+ Sbjct: 122 EY-------REGDAFGLTVFGTEYINWVPVTKDTSAIALATPFLAPDRMSKWFGGTNIAK 174 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ + Q L + II ++DG + + V+ + + Sbjct: 175 ALRGSQQQLLQ------------------QEDGDRMIILVSDGVSGSPNDTVDMAQ---E 213 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + N I I I + + + + V N +L F+ I ++ Sbjct: 214 LRNNKIVAYCIYIGSGNGSPEMNALAAITGGQVFGVNNPKALDETFRFIDKM 265 >gi|332221823|ref|XP_003260064.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 1 [Nomascus leucogenys] Length = 921 Score = 43.3 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 78/235 (33%), Gaps = 58/235 (24%) Query: 195 LVVDLSGSMHC--AMNSDPEDVNSAPICQDKKRTKMAALK-NALLLFLDSIDLLSHVKED 251 LV+D SGSM +N + + + + + + + ++ + + E Sbjct: 309 LVLDKSGSMGGYDRLNRMNQAAKHFLLQTVENGSWVGMVHFDTTATIVNKLIQIKGSDE- 367 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 R + + + L T +K A+Q++ + Sbjct: 368 -------------------------RNTLMAGLPTYALGGTSICSGIKYAFQVIGELHSQ 402 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 + ++ LTDGE+N S C D+ K++ + I++ Sbjct: 403 LDGSE----------------VVLLTDGEDNTASS-------CIDEVKQSGAIVHFIALG 439 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNA--DSLIHVFQNIS---QLMVHRKYSVILKG 420 ++ + + + ++ + Y A + LI F ++ + + + KG Sbjct: 440 SAADEAVIEMSNITGGSHFYASDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKG 494 >gi|156741949|ref|YP_001432078.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233277|gb|ABU58060.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 847 Score = 43.3 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 69/216 (31%), Gaps = 47/216 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ER + L++D S SM +K K A ++ +S+ Sbjct: 388 ERSDTTLLLIIDQSASMGPETGL----------------SKFTMAKEAAIMATESLRAED 431 Query: 247 HVKEDVYMGLIGYT--TRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQ 303 +G++ + TR + +P GT + R + +L L TD A++ Sbjct: 432 ------RIGVLAFDVSTRWVVDFQPV-GTGLSLADIQRRISTLPLGGGTDIYNALQTGLP 484 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L R + LTDG + + ++A+ I Sbjct: 485 ELARQPGRVRHA------------------VLLTDG-RSFTDDRQAYQALIEEARSRNIT 525 Query: 364 IVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSL 398 + TI+I + LL+T +Y + Sbjct: 526 LSTIAI-GTDADIDLLQTLARWGAGRYYFAAEPGDI 560 >gi|327269537|ref|XP_003219550.1| PREDICTED: matrilin-2-like [Anolis carolinensis] Length = 809 Score = 43.3 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 41/185 (22%), Positives = 67/185 (36%), Gaps = 24/185 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 L+ + L + +GLI Y + V+ K + T + A Sbjct: 42 ILNILQFLDIRPDVTRVGLIQYGSTVKNEFSLKTFARKQDMERAVRRMMYLSTGTMTGLA 101 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 ++ A I S+ + G + S + I+ +TDG + + I KA Sbjct: 102 IQFAVNIAFSETE----------GARPLSQNVPRVIMIVTDGRPQDPVAE-----IAAKA 146 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADS---LIHVFQNISQLMVHR 412 + + I I I + LK+ S P ++ + V N L VFQN +L V Sbjct: 147 RNSGILIFAIGVG--RVDMNTLKSIGSEPYEDHVFLVANFSQIETLTSVFQN--KLCVQH 202 Query: 413 KYSVI 417 SVI Sbjct: 203 LCSVI 207 Score = 42.9 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 65/180 (36%), Gaps = 19/180 (10%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 L +D L+ + +GL+ Y+T+V + + K + + A Sbjct: 600 VLGILDSLTISPKAARIGLLQYSTQVRTEFTLKQFSTATDMKKAVSQMKYMGKGSMTGLA 659 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 +KQ + ++ + + S + + TDG ++ + KA Sbjct: 660 LKQMTERSFTEAEGARHL----------SAKVPRVCVVFTDG-----RAQDEVSEWAAKA 704 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRKYS 415 K+ I + I I + + L+ S P ++ + + ++ + + + + M K S Sbjct: 705 KQRGITMYAIGIGKAIEEE--LREIASDPPEKHLFYAEDFSAMGEITEKLQKRMCEGKSS 762 >gi|78048779|ref|YP_364954.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037209|emb|CAJ24954.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 602 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 36/238 (15%), Positives = 80/238 (33%), Gaps = 54/238 (22%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 I + P + +VD+SGSM K+ L+++L L + + Sbjct: 225 IATADLPPANLVFLVDVSGSMDA-------------------PDKLPLLQSSLKLLVRQL 265 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQA 301 + L+ Y + P+ G + + + +D+L T ++ A Sbjct: 266 RAQD------RITLVTYAGNTSVVLPPTPGDQ--QGRIVEAIDALQSGGSTAGASGIELA 317 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-ICDKAKEN 360 Y+ +QG + I+ TDG+ N +N + +K + + + + Sbjct: 318 YKA-------------AQQGYLRGGINR---ILLATDGDFNVGVTNFDQLKGMVAEKRRS 361 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---------ADSLIHVFQNISQLM 409 I + T+ L++ + + Y ++ L I++ + Sbjct: 362 GIALSTLGFGTGNYNDNLMEQLADAGDGAYAYIDTALEARKVLTHELGATLATIARDV 419 >gi|254786708|ref|YP_003074137.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687355|gb|ACR14619.1| von Willebrand factor, type A [Teredinibacter turnerae T7901] Length = 767 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 76/223 (34%), Gaps = 56/223 (25%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+D+SGSM +DP + L+ + L + + E Sbjct: 50 DVRLVIDVSGSMK---RNDPNN-----------------LRQPAVDLLVQL-----LPEG 84 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY----QILTS 307 G+ + ++V + + TD A QA + Sbjct: 85 SRAGVWTFG-----------------KWVNMLVPHRDV--TDPWRATAQAKASEINSVGL 125 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG----ENNNFKSNVNTIKICDKA----KE 359 F+K II LTDG + + ++ +I D+ KE Sbjct: 126 FTNIGEALEKATFEGADGGAEFRKSIILLTDGMVDIDKSPEQNKREWRRIADEVIPRLKE 185 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + TI+++A+ + L K +++ +AD L+ +F Sbjct: 186 AGVTVHTIALSANADTNLLNKISLATGGMAEVAHSADDLMRIF 228 >gi|257058175|ref|YP_003136063.1| von Willebrand factor type A [Cyanothece sp. PCC 8802] gi|256588341|gb|ACU99227.1| von Willebrand factor type A [Cyanothece sp. PCC 8802] Length = 418 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 39/243 (16%), Positives = 75/243 (30%), Gaps = 49/243 (20%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 +NQ +Q+ ++ + + L++D SGSM + Sbjct: 16 VNQSSSQRQLAISLWATGEDSDRTLPINLGLILDRSGSM--------------------R 55 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT--RVEKNIEPSWGTEKVRQYVTR 282 M +K A +D + + +I + V + + V+ + R Sbjct: 56 AQAMETVKEAANYLVDGLGPDD------RLSVITFNHHAEVILPNQSVEDLQGVKNKINR 109 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 S T MK + K+ I LTDGEN Sbjct: 110 LTAS---GGTCIDEGMKLGIKEAALGKENRVSQ-----------------IFLLTDGENE 149 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + N +K+ A E I + T+ ++ N L + S+ + + + + F Sbjct: 150 HG-DNERCLKLAKVAAEYNITLNTLGFGSNWNQDILEQIADSAGGMLCYIEHPEQALTEF 208 Query: 403 QNI 405 + Sbjct: 209 SRL 211 >gi|183982301|ref|YP_001850592.1| membrane protein [Mycobacterium marinum M] gi|226701243|sp|B2HPD3|Y2288_MYCMM RecName: Full=UPF0353 protein MMAR_2288 gi|183175627|gb|ACC40737.1| membrane protein [Mycobacterium marinum M] Length = 335 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 71/234 (30%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D + +M A + A F D + + Sbjct: 99 VMLVIDVSQSM---------------RATDVEPNRMVAAQEAAKQFADELTPG------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ E + + + + T + A+ A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNREATKAALDKLQFADR---TATGEAIFTALQAIATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICD-KAKENFIKIVTISI 369 + P I+ +DG+ N AK+ + I TIS Sbjct: 194 ---------IGGGDTPPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISF 244 Query: 370 NAS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P +K S YN L V+ ++ Q + Sbjct: 245 GTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELNSVYASLQQQI 298 >gi|145593798|ref|YP_001158095.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303135|gb|ABP53717.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 319 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 74/237 (31%), Gaps = 51/237 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + VD+S SM D ++ A K + F + Sbjct: 83 RERATVMVAVDVSTSMLAG---------------DVDPDRLTAAKESARRF------VDG 121 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP-AMKQAYQILT 306 + ++ +GL+ + + P R+ + +D L+ T A+ +A Sbjct: 122 LPDEFNVGLVAFAGSAAVLVPPD----TDREALDEGIDRLVEGATGVQGTAIGEAINTSL 177 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K + K P I+ L+DG N + + A E + + T Sbjct: 178 GAVKALDG-----EAAKDPPPAR---IVLLSDGANTSGMDPMEAAA---DAVEMEVPVHT 226 Query: 367 ISI--------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I+ +GQ L + + V+ + L V+ +I + Sbjct: 227 IAFGTASGYVDRGGRPIQVPVDGQTLDAVARETGGQFHEAVSVEELRAVYDDIGSSV 283 >gi|51893456|ref|YP_076147.1| hypothetical protein STH2318 [Symbiobacterium thermophilum IAM 14863] gi|51857145|dbj|BAD41303.1| conserved hypothetical protein [Symbiobacterium thermophilum IAM 14863] Length = 414 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 67/225 (29%), Gaps = 50/225 (22%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 RP + VVD SGSM A + K AL +D + Sbjct: 36 APEGRPPLNLAAVVDRSGSMAGAA--------------------LYFTKQALRFLVDQMA 75 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--LILKPTDSTPAMKQA 301 + ++ Y +V V++ R + T+ + + Sbjct: 76 EED------RLAIVTYDDQVHVPFP---SQPVVQKDAVRLLVDGITAGGTTNLSGGLATG 126 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-ICDKAKEN 360 Q + R + ++ +TDG N ++ + + +E Sbjct: 127 MQQI-------------RPHAGPGRVSR---VLLMTDGLANVGVTDPDVLAGWARAWREK 170 Query: 361 FIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQN 404 + + T+ + + LL + + + N D + +FQ Sbjct: 171 GLAVSTMGV-GPHFSEDLLVALAEAGGGNFHYIANPDQIPRIFQE 214 >gi|269315863|ref|NP_001161395.1| collagen alpha-5(VI) chain precursor [Mus musculus] Length = 2640 Score = 43.3 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 81/247 (32%), Gaps = 46/247 (18%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 D R+ F ++ + + + + + + E I +VD SGS+ Sbjct: 625 DERVNFGQNFDSLKSIKN--EIVHRICSEKGCEDMKADIMFLVDSSGSIG---------- 672 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 T +K + + I + + +G++ ++ + + + + Sbjct: 673 ----------PTNFETMKTFMKNLVGKIQI---GADRSQVGVVQFSDYNREEFQLNKYST 719 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 Y D S I + T + A+ F N + K +KF+I Sbjct: 720 HEEIYAAIDRMSPINRNTLTGGAL--------------TFVNEYFDLSKGGRPQVRKFLI 765 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 LTDG+ + T + + I ++ + + Q L+ ++V N Sbjct: 766 LLTDGKAQDEVGGPAT-----ALRSKSVTIFSVGVYGANRAQ--LEEISGDGSLVFHVEN 818 Query: 395 ADSLIHV 401 D L + Sbjct: 819 FDHLKAI 825 Score = 40.2 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 52/344 (15%), Positives = 121/344 (35%), Gaps = 58/344 (16%) Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIVNSSRISM 129 + +S + L A F+ + S G + E+ + R Sbjct: 728 DRMSPINRNTLTGGALTFVNEYFDLSKGGRPQVRKFLILLTDGKAQDEVGGPATALRSKS 787 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + + +N + S + ++ +HL + K++ + AL + E Sbjct: 788 VTIFSVGVYGANRAQLEEISGDGSLVFHVENFDHL--KAIESKLIFRVCALHDCKRIE-- 843 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I V+D SGS+ P Q+ + L + + + Sbjct: 844 LLDIVFVLDHSGSIG-------------PREQESM----------MNLTIHLVKKADVGR 880 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + V +G + Y+ E + + + +++ R D+ T + A++ + +L + Sbjct: 881 DRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDT--GGETYTAKALQHS-NVLFT 937 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 ++ S T RQ +I +TDG ++ + ++ + ++ I I + Sbjct: 938 EEHGSRLTQNVRQ-----------LMIVITDG-VSHDRDKLDEAAR--ELRDKGITIFAV 983 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + Q L+T E +V N D L ++ + + + + Sbjct: 984 GVG--NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 1025 >gi|262073024|ref|NP_001159971.1| integrin alpha-2 [Bos taurus] gi|296475809|gb|DAA17924.1| integrin alpha-2 [Bos taurus] Length = 1179 Score = 43.3 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 61/190 (32%), Gaps = 30/190 (15%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +A+ FL+ + L MGLI Y + + + + Sbjct: 187 DAVKNFLEKFVQGLDIGPTKTQMGLIQYANNPRVVFNL-NTFKSKDEMIKATSQTFQYGG 245 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T++ A++ A S K ++ +TDGE+++ Sbjct: 246 DLTNTFKAIQYARDTAYSTAA-------------GGRPGATKVMVVVTDGESHDGSKLKA 292 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNADSLI 399 I C+K + I I++ N L +K S P + +NV + L+ Sbjct: 293 VIDQCNK---DNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERHFFNVSDEADLL 349 Query: 400 HVFQNISQLM 409 I + + Sbjct: 350 EKAGTIGEQI 359 >gi|254780135|ref|YP_003064548.1| hypothetical protein CLIBASIA_00070 [Candidatus Liberibacter asiaticus str. psy62] gi|254039812|gb|ACT56608.1| hypothetical protein CLIBASIA_00070 [Candidatus Liberibacter asiaticus str. psy62] Length = 408 Score = 43.3 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 8/54 (14%), Positives = 30/54 (55%) Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I I ++ + + + L+ C S P +Y + + ++++ + +++++ ++ +S Sbjct: 346 ITIFSVGFSPDQDTRYTLRQCASDPSKYYEINSDENVMPIAKSLARNVITNWFS 399 >gi|189082901|sp|A6H584|CO6A5_MOUSE RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; Flags: Precursor Length = 2640 Score = 43.3 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 81/247 (32%), Gaps = 46/247 (18%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 D R+ F ++ + + + + + + E I +VD SGS+ Sbjct: 625 DERVNFGQNFDSLKSIKN--EIVHRICSEKGCEDMKADIMFLVDSSGSIG---------- 672 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 T +K + + I + + +G++ ++ + + + + Sbjct: 673 ----------PTNFETMKTFMKNLVGKIQI---GADRSQVGVVQFSDYNREEFQLNKYST 719 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 Y D S I + T + A+ F N + K +KF+I Sbjct: 720 HEEIYAAIDRMSPINRNTLTGGAL--------------TFVNEYFDLSKGGRPQVRKFLI 765 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 LTDG+ + T + + I ++ + + Q L+ ++V N Sbjct: 766 LLTDGKAQDEVGGPAT-----ALRSKSVTIFSVGVYGANRAQ--LEEISGDGSLVFHVEN 818 Query: 395 ADSLIHV 401 D L + Sbjct: 819 FDHLKAI 825 Score = 40.2 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 52/344 (15%), Positives = 121/344 (35%), Gaps = 58/344 (16%) Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIVNSSRISM 129 + +S + L A F+ + S G + E+ + R Sbjct: 728 DRMSPINRNTLTGGALTFVNEYFDLSKGGRPQVRKFLILLTDGKAQDEVGGPATALRSKS 787 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + + +N + S + ++ +HL + K++ + AL + E Sbjct: 788 VTIFSVGVYGANRAQLEEISGDGSLVFHVENFDHL--KAIESKLIFRVCALHDCKRIE-- 843 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I V+D SGS+ P Q+ + L + + + Sbjct: 844 LLDIVFVLDHSGSIG-------------PREQESM----------MNLTIHLVKKADVGR 880 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + V +G + Y+ E + + + +++ R D+ T + A++ + +L + Sbjct: 881 DRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDT--GGETYTAKALQHS-NVLFT 937 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 ++ S T RQ +I +TDG ++ + ++ + ++ I I + Sbjct: 938 EEHGSRLTQNVRQ-----------LMIVITDG-VSHDRDKLDEAAR--ELRDKGITIFAV 983 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + Q L+T E +V N D L ++ + + + + Sbjct: 984 GVG--NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 1025 >gi|326335929|ref|ZP_08202106.1| aerotolerance-related membrane protein [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691893|gb|EGD33855.1| aerotolerance-related membrane protein [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 348 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 38/262 (14%), Positives = 79/262 (30%), Gaps = 72/262 (27%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++E +R I +D+S SM D K +++ K + ++ Sbjct: 81 KLETVKREGVDIVFAIDVSKSMLAE---------------DVKPSRIEKAKRIISELIEL 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQ 300 + + I Y + + + + ++ D L + T A++ Sbjct: 126 L-------HGDRIAFIPYAAQAYPQLPLTSDYSAAKIFLEAINTDMLSSQGTAIGEAIQT 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + S K +I L+DGE ++ + I+ + KE Sbjct: 179 AINYFEDSNQSS------------------KILIILSDGE-DHQQGATEMIQ---EVKEK 216 Query: 361 FIKIVTISINASPN--------------------------GQRLLKTCVSS-PEYHYNVV 393 I+I TI + + Q LL+ +++ Sbjct: 217 GIRIFTIGLGTTQGTTIPIKENGQTFPKRDKDGEVVITKLNQALLEEIAQEGDGKYFDGS 276 Query: 394 NADSLIHVFQNISQLMVHRKYS 415 N +I Q + + +Y Sbjct: 277 NTQQVIDNLQKVLNNIEKNEYE 298 >gi|315052466|ref|XP_003175607.1| U-box domain-containing protein [Arthroderma gypseum CBS 118893] gi|311340922|gb|EFR00125.1| U-box domain-containing protein [Arthroderma gypseum CBS 118893] Length = 740 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 71/193 (36%), Gaps = 13/193 (6%) Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 +SS + ++++ + I + N+ + IVS P L Sbjct: 3 TSSSDDGFEIIEDHQIPLRPANAGTTAVTENLTQEPILTIHSIPNK--DSMIVSIQPPLK 60 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 I LV+D+SGSM NS +D + + K+A +++ Sbjct: 61 PENDVPHVPCDIVLVIDISGSM----NSSAPIPTGERGGEDTGLSILDLTKHAARTIIET 116 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQ 300 ++ + ++ + T V+ E + ++ + V R ++ L T+ +K+ Sbjct: 117 LNEND------RLAVVTFCTEVKVAFELDFMNKENKSMVLRAINKLYGTSSTNLWHGIKE 170 Query: 301 AYQILTSDKKRSF 313 ++LT+ R Sbjct: 171 GLKVLTATPVREN 183 >gi|163749961|ref|ZP_02157205.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161330235|gb|EDQ01216.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 648 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 38/239 (15%), Positives = 76/239 (31%), Gaps = 54/239 (22%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++ + + ++D+SGSM R K+ LK +L + Sbjct: 234 EVDKSQLGASNLVFLLDVSGSM-------------------NSRDKLPLLKTSLKMLSQQ 274 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQ 300 + HV VY G G + Q + + ++SL T+ ++Q Sbjct: 275 LSEQDHVSIVVYAGASG--------VVLDGVKGNDTQAINQALNSLKAGGSTNGGAGIQQ 326 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-E 359 AY+ ++ + +I TDG+ N ++ + AK + Sbjct: 327 AYR-------------LAQKHFIQGGVNR---VILATDGDFNVGTTDHQALMDLIAAKRD 370 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---------DSLIHVFQNISQLM 409 I + T+ L++ HY ++ D L I++ + Sbjct: 371 QGIALTTLGFGQGNYNDHLMEQLADKGNGHYAYIDTLNEARKVLVDELSSTLLTIAKDV 429 >gi|26344185|dbj|BAC35749.1| unnamed protein product [Mus musculus] Length = 1182 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 52/352 (14%), Positives = 102/352 (28%), Gaps = 64/352 (18%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRF---IKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 SN LG E+I A F + K+ + S Sbjct: 493 SNKPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAKKERGSKVPCHLVVLTNGMSRDSV 552 Query: 127 ISMTHMANNR--------LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + H + +N T + Y + + L N R + Sbjct: 553 LGPAHKLREENIRVHAIGVKEANQTQLREIAGEEKRVYYVHEFDALRNIRN-----QVVQ 607 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + E I +VD SGS+ + +KM K + Sbjct: 608 EICAEEACRDMKADIMFLVDSSGSIG-----------------PENFSKM---KMFMKNL 647 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + V +G++ ++ ++ + T + + +D + Sbjct: 648 VSKSQI---GADRVQIGVVQFSHENKEEFQL--NTFMSQSDIANAIDRMTHIG------- 695 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 T F + + K +KF+I +TDGE + + + Sbjct: 696 -----ETTLTGSALTFVSQYFSPDKGARPNVRKFLILITDGEAQDIVRDPAI-----ALR 745 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + I ++ + S Q L+ PE + V N + Q+I +V Sbjct: 746 KEGVIIYSVGVFGSNVTQ--LEEISGKPEMVFYVEN----FDILQHIEDDLV 791 >gi|89068023|ref|ZP_01155440.1| type II/IV secretion system protein, TadC subfamily protein [Oceanicola granulosus HTCC2516] gi|89046262|gb|EAR52319.1| type II/IV secretion system protein, TadC subfamily protein [Oceanicola granulosus HTCC2516] Length = 987 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 17/132 (12%), Positives = 46/132 (34%), Gaps = 6/132 (4%) Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 ++ + T + QA L + F + + I+ L++G + + Sbjct: 143 NVAVGMTADRNRLDQAISGLRAWGATRLNDAVFASAGALAGAEGRGAIVLLSEGPDADPS 202 Query: 346 SNVNTIKICDKAKEN----FIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIH 400 ++ + A + + + LL+ ++ ++ V +A +L Sbjct: 203 GAPLSVVDSEAALAAVVESGAPVYAVGL-GPGADAALLRRLAEATGGAYFPVADAAALPA 261 Query: 401 VFQNISQLMVHR 412 F +I+ + H+ Sbjct: 262 TFSDIATRLRHQ 273 >gi|218245149|ref|YP_002370520.1| von Willebrand factor type A [Cyanothece sp. PCC 8801] gi|218165627|gb|ACK64364.1| von Willebrand factor type A [Cyanothece sp. PCC 8801] Length = 418 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 39/243 (16%), Positives = 75/243 (30%), Gaps = 49/243 (20%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 +NQ +Q+ ++ + + L++D SGSM + Sbjct: 16 VNQSSSQRQLAISLWATGEDSDRTLPINLGLILDRSGSM--------------------R 55 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT--RVEKNIEPSWGTEKVRQYVTR 282 M +K A +D + + +I + V + + V+ + R Sbjct: 56 AQAMETVKEAANYLVDGLGPDD------RLSVITFNHHAEVILPNQSVEDLQGVKNKINR 109 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 S T MK + K+ I LTDGEN Sbjct: 110 LTAS---GGTCIDEGMKLGIKEAALGKENRVSQ-----------------IFLLTDGENE 149 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + N +K+ A E I + T+ ++ N L + S+ + + + + F Sbjct: 150 HG-DNERCLKLAKVAAEYNITLNTLGFGSNWNQDILEQIADSAGGMLCYIEHPEQALTEF 208 Query: 403 QNI 405 + Sbjct: 209 SRL 211 >gi|325689280|gb|EGD31286.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK115] Length = 471 Score = 43.3 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 60/177 (33%), Gaps = 30/177 (16%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + I V D SGSM +++ E VN K R+++ L+ + + + + Sbjct: 199 QANVAISFVFDASGSMEFSLD-GTEKVNPYSNNPLKNRSRIDILREKTKKMMADLQPIGN 257 Query: 248 VKEDVYMGLIGYTTR----VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 V + L+ + + + IE G + + + T+ ++ Sbjct: 258 VS----VNLVQFNSHASFVQQNFIELDKGLTSINSAID---NLNPEHATNPGDGLRYGMV 310 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 L S+ + K+++ LTDG N++ +K + Sbjct: 311 SLQSNAAQL------------------KYVVLLTDGVPNSYMVGPQYNYWGNKIETR 349 >gi|302560610|ref|ZP_07312952.1| von Willebrand factor type A domain-containing protein [Streptomyces griseoflavus Tu4000] gi|302478228|gb|EFL41321.1| von Willebrand factor type A domain-containing protein [Streptomyces griseoflavus Tu4000] Length = 449 Score = 43.3 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 36/175 (20%), Positives = 56/175 (32%), Gaps = 39/175 (22%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 I S + A GE P + L+VD SGSM D TKM + Sbjct: 43 IGSAVTAPHLYSPGEGPSAAVVLMVDCSGSM------------------DYPPTKMRNAR 84 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 +A + +++ V +I G V + V L + Sbjct: 85 DATAA------AIDALRDGVRFAVI--------------GGTHVAKEVYPGGGGLAVAGP 124 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ-KFIIFLTDGENNNFKS 346 + KQA + L++ + T + S + I LTDG N + Sbjct: 125 GTREQAKQALRKLSAGGGTAIGTWLGLADRLLASADVAIRHGILLTDGRNEHESQ 179 >gi|254409659|ref|ZP_05023440.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196183656|gb|EDX78639.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 413 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 39/240 (16%), Positives = 77/240 (32%), Gaps = 45/240 (18%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 NQ +Q+ +S + + + L++D SGSMH Sbjct: 17 NQPSSQRQLSIAIRAITQSQDQSLPLNLCLILDHSGSMHGR------------------- 57 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K A + ++ + + +I + R K + P+ + + ++ Sbjct: 58 -PLETVKKAAMQLIERLKEGD------RICVIAFDHR-AKVLVPNQAIDNLNTIKSQIRQ 109 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T +K + + K + + LTDGEN + Sbjct: 110 LSADGGTAIDEGLKLGIEEVAKGKADAVSQ-----------------VFLLTDGENEHG- 151 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 N +K+ A E+ + I T+ AS N L K S + + + F + Sbjct: 152 DNERCLKLAHFAVEHKLTINTLGFGASWNQDVLEKIADSGSGTLCYIEQPEQAVQEFGRL 211 >gi|229495742|ref|ZP_04389470.1| BatA protein [Porphyromonas endodontalis ATCC 35406] gi|229317316|gb|EEN83221.1| BatA protein [Porphyromonas endodontalis ATCC 35406] Length = 325 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 34/236 (14%), Positives = 65/236 (27%), Gaps = 63/236 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D SGSM D + + A F+ + Sbjct: 86 IDIMLAIDASGSMMAM---------------DLQPNRFVAAVEVAQKFIGN----RPNDN 126 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + + ++ + T + A L Sbjct: 127 ---IGLVMFAGESFTQCPLTTDHATLLNRLSEVEIGYLEDGTAIGLGIATACNRLKESHA 183 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 +S K I+ LTDG NN + A+ I+I T+++ Sbjct: 184 KS------------------KIIVLLTDGTNNAGSIAPSMAA--SLAESLGIRIYTVAVG 223 Query: 370 ---NAS----------------PNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 A + LK ++ ++ + +SL ++ I Sbjct: 224 TRGEAPYPHATAFGTVIDNVKVEIDEASLKEIAQTTGGSYFRATDNESLNQIYDEI 279 >gi|198426775|ref|XP_002120099.1| PREDICTED: similar to SD03168p [Ciona intestinalis] Length = 1474 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 44/254 (17%), Positives = 88/254 (34%), Gaps = 55/254 (21%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 ++ RI+ E P I ++D SGS+ + +K + Sbjct: 155 SISRIDPFECPKVDILFLLDGSGSIV--------------------ESDFEIMKEWIENI 194 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTP 296 S D+ S+ V +GL+ ++ E G ++ + M ++ + T + Sbjct: 195 TLSFDISSNGS--VAVGLMQFSHFSLTKTEFQIGMFTTKEEIMAAMKNVTIKKGNTYTAD 252 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++++ F++ + +K I+ LTDGE + S +T D Sbjct: 253 ALRRS-------------IAVFQKSSRYNDTNTRKVIVLLTDGEATDTASLSSTA---DL 296 Query: 357 AKENFIKIVTISI----------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + I I + I A Q ++ +P + V +L V + I Sbjct: 297 VRSQGITITAVLITEKVLPSERSAAVAQMQLIVNGVAGNPSGVFVVGTTANLDSVIRAI- 355 Query: 407 QLMVHRKYSVILKG 420 ++ L+G Sbjct: 356 ----TQRIQSTLEG 365 >gi|219849077|ref|YP_002463510.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219543336|gb|ACL25074.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 546 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 77/235 (32%), Gaps = 43/235 (18%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 I A+ R I LVVD SGSM K+ +K Sbjct: 353 TAEVIVAIKNSWSLNRKRADIVLVVDTSGSMEGD--------------------KLTMVK 392 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + FL I +GLI + + + + ++ T Sbjct: 393 AGIETFLMRILPED------RLGLITFASAARLVVPMAPLSDNRIALQDAVQAMRASGRT 446 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 A+ Q+L + + + I+ L+DG +N+ +++++ I+ Sbjct: 447 ALFDALVLGKQVL--------------EQLPPADDDRIRAIVLLSDGADNSSQASLDQIR 492 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 E+ I I ++ + +++L V + + +F+N+S+ Sbjct: 493 TLFD--ESGISIFPVAY-GNDADRQVLDAIAEFSRTIVVVGDTGDIAQIFENLSR 544 >gi|162452621|ref|YP_001614988.1| hypothetical protein sce4345 [Sorangium cellulosum 'So ce 56'] gi|161163203|emb|CAN94508.1| hypothetical protein sce4345 [Sorangium cellulosum 'So ce 56'] Length = 521 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 76/220 (34%), Gaps = 44/220 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D SGSM K T++ A + ++ V Sbjct: 108 LSLVIDRSGSM--------------------KGTRLTNAVQAATTAVSRLNDGDVVS--- 144 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 ++ + TR + P+ + R + + + L D S Sbjct: 145 ---VVTFDTRTSVVVPPTTVGPETRGRILASVRGISLGG----------------DTCIS 185 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT-IKICDKAKENFIKIVTISINA 371 + S + ++ L+DG+ N+ +V + +A++ + I TI ++ Sbjct: 186 CGIEEGLSLLGQTSAGVSRMLV-LSDGDANHGVRDVPGFRAMAQRARDRGVAITTIGVDV 244 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 N + L + S HY V N +L +F+ ++ + Sbjct: 245 DYNEKILSAIALDSNGRHYFVENDAALARIFEAEAEQLTT 284 >gi|253701051|ref|YP_003022240.1| von Willebrand factor A [Geobacter sp. M21] gi|251775901|gb|ACT18482.1| von Willebrand factor type A [Geobacter sp. M21] Length = 331 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 48/276 (17%), Positives = 83/276 (30%), Gaps = 66/276 (23%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPI------FLIELVVDLSGSMHCAMNSDPEDV 214 + LL + I AL R + ER + L +DLS SM Sbjct: 51 LARLLPYLRLAVLALGIAALARPQAVERESRVRSRGMDLVLALDLSTSMLAEEQGR---- 106 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +++AA K L F+ ++D +GL+ + R + + Sbjct: 107 ------EGRGESRLAAAKRVLAGFIG-------GRKDDRIGLVAFAGRPYPAAPLTSDHQ 153 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ V R + T A+ L S + +I Sbjct: 154 WLQGVVDRLDTGAVEDGTALGDAILSGVNRLRRRPAES------------------RALI 195 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS----------INASPNGQ-------- 376 +TDG NN + AK I++ I + + G Sbjct: 196 LITDGRNNAGAEPQLAAQ---AAKALGIRVHAIGIGSRGSAVIPVPSPLGGTIYRRLDAE 252 Query: 377 ---RLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 LK + ++ +A L VF I +L Sbjct: 253 LDAATLKGVAELTGGRYFEAGDATVLSRVFAEIDRL 288 >gi|146343040|ref|YP_001208088.1| hypothetical protein BRADO6230 [Bradyrhizobium sp. ORS278] gi|146195846|emb|CAL79873.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 519 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 59/216 (27%), Gaps = 45/216 (20%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ------YV 280 ++ A+ A+ + + V +GL + + + Y Sbjct: 292 RLDAVGYAVNQLFTTANTTKKVANQFRIGLYPFIRYLYSYYPLTTNISGSTSDSSTINYA 351 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 ++ +L+ T+++ + S T+ +LP ++ +TDG Sbjct: 352 AANLATLLDTNTNASLGSGGTHIDTALSSVNSLITSVGDGSATTNTLP---YVFLVTDGA 408 Query: 341 N--------NNFKSNVNTIKI------CDKAKENFIKIVTI--------SINASPNGQR- 377 N S N C K I I + +NAS G Sbjct: 409 QDPQVKGVPNGSWSGSNHATTINPTTSCTPLKNRGIIISVLYIPYQTINPVNASFAGDED 468 Query: 378 ------------LLKTCVSSPEYHYNVVNADSLIHV 401 L+ C SP + Y + Sbjct: 469 DYANNNIPNIPPSLQACA-SPGFFYTANTPADITSA 503 >gi|1708567|sp|P53710|ITA2_BOVIN RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|439696|gb|AAB59255.1| integrin alpha 2 subunit [Bos taurus] Length = 1170 Score = 43.3 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 61/190 (32%), Gaps = 30/190 (15%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +A+ FL+ + L MGLI Y + + + + Sbjct: 178 DAVKNFLEKFVQGLDIGPTKTQMGLIQYANNPRVVFNL-NTFKSKDEMIKATSQTFQYGG 236 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T++ A++ A S K ++ +TDGE+++ Sbjct: 237 DLTNTFKAIQYARDTAYSTAA-------------GGRPGATKVMVVVTDGESHDGSKLKA 283 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNADSLI 399 I C+K + I I++ N L +K S P + +NV + L+ Sbjct: 284 VIDQCNK---DNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERHFFNVSDEADLL 340 Query: 400 HVFQNISQLM 409 I + + Sbjct: 341 EKAGTIGEQI 350 >gi|332970976|gb|EGK09950.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 441 Score = 43.3 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 70/224 (31%), Gaps = 28/224 (12%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + E + +++D SGSM ++ KM K A+ F Sbjct: 123 EVNGPEEKQHNVTILLDASGSMAARVSGGE---------------KMQVAKEAVRSFTSQ 167 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + ++V VY G G ++ ++ + + + V + + P Sbjct: 168 MPEGTNVSLIVY-GHKGSNSKADQAES----CKGIEEIVELGPYNESTLQSKLDPIRATG 222 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + L N Q +K + I ++DG V K +++ Sbjct: 223 WTPLAGA------MNQAGQRLKETEGQAENVIYVVSDGLETCGGDPVKEAKSLNQS-NIK 275 Query: 362 IKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQN 404 + I + + LK + +++ + L F+N Sbjct: 276 ATVNIIGFDVGNKEHQALKKVAEAGGGKYFSATSKTELDLYFRN 319 >gi|308375589|ref|ZP_07444436.2| membrane protein [Mycobacterium tuberculosis SUMu007] gi|308345800|gb|EFP34651.1| membrane protein [Mycobacterium tuberculosis SUMu007] Length = 327 Score = 43.3 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 72/234 (30%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D + ++M A + A F D + + Sbjct: 91 VMLVIDVSQSM---------------RATDVEPSRMVAAQEAAKQFADELTPG------I 129 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ E + + + + T + A+ A Q + + Sbjct: 130 NLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR---TATGEAIFTALQAIATVGAV- 185 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICD-KAKENFIKIVTISI 369 + P I+ +DG+ N AK+ + I TIS Sbjct: 186 ---------IGGGDTPPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISF 236 Query: 370 NAS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P +K S YN L V+ ++ Q + Sbjct: 237 GTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELRAVYSSLQQQI 290 >gi|55377967|ref|YP_135817.1| von Willebrand factor type A like metal binding protein [Haloarcula marismortui ATCC 43049] gi|55230692|gb|AAV46111.1| von Willebrand factor type A like metal binding protein [Haloarcula marismortui ATCC 43049] Length = 394 Score = 43.3 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 58/198 (29%), Gaps = 48/198 (24%) Query: 177 IPALLRIEMG---ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + A + +E G E P I L +D SGSM + + Sbjct: 21 LTAEIEVEPGRLEEPPTRQIALCIDASGSMAGN--------------------DIEQARA 60 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDSLILKPT 292 +D Y+ +I + V + P+ WGT V D T Sbjct: 61 GAEWVFGLLDEDD------YVSIIAFDNEVTTVLAPTRWGTISRETAVDAVADISAGGGT 114 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTI 351 D + +A L + ++ L+DG +N++ T+ Sbjct: 115 DMYSGLLEAKASLQDLPTDDNTA---------------RRVLLLSDGKDNSHDPEAFGTL 159 Query: 352 KICDKAKENFIKIVTISI 369 + I+I I Sbjct: 160 AR--EIDTEGIRIKAAGI 175 >gi|146292146|ref|YP_001182570.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|145563836|gb|ABP74771.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 633 Score = 43.3 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 72/234 (30%), Gaps = 52/234 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 E + ++D+SGSM K+ L+ AL + + Sbjct: 240 SELGASNLVFLLDVSGSMA-------------------SADKLPLLQTALKMLTQQLSAQ 280 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 V VY G G ++ + ++ T+ + + QAYQ Sbjct: 281 DKVSIVVYAGAAG--VVLDGASG-----DDIQALTYALEQLRAGGSTNGSQGILQAYQ-- 331 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKI 364 ++ + +I TDG+ N +N I + +K K+ I + Sbjct: 332 -----------LAQKHFIQGGINR---VILATDGDFNVGVTNFDQLISLIEKEKQRGIGL 377 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNA---------DSLIHVFQNISQLM 409 T+ +L++ HY ++ D L I++ + Sbjct: 378 TTLGFGMDNYNDQLMEQLADKGNGHYAYIDTLNEARKVLVDELSSTLLTIAKDV 431 >gi|110639040|ref|YP_679249.1| BatA-like protein [Cytophaga hutchinsonii ATCC 33406] gi|110281721|gb|ABG59907.1| BatA-like protein, aerotolerance-related protein [Cytophaga hutchinsonii ATCC 33406] Length = 351 Score = 43.3 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 36/249 (14%), Positives = 71/249 (28%), Gaps = 61/249 (24%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + +D+S SM D ++ A K Sbjct: 100 KSNETNTQYTEGINMIFAIDVSESM---------------KITDIHPSRFDAAKQICTDI 144 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPA 297 ++ + + +G++ ++ + ++ + + + T A Sbjct: 145 INK-------RSNDRIGIVIFSGEAVTLSPLTNDYVLLKNQLNDLKQNKDLQSGTAIGTA 197 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A L + + + ++ I+ ++DGEN + + T D Sbjct: 198 LGTAINRLKNAETK------------------ERIIVLISDGENTSGLMDPITAA--DLC 237 Query: 358 KENFIKIV-----------------TISINASPNGQRLLKT-CVSSPEYHYNVVNADSLI 399 E IKI TI S + LK ++ Y + SL Sbjct: 238 LEYNIKIYCIGLGKDGTHQFKDDNGTIQYVESKLDENTLKNISATTKGKFYRAYDKKSLD 297 Query: 400 HVFQNISQL 408 V NI QL Sbjct: 298 DVIANIDQL 306 >gi|328945098|gb|EGG39253.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1087] Length = 471 Score = 43.3 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 60/177 (33%), Gaps = 30/177 (16%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + I V D SGSM +++ E VN K R+++ L+ + + + + Sbjct: 199 QANVAISFVFDASGSMEFSLD-GTEKVNPYSNNPLKNRSRIDILREKTKKMMADLQPIGN 257 Query: 248 VKEDVYMGLIGYTTR----VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 V + L+ + + + IE G + + + T+ ++ Sbjct: 258 VS----VNLVQFNSHASFVQQNFIELDKGLTSINSAID---NLNPEHATNPGDGLRYGMV 310 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 L S+ + K+++ LTDG N++ +K + Sbjct: 311 SLQSNAAQL------------------KYVVLLTDGVPNSYMVGPQYNYWGNKIETR 349 >gi|302532683|ref|ZP_07285025.1| predicted protein [Streptomyces sp. C] gi|302441578|gb|EFL13394.1| predicted protein [Streptomyces sp. C] Length = 248 Score = 43.3 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 53/173 (30%), Gaps = 42/173 (24%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 A + P +ELV+D+SGSM +++MAA K A Sbjct: 106 PSTAAAADGPAKEPP-KVELVLDVSGSMRA--------------NDIDGQSRMAAAKQAF 150 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGY--TTRV----EKNIEPSWGTEKVRQYVTRDMDSLIL 289 LD++ V+ + Y R + GT + T Sbjct: 151 NEVLDAVP--DEVRLGIRTLGATYPGDDRALGCKDTKQLYPVGTVNRTEAKTAVATLAPT 208 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T PA++ A Q L + I+ +TDGE+ Sbjct: 209 GWTPIGPALQAAAQDLEGGNAT-------------------RRIVLITDGEDT 242 >gi|261418348|ref|YP_003252030.1| von Willebrand factor A [Geobacillus sp. Y412MC61] gi|319767693|ref|YP_004133194.1| von Willebrand factor type A [Geobacillus sp. Y412MC52] gi|261374805|gb|ACX77548.1| von Willebrand factor type A [Geobacillus sp. Y412MC61] gi|317112559|gb|ADU95051.1| von Willebrand factor type A [Geobacillus sp. Y412MC52] Length = 1077 Score = 43.3 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 53/157 (33%), Gaps = 46/157 (29%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP + V+D+SGSM K+ + K+AL ++ + Sbjct: 195 RPPIDVVFVMDVSGSM--------------------TAMKLQSAKSALQAAVNYFKSNYN 234 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDS----LILKPTDSTPAMKQAY 302 + LI ++ V + +G V + +++ T+ + A+ A Sbjct: 235 QND--RFALIPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAK 292 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T + +K+IIFLTDG Sbjct: 293 SYFTDPTR-------------------KKYIIFLTDG 310 >gi|126316414|ref|XP_001380743.1| PREDICTED: similar to integrin alpha 2 subunit [Monodelphis domestica] Length = 1214 Score = 43.3 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 74/227 (32%), Gaps = 31/227 (13%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQD-----KKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 D+S S +NS V + P D + + + + L + Sbjct: 185 DVSPSFQ-LLNSFSPAVQTCPSFIDVVVVCDESNSIYPWSAVKNFLVKFVQGLDIGPDKT 243 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y S T K ++ + + I D T K Sbjct: 244 QVGLIQYGNYPRVVFNMS--TFKTKEEMVKATSQTIQHGGDLTNTFKA----------IQ 291 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 F F K ++ +TDGE+++ I C+ ++ I I++ Sbjct: 292 FAREFAYSEASGGRPSATKVMVVVTDGESHDGSFLKEVIGQCN---DDNILRFGIAVLGY 348 Query: 373 PNGQRL--------LKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 N L +K S+P Y +NV + D+L+ + + + Sbjct: 349 LNRNALDTKNLIKEIKAIASTPTERYFFNVSDEDALLEKAGTLGERI 395 >gi|166367777|ref|YP_001660050.1| hypothetical protein MAE_50360 [Microcystis aeruginosa NIES-843] gi|166090150|dbj|BAG04858.1| hypothetical protein MAE_50360 [Microcystis aeruginosa NIES-843] Length = 420 Score = 43.3 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 64/204 (31%), Gaps = 44/204 (21%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 NQ +Q+ + A ++ + LV+D SGSM Sbjct: 17 NQSSSQRQLMLSIAATSEQINTNLPINLCLVLDHSGSMQGK------------------- 57 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K A L ++S+ + + +I + R + + + + ++ Sbjct: 58 -PLETVKKAALSLIESLGVND------RLSVIAFDHRAKVILPSQSREDDLTLIRSKIQQ 110 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T +K Q S + I LTDGEN + Sbjct: 111 LQAGGGTAIDEGIKLG-----------------IQESSTGSKGYVSHIFLLTDGENEHG- 152 Query: 346 SNVNTIKICDKAKENFIKIVTISI 369 +N +K+ + A E I + T Sbjct: 153 NNQRCLKLAEVAAEYGITLNTFGF 176 >gi|301784617|ref|XP_002927724.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Ailuropoda melanoleuca] Length = 898 Score = 43.3 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 41/182 (22%), Positives = 64/182 (35%), Gaps = 44/182 (24%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + H Sbjct: 253 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQDHFS--- 289 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + VR TD A+++A ++L Sbjct: 290 ---IIGFSNRIKVWKDHLVSVTPDNVRDGKVYIHHMSPTGGTDINGALQRAIKLLNDYVA 346 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 + + IIFLTDG+ +T+KI + KE I I TI Sbjct: 347 HNDIEDRSVS-----------LIIFLTDGK--PTVGETHTLKILNNTKEAARGQICIFTI 393 Query: 368 SI 369 I Sbjct: 394 GI 395 >gi|281346829|gb|EFB22413.1| hypothetical protein PANDA_017530 [Ailuropoda melanoleuca] Length = 895 Score = 43.3 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 41/182 (22%), Positives = 64/182 (35%), Gaps = 44/182 (24%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + H Sbjct: 251 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQDHFS--- 287 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + VR TD A+++A ++L Sbjct: 288 ---IIGFSNRIKVWKDHLVSVTPDNVRDGKVYIHHMSPTGGTDINGALQRAIKLLNDYVA 344 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 + + IIFLTDG+ +T+KI + KE I I TI Sbjct: 345 HNDIEDRSVS-----------LIIFLTDGK--PTVGETHTLKILNNTKEAARGQICIFTI 391 Query: 368 SI 369 I Sbjct: 392 GI 393 >gi|183981216|ref|YP_001849507.1| hypothetical protein MMAR_1194 [Mycobacterium marinum M] gi|183174542|gb|ACC39652.1| conserved hypothetical protein [Mycobacterium marinum M] Length = 772 Score = 43.3 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 51/170 (30%), Gaps = 39/170 (22%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 E P + +V+D SGSM KM A + A +D + Sbjct: 301 AEPSSAPR-DVVVVLDRSGSMGGW--------------------KMVAARRAAGRIVDML 339 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D ++ + R+E G R S L S A Sbjct: 340 DAGD------RFCVLAFDDRIETPPAMPDGLVPASDR-NRFAASSWLGSLRSRGGTVMA- 391 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 Q LT+ + + RQ ++ +TDG+ + + ++ Sbjct: 392 QPLTNAVEMLADSGEDRQAS----------VVLVTDGQISGEDHLLRSLA 431 >gi|109094740|ref|XP_001104627.1| PREDICTED: matrilin-4-like, partial [Macaca mulatta] Length = 222 Score = 43.3 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 34 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 91 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 92 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 136 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 137 EGIAMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLRNLRSSI 186 >gi|30687725|ref|NP_850306.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|330254526|gb|AEC09620.1| C3HC4-type RING finger-containing protein [Arabidopsis thaliana] Length = 692 Score = 43.3 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 40/296 (13%), Positives = 103/296 (34%), Gaps = 60/296 (20%) Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 E Q + + S N+ + + I+ + + D R +F + + + Sbjct: 178 EQQLVFSGKSYSDALENNHPVRMMDLKIYPEVSAVPRADSREKFDVLVHLRAAAMVTGNA 237 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 +I R + V+D+SGSM TK+A LK A+ Sbjct: 238 NSLNNQISRYPRAPVDLVTVLDISGSMAG--------------------TKLALLKRAMG 277 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTRDMDSLILKPT 292 + ++ + +I +++ + + G ++ Q V + + T Sbjct: 278 FVIQNLGSND------RLSVIAFSSTARRLFPLTKMSDAGRQRALQAVNSVVAN---GGT 328 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVN- 349 + +++ +++ + ++ + II L+DG + +++ N Sbjct: 329 NIAEGLRKGVKVMEDRRDKNPVAS----------------IILLSDGRDTYTMNQADPNY 372 Query: 350 ------TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 ++ C+ K I + + S + L+ + + ++ + ++S+I Sbjct: 373 KLLLPLSMHGCES-KRFQIPVHSFGF-GSDHDASLMHSVSETSGGTFSFIESESVI 426 >gi|120599917|ref|YP_964491.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|120560010|gb|ABM25937.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] Length = 638 Score = 43.3 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 73/234 (31%), Gaps = 52/234 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 E + ++D+SGSM K+ L+ AL + + Sbjct: 245 SELGASNLVFLLDVSGSMA-------------------SADKLPLLQTALKMLTQQLSAQ 285 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 V VY G G ++ + ++ T+ + + QAYQ Sbjct: 286 DKVSIVVYAGAAG--VVLDGASG-----DDIQALTYALEQLRAGGSTNGSQGILQAYQ-- 336 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-TIKICDKAKENFIKI 364 ++ + +I TDG+ N +N + I + +K K+ I + Sbjct: 337 -----------LAQKHFIQGGINR---VILATDGDFNVGVTNFDLLISLIEKEKQRGIGL 382 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNA---------DSLIHVFQNISQLM 409 T+ +L++ HY ++ D L I++ + Sbjct: 383 TTLGFGMDNYNDQLMEQLADKGNGHYAYIDTLNEARKVLVDELSSTLLTIAKDV 436 >gi|28374313|gb|AAH45465.1| Matn1 protein [Danio rerio] Length = 507 Score = 43.3 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 78/215 (36%), Gaps = 28/215 (13%) Query: 197 VDLSGSMHCAM-NSDPEDVNSAPICQDK-KRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + + +M + N+ P DV + ++ +K + ID LS + + Sbjct: 39 LRQAAAMAAGLCNTKPTDVVFIVDSSRSVRPSEFEQVK---VFLAKVIDGLSVGPDATRV 95 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 G++ Y +RV+ + K + T + A++ A + S+ + Sbjct: 96 GVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVAFSEAEG--- 152 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 G K P + K I +TDG + N+ I +A+E I+I I + Sbjct: 153 ------GRKSPDIS--KVAIIVTDG---RPQDNIRDIA--ARAREAGIEIFAIGVG--RV 197 Query: 375 GQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 L+ S P ++ V + + L FQ Sbjct: 198 DMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 232 Score = 37.5 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 34/239 (14%), Positives = 78/239 (32%), Gaps = 34/239 (14%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 + I P + E ++ S S N+ + V + + Sbjct: 250 EHICISTPGSFKCACREGFTL-----MNDSRSCSACSNAATDVVFLIDGSKSVRPENFEL 304 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLI 288 +K + L +D +D +S V GL+ Y++ V++ +++ V R Sbjct: 305 VKKWINLIIDKLD-VSETNTHV--GLVQYSSTVKQEFPLGRHNSKRSLKEAVKR------ 355 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 M + + SF + + K I TDG + ++ + Sbjct: 356 ---------MNYMERGTMTGHALSFLVDNSFGPNQGARPGVPKVGIVFTDGRSQDYIGDA 406 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 KAK K+ + + + L+ S P ++++ + ++ + + + Sbjct: 407 A-----KKAKALGFKMYAVGVG--NAVEDELREIASEPIADHYFYTADFKTMNQIAKKL 458 >gi|149773091|emb|CAO01895.1| collagen type VI alpha 6 [Mus musculus] Length = 1162 Score = 43.3 bits (100), Expect = 0.079, Method: Composition-based stats. Identities = 52/352 (14%), Positives = 102/352 (28%), Gaps = 64/352 (18%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRF---IKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 SN LG E+I A F + K+ + S Sbjct: 473 SNKPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAKKERGSKVPCHLVVLTNGMSRDSV 532 Query: 127 ISMTHMANNR--------LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + H + +N T + Y + + L N R + Sbjct: 533 LGPAHKLREENIRVHAIGVKEANQTQLREIAGEEKRVYYVHEFDALRNIRN-----QVVQ 587 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + E I +VD SGS+ + +KM K + Sbjct: 588 EICAEEACRDMKADIMFLVDSSGSIG-----------------PENFSKM---KMFMKNL 627 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + V +G++ ++ ++ + T + + +D + Sbjct: 628 VSKSQI---GADRVQIGVVQFSHENKEEFQL--NTFMSQSDIANAIDRMTHIG------- 675 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 T F + + K +KF+I +TDGE + + + Sbjct: 676 -----ETTLTGSALTFVSQYFSPDKGARPNVRKFLILITDGEAQDIVRDPAI-----ALR 725 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + I ++ + S Q L+ PE + V N + Q+I +V Sbjct: 726 KEGVIIYSVGVFGSNVTQ--LEEISGKPEMVFYVEN----FDILQHIEDDLV 771 >gi|109000862|ref|XP_001094970.1| PREDICTED: cartilage matrix protein [Macaca mulatta] Length = 495 Score = 43.3 bits (100), Expect = 0.081, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 51/145 (35%), Gaps = 17/145 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I+ L +G++ Y + V++ K + T + A++ A Sbjct: 67 IESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFA 126 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 SD + G + S K +I +TDG + +V+ +A+ + Sbjct: 127 ITKALSDAE----------GGRSRSPDISKVVIVVTDGRPQDSVQDVS-----ARARASG 171 Query: 362 IKIVTISINASPNGQRLLKTCVSSP 386 +++ I + + L+ S P Sbjct: 172 VELFAIGVG--RVDKATLRQIASEP 194 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 83/223 (37%), Gaps = 40/223 (17%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKED 251 +NSD + N T + L + + F++ I D L + Sbjct: 251 GFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKL 310 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDK 309 +GL+ Y++ V + G ++ + + S + K T + A+K D Sbjct: 311 AQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----LIDN 363 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + K+ I+F TDG + ++ ++ KAK+ K+ + + Sbjct: 364 SFTVSSGARPGAQKVG-------IVF-TDGRSQDYINDAA-----KKAKDLGFKMFAVGV 410 Query: 370 -NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 NA + L+ S P E+++ + ++ + + + + + Sbjct: 411 GNAVEDE---LREIASEPVAEHYFYTADFKTINQIGKKLQKKI 450 >gi|327441394|dbj|BAK17759.1| uncharacterized protein containing a von Willebrand factor type A domain [Solibacillus silvestris StLB046] Length = 961 Score = 43.3 bits (100), Expect = 0.082, Method: Composition-based stats. Identities = 24/135 (17%), Positives = 41/135 (30%), Gaps = 19/135 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + L+ G +G+ V K + + + + + + +P Sbjct: 68 VTEVLSLMDDASSKDRFGFVGFNKEVTKELALTNNIVQAKSKLNEF-------GKNISPY 120 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 M S + S K I+ +T G N N + K+ KA Sbjct: 121 M---------ANDLSKGLEKAVDELTKKSTSNDKVIVIMTVG---NSIYNEVSKKLAAKA 168 Query: 358 KENFIKIVTISINAS 372 E I I TIS Sbjct: 169 YEEDITIHTISFGDP 183 >gi|291299307|ref|YP_003510585.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] gi|290568527|gb|ADD41492.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] Length = 626 Score = 43.3 bits (100), Expect = 0.082, Method: Composition-based stats. Identities = 33/234 (14%), Positives = 70/234 (29%), Gaps = 41/234 (17%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 A L+ +D SGSM + + TKM ++A+ Sbjct: 27 TAAPASAADNDGELLMA--LDASGSMEESDGAGN--------------TKMETARDAV-- 68 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ID+ + +GL Y G + + + D+ A Sbjct: 69 ----IDVAEAMPGHAKVGLRAY------------GPASTGSGCKASKELVPIDKIDA-DA 111 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A L + + + K I+ ++DGE + +K+ +K Sbjct: 112 ITTAATELKPEGDTPIAYSLEKAAGDFTEAKGPKTILLVSDGEETC---GGDPVKVAEKI 168 Query: 358 KENFI--KIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 + ++ I ++ L + +Y+ + +L + S+ Sbjct: 169 ASQGVDLRVHVIGFQVDDATRKQLTEIAKAGKGSYYDAQDGPALASRLKRASES 222 >gi|153010351|ref|YP_001371565.1| hypothetical protein Oant_3028 [Ochrobactrum anthropi ATCC 49188] gi|151562239|gb|ABS15736.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 605 Score = 43.3 bits (100), Expect = 0.082, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 31/83 (37%), Gaps = 15/83 (18%) Query: 347 NVNTIKICDKAKENFIKIVTISI------NASPNGQRLLKTCVSS---------PEYHYN 391 N K+CD AK I ++T+++ + LL++C S+ P + Sbjct: 521 NARFAKLCDNAKSANIIVMTVALDLSETNSTEKAQIDLLRSCSSNSRVRTESGRPAKLFW 580 Query: 392 VVNADSLIHVFQNISQLMVHRKY 414 L F+ I + + + Sbjct: 581 NSTGGELSETFRQIGDELSNLRI 603 >gi|114562255|ref|YP_749768.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] gi|114333548|gb|ABI70930.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] Length = 612 Score = 43.3 bits (100), Expect = 0.082, Method: Composition-based stats. Identities = 39/330 (11%), Positives = 98/330 (29%), Gaps = 47/330 (14%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSN----LSRLGDRFESISNHAKRALIDDAKRFIKN 100 D + + + + A ++VS+ +S +N +++ + Sbjct: 75 DLYQGSKELHRSAKVMTSSMAQRIVSSQHVSVSDRNFSLAPTTNDKFESVVQNGNMVAGE 134 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF 160 + S + + ++N ++ + + + Y + + + + Sbjct: 135 TPVSTFSIDVDTGSYSTTRRLINQGQLPTKNTVRVEELVNYFSYDYPVPTNSEQPFSVNT 194 Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + +++ + + + L++D+SGSM Sbjct: 195 ELAPSPYNADTQLLRIGLKGFDVAPDKLSASNLVLLLDVSGSMS---------------- 238 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 K+ LK A+L+ + V VY G G V + ++ + Sbjct: 239 ---SADKLPLLKQAMLMLSQQLSAQDKVSIVVYAGASG----VVLDGVAGNDFTAIKTAL 291 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 ++ T+ + ++ AYQ L +N +I TDG+ Sbjct: 292 SQL---NAQGGTNGSQGIQLAYQ-LAQKHFIENGSNR---------------VILATDGD 332 Query: 341 -NNNFKSNVNTIKICDKAKENFIKIVTISI 369 N + + + I + T+ Sbjct: 333 FNLGMTDHQQLVDFVASRSKKGIGLSTLGF 362 >gi|296268803|ref|YP_003651435.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296091590|gb|ADG87542.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 607 Score = 42.9 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 69/222 (31%), Gaps = 46/222 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + +V+D+SGSM + +TK+ K A + L + Sbjct: 419 VLMVIDVSGSMGADVPG-------------TGQTKLELAKQAAINALPQLGPND------ 459 Query: 253 YMGLIGYTTRVEKNI------EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +GL ++T + + + R + T + AY+ + Sbjct: 460 QVGLWMFSTNQDGGKDYRELVPMGRNNRDLLKK--RIQGLIPGGGTGLYDTTRAAYRTVL 517 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN---NNFKSNVNTIKICDKAKENFIK 363 ++ LTDG+N N+ ++ + + ++ Sbjct: 518 ERHSNDVINA----------------VVVLTDGKNEDDNSISLEDLLAELRTETGQETVR 561 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + TI+ + + L + + Y+ S+ VF + Sbjct: 562 VFTIAYGNDADLEVLRQISQVTDAAAYDSREPGSIDQVFTAV 603 >gi|78189841|ref|YP_380179.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] gi|78172040|gb|ABB29136.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] Length = 329 Score = 42.9 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 34/239 (14%), Positives = 67/239 (28%), Gaps = 53/239 (22%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 LR + L +D+S SM Q +++ + A + Sbjct: 86 TLRSTTAAARGIDMVLAIDISESMM--------------QSQTDTQSRFEIARQAARNVV 131 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + +GL+ + + T + T A+ Sbjct: 132 EQ-------RSNDRIGLVVFRGEAYTLSPLTRDHTVLSLLLDNLSSRIIQDDGTAIGSAL 184 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A L + S + +I LTDGENN + + T A Sbjct: 185 LVALNRLQA------------------SESELQMVILLTDGENNAGEVSPLTAAA--LAA 224 Query: 359 ENFIKIV--TISI------NASPNGQ---RLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 ++ ++ NA + L + + ++ V N L +I+ Sbjct: 225 RRGVRFYVLNVAFESVKDENAPRSALYAAELQEVARRTGGSYFTVNNKTELETTIASIA 283 >gi|33595651|ref|NP_883294.1| putative hemolysin [Bordetella parapertussis 12822] gi|33565730|emb|CAE36274.1| putative hemolysin [Bordetella parapertussis] Length = 2215 Score = 42.9 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 59/169 (34%), Gaps = 24/169 (14%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 ++ + + I LV+DLS SM+ S K T++ K+AL Sbjct: 1620 GGIKQNVTAGTSYNIALVLDLSDSMNDKWGSG-----------SNKPTRLQTAKDALKAL 1668 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPA 297 L++ + + + + LI + + + + V +D L+ S Sbjct: 1669 LENQLAVHDGE--INVSLITFNG---SSSALKKSITGLTPENVDEMVDILMGLKASSATP 1723 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 A+ T + + + P++ FLTDGE + S Sbjct: 1724 YGAAFDRTTQWFEGQPTVDSEGK-------PYKNLTFFLTDGEPSTEWS 1765 >gi|332216215|ref|XP_003257240.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Nomascus leucogenys] Length = 900 Score = 42.9 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + K R + L T+ AM A Q+L S Sbjct: 311 --NLIVFSTEAIQWRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDS 365 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 366 SNQ--------EERLPDGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|332216213|ref|XP_003257239.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Nomascus leucogenys] Length = 930 Score = 42.9 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + K R + L T+ AM A Q+L S Sbjct: 311 --NLIVFSTEAIQWRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDS 365 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 366 SNQ--------EERLPDGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|116751034|ref|YP_847721.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] gi|116700098|gb|ABK19286.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] Length = 479 Score = 42.9 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 68/223 (30%), Gaps = 43/223 (19%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + R + +V+D SGSM K+ + A+L L + Sbjct: 85 NVEARRELDMVVVMDRSGSMA-------------------DAGKLTHARQAVLNLLSRLS 125 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAY 302 L+ Y+ V+++ T R + R + + T+ +++ Sbjct: 126 ETD------RFALVSYSDHVQRHGGLLPITPANRATLERIVRGIQPGGATNLGGGLQEGI 179 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK-AKENF 361 L + + L +I ++DG N ++ + + A E Sbjct: 180 SQL-------------AELQQNGRLSR---LILISDGLANRGVTDPSALGTMASVAAERG 223 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + T+ + N + + + +A + VF Sbjct: 224 YAVSTVGVGLDFNEHLMTSIADKGAGNYTFMESASAFAQVFDK 266 >gi|332970883|gb|EGK09860.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 448 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 39/226 (17%), Positives = 73/226 (32%), Gaps = 40/226 (17%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 E E + +++D SGSM ++ KM K A+ F S+ Sbjct: 131 ESAEEKPLHVSILLDASGSMAGQVDGGM---------------KMNLAKAAVERFASSLP 175 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIE-------PSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + V VY G G ++ +K + GT + ++ T Sbjct: 176 ENAKVSLWVY-GHKGSNSKKDKPVSCKSTEEVYPLGTYQEEKFSQSLDQFRATGWTPIAA 234 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 +MK A + L Q +I+ +DG V K ++ Sbjct: 235 SMKAAREEL--------------QKNSGEDATHMLYIV--SDGVETCGGDPVAEAKKLNQ 278 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 +K + + I + GQ+ L+ + Y V ++ + + Sbjct: 279 SKIKAV-VNIIGFDVDDAGQQALQKVAEAGGGEYETVESEQDLRSY 323 >gi|171910783|ref|ZP_02926253.1| Protein containing von Willebrand factor (vWF) type A domain [Verrucomicrobium spinosum DSM 4136] Length = 917 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 37/260 (14%), Positives = 87/260 (33%), Gaps = 56/260 (21%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + ++ I R+ ER +VD+SGSM Sbjct: 511 MAEAPWKP--EHRLARIAIKGRVNQQERAPANFVFLVDVSGSMD---------------- 552 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + K+ +K +L + + + + ++ Y + + GTEK R + Sbjct: 553 ---EPDKLPLVKQSLRMLTERLSTKD------RVAIVTYAGSTAVILPSTAGTEKSR--I 601 Query: 281 TRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +D L T+ ++ AY+ +Q + + +I TDG Sbjct: 602 IEAIDGLGAGGSTNGAGGIRLAYEQ-------------AQQHFQKEGVNR---VILCTDG 645 Query: 340 E-NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA--- 395 + N S K+ ++ ++ + + + A R ++T +Y +++ Sbjct: 646 DFNVGISSPGELQKLIEEKAKSRVFLSVLGFGAGNLKDRTMETLADKGNGNYAYIDSLSE 705 Query: 396 ------DSLIHVFQNISQLM 409 + + I++ + Sbjct: 706 ARKVLVEQMNATLVTIAKDV 725 >gi|327265811|ref|XP_003217701.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Anolis carolinensis] Length = 914 Score = 42.9 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 36/167 (21%), Positives = 54/167 (32%), Gaps = 37/167 (22%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM K+ AL L+ ++ H V+ G Sbjct: 273 VIDKSGSMIGK--------------------KIQQTIEALQKILEDLNPEDHFNLVVFSG 312 Query: 256 LIG-YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 I + + + K E E +QYV M TD A+ A L Sbjct: 313 EISEWQSSLLKATEE--NVELAKQYVRTIMAQ---GGTDINGALLTAINSLDRATSAELL 367 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 I+ LTDG+ ++NVN+I+ K + Sbjct: 368 PEQSIS-----------MIVLLTDGQPTVGETNVNSIQTNIKKANDG 403 >gi|296170658|ref|ZP_06852233.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295894647|gb|EFG74381.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 335 Score = 42.9 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 72/234 (30%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D + +MAA + A F D + + Sbjct: 99 VMLVIDVSQSM---------------RATDVQPNRMAAAQEAAKQFADELTPG------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ + ++ + + + T + + A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNRDSTKRALDKLQFADR---TATGEGIFTALQAIATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNTIKICDKAKENFIKIVTISI 369 + P I+ +DG+ N + AK+ + I TIS Sbjct: 194 ---------IGGGDAPPPARIVLFSDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISF 244 Query: 370 NAS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P LK S YN L V+ + Q + Sbjct: 245 GTPYGFVEINDQRQPVPVDDETLKKVAQLSGGNAYNAATLQELKSVYATLQQQI 298 >gi|153012136|ref|YP_001373346.1| cell wall anchor domain-containing protein [Ochrobactrum anthropi ATCC 49188] gi|151564024|gb|ABS17517.1| LPXTG-motif cell wall anchor domain protein [Ochrobactrum anthropi ATCC 49188] Length = 750 Score = 42.9 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 51/304 (16%), Positives = 91/304 (29%), Gaps = 74/304 (24%) Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSN-----NTIFYNMDVMTSYDYRLQFIEHLL 165 A F +++++ + RI T+ A + + S ++ + Sbjct: 270 AGFPLGKVESLFHKVRIDTTNDATREITLDGAAAADRDFVLEWSAVASDAPQVGLFREHI 329 Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + PAL + +R + V+D SGSM Sbjct: 330 GKDDYVLAYVTPPALASPKKVQREVIF---VIDNSGSMGG-------------------- 366 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVY---MGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 T + K +L L + + + V+ N E R++VT Sbjct: 367 TSIEQAKASLDYALSQLQPGDRFN-VIRFDDTLTKFFEDSVDANQE---NIASARRFVTS 422 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T+ PA+ A S N RQ I+FLTDGE + Sbjct: 423 LEAQ---GGTEMLPALHAALD-------DSNQGNGLRQ------------IVFLTDGEIS 460 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 N + ++ + A+ +I + I S+P + A+ F Sbjct: 461 NEQQLLDAVA----ARRGRSRIFMVGI-------------GSAPNSYLMNRAAELGRGTF 503 Query: 403 QNIS 406 +I Sbjct: 504 THIG 507 >gi|285808587|gb|ADC36107.1| putative chloride channel [uncultured bacterium 126] Length = 869 Score = 42.9 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 74/227 (32%), Gaps = 58/227 (25%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + + P + +V+D S SM+ + + ++M Sbjct: 423 VTFDRDDEPTVALVIVLDRSWSMN-GTAMELSKSAAEGAANALAPSQM------------ 469 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL----ILKPTDSTP 296 +G++ + +I +VR+ D++ PT P Sbjct: 470 -------------LGVLTFNDASNWDIPL----GRVRESRPELHDAIGRIKASGPTAIFP 512 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ AY L + + R+ K +I L+DG+ + + + K Sbjct: 513 ALRNAYDALANVRVRA------------------KHVILLSDGQ--SDPEDFEGLVR--K 550 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVF 402 I + T+++ LL+ S Y V +A + +F Sbjct: 551 MSAAHITVSTVAL-GPDADAALLRNLASWGGGRSYVVQDAQQIPEIF 596 >gi|88601902|ref|YP_502080.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] gi|88187364|gb|ABD40361.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] Length = 316 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 77/232 (33%), Gaps = 60/232 (25%) Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 +SGSM D + T++ + K + + L S+D Y G+I Sbjct: 96 VSGSMQA---------------TDYQPTRLESSKRSAEILLKSLDPKD------YAGIIT 134 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + + S ++V + + T + + S R Sbjct: 135 FESGATSAAYLSPDKDRVIRKLQAIEP--KEGATAIGDGLALGIDMAESMPNR------- 185 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-------- 370 +K +I L+DG NN + A+E I++ TI + Sbjct: 186 -----------KKVVILLSDGVNNAGVIHPEQAA--GFAREKGIQVFTIGMGSDSPVVLG 232 Query: 371 ---------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 A+ + L + S+ ++ V+ +L ++ N+++ +V K Sbjct: 233 YDWFGNPQYATLDEAMLQQIAASTNGQYFKSVDDRTLSEIYSNLNKEIVREK 284 >gi|288921031|ref|ZP_06415322.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347549|gb|EFC81835.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 401 Score = 42.9 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 29/217 (13%), Positives = 59/217 (27%), Gaps = 31/217 (14%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 + + +L + A + I I L +D+SGSM Sbjct: 51 RWQRHIPVALFVLGLATLGVGAARPQASVPIAANSTTIL---LALDVSGSM--------- 98 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D ++ A + A F+ + +GL+ + Sbjct: 99 ------CSTDVSPNRLTAAEKAATDFIRA------QPGGSRIGLVTF---AGSAGLLVPP 143 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 T+ + + + T + + + ++ S Sbjct: 144 TDDTDKLLAALKSLTTSRGTAIGQGILTSIDAI-AEVDPSVPPTGADVPGGTGGEYAADV 202 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 I+ LTDG N + T +A +++ TI Sbjct: 203 IVVLTDGANTVGV-DPRTAA--GEAATRRLRVYTIGF 236 >gi|297529200|ref|YP_003670475.1| von Willebrand factor A [Geobacillus sp. C56-T3] gi|297252452|gb|ADI25898.1| von Willebrand factor type A [Geobacillus sp. C56-T3] Length = 1077 Score = 42.9 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 53/157 (33%), Gaps = 46/157 (29%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP + V+D+SGSM K+ + K+AL ++ + Sbjct: 195 RPPIDVVFVMDVSGSM--------------------TAMKLQSAKSALQAAVNYFKSNYN 234 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDS----LILKPTDSTPAMKQAY 302 + L+ ++ V + +G V + +++ T+ + A+ A Sbjct: 235 QND--RFALVPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAK 292 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T + +K+IIFLTDG Sbjct: 293 SYFTDPTR-------------------KKYIIFLTDG 310 >gi|309791847|ref|ZP_07686333.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226108|gb|EFO79850.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 542 Score = 42.9 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 44/261 (16%), Positives = 85/261 (32%), Gaps = 53/261 (20%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 + +Y + Q ++ +L IV+ + R I LVVD+SGSM Sbjct: 330 ISMAYGVQPQGVQTILEVPKADVIVAVKNSW----AINRKRADIILVVDISGSMEGD--- 382 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT---TRVEKN 266 K+ + L FL I +G+I ++ T V Sbjct: 383 -----------------KLEMTRAGLESFLMRILPDD------RVGMITFSSSATEVVAP 419 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 S +++ ++ T A++ A Q L + Sbjct: 420 AALSENRMQLQMAISEMSA---TGKTAVFDAVELARQSLEALPSTGEDR----------- 465 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 K I+ L+DG +N + + ++ E + I ++ A + +L Sbjct: 466 ---MKAIVLLSDGADNASRITLADLER--NFDETGVSIFPVAYGA-DADRSILDAIAEFS 519 Query: 387 EYHYNVVNADSLIHVFQNISQ 407 V + + +F+N+S+ Sbjct: 520 RTIVVVGDTGDIAQIFENLSR 540 >gi|256374530|ref|YP_003098190.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] gi|255918833|gb|ACU34344.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 550 Score = 42.9 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 27/219 (12%), Positives = 68/219 (31%), Gaps = 42/219 (19%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + +D +GSM A+N+ +K ++ + + S Sbjct: 49 GPMDVVFALDDTGSMGGALNN---------------------IKTSINAVVGDVVSASGG 87 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEK-VRQYVTRDMDSLILKP--TDSTPAMKQAYQIL 305 D +GL+ + + + G V YVT + + S A++ A + Sbjct: 88 --DYRLGLVTFKDSINVVTGLAAGNAGTVTGYVTNVLAASGGGGEPEASDEALRTA---V 142 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-----ENNNFKSNVNTI-KICDKAKE 359 + N G + +KF++ +TD ++ ++ + + + A Sbjct: 143 SLRPAAGIPQNADFTGPWRSNA--RKFVVLVTDARPGGFDDAFTAADQASATAVANSALA 200 Query: 360 NFIKIVTISI-----NASPNGQRLLKTCVSSPEYHYNVV 393 +K+ + + + ++ + Sbjct: 201 AGVKLSAVYVPTSPSMTPTIAPIMQNYATTTSGVYVQAQ 239 >gi|297579199|ref|ZP_06941127.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297536793|gb|EFH75626.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 1778 Score = 42.9 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 60/173 (34%), Gaps = 27/173 (15%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 E + ++L++D+SGSM + +T++ +K + + ++ Sbjct: 1159 ESETKQGANVQLMLDVSGSMGRDAGNG--------------KTRLQVMKESAIQLIEQYQ 1204 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 L + V + L ++ W T + + Y+ + TD A+K A Sbjct: 1205 AL--GQTKVQLILFSSDASIKTASGLLWMTVAEAKNYINALSAN---GGTDYDDAIKLAQ 1259 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE--NNNFKSNVNTIKI 353 + + + + + + D E N+ + N NTI+ Sbjct: 1260 ESWSGTINGQPLSGATNVSYFLSDGVPEGY-----DWELKNSQWVQNQNTIET 1307 >gi|33602243|ref|NP_889803.1| hypothetical protein BB3267 [Bordetella bronchiseptica RB50] gi|33576682|emb|CAE33759.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 571 Score = 42.9 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 33/239 (13%), Positives = 78/239 (32%), Gaps = 54/239 (22%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 R+ + P + L++D SGSM R K+ LK+AL + Sbjct: 200 RVAPQDIPAVNLVLLIDTSGSMA-------------------DRAKLPLLKSALRQLVTQ 240 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQ 300 + + ++ Y + + G + + +D L T+ ++ Sbjct: 241 MRAQD------RVAIVAYAGSAGLVLPSTPGDRHAQ--ILAAIDGLQASGSTNGGAGLEL 292 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-NTIKICDKAKE 359 AY + VK I+ +DG+ N ++++ ++ Sbjct: 293 AY------------AEAAKGLVKDGVNR----IVLASDGDFNVGRTDLAQLKDYVGSQRK 336 Query: 360 NFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVVN--------ADSLIHVFQNISQLM 409 I + T+ + + + N ++ + ++ + + A L I++ + Sbjct: 337 RGIALTTLGLGSGNYNDAMAMQLANAGDGSYHYIDSLLQARKVFASELSATLLTIAKDV 395 >gi|33596464|ref|NP_884107.1| hypothetical protein BPP1839 [Bordetella parapertussis 12822] gi|33566233|emb|CAE37141.1| putative exported protein [Bordetella parapertussis] Length = 571 Score = 42.9 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 33/239 (13%), Positives = 78/239 (32%), Gaps = 54/239 (22%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 R+ + P + L++D SGSM R K+ LK+AL + Sbjct: 200 RVAPQDIPAVNLVLLIDTSGSMA-------------------DRAKLPLLKSALRQLVTQ 240 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQ 300 + + ++ Y + + G + + +D L T+ ++ Sbjct: 241 MRAQD------RVAIVAYAGSAGLVLPSTPGDRHAQ--ILAAIDGLQASGSTNGGAGLEL 292 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-NTIKICDKAKE 359 AY + VK I+ +DG+ N ++++ ++ Sbjct: 293 AY------------AEAAKGLVKDGVNR----IVLASDGDFNVGRTDLAQLKDYVGSQRK 336 Query: 360 NFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVVN--------ADSLIHVFQNISQLM 409 I + T+ + + + N ++ + ++ + + A L I++ + Sbjct: 337 RGIALTTLGLGSGNYNDAMAMQLANAGDGSYHYIDSLLQARKVFASELSATLLTIAKDV 395 >gi|309361123|emb|CAP30209.2| hypothetical protein CBG_10938 [Caenorhabditis briggsae AF16] Length = 579 Score = 42.9 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 38/77 (49%), Gaps = 9/77 (11%) Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRL--LKTCVSSPE 387 K ++F TDG S +T + + K I + T+ I+ N G + LK +SP Sbjct: 505 KLLVF-TDG-----YSAEDTAEGIEALKRQGITVYTVGISTDKNAGLNVSELKGMATSPS 558 Query: 388 YHYNVVNADSLIHVFQN 404 ++++ + D+L+ F + Sbjct: 559 HYFDSSDFDNLLKHFPS 575 >gi|119493582|ref|ZP_01624246.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] gi|119452572|gb|EAW33755.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] Length = 414 Score = 42.9 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 40/262 (15%), Positives = 81/262 (30%), Gaps = 55/262 (20%) Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 N+ + + I +QR VS IP + + L++D SGSM Sbjct: 2 NVSLEPFLNDPNLDITQGSSQRQLSMSVSAIPNSVD----RHVPLNLCLILDHSGSMSG- 56 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 + + +K A +D ++ + ++ + R + Sbjct: 57 -------------------SPLETVKKAAGELIDRLNPGD------RISVVVFDHRAKVL 91 Query: 267 IEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 I E +++ + R S T +K + L K F Sbjct: 92 IPNQDIDDPESIKKQINRLRTS---GGTSIDEGLKLGIEELGKGKVERISQAF------- 141 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 LTDGEN + +N +K+ A + + + ++ + Q +L+ Sbjct: 142 ----------LLTDGENEHGDNN-RCLKLAKLATDYNLTLNSLGF-GNDWNQDILEKIAD 189 Query: 385 S-PEYHYNVVNADSLIHVFQNI 405 + + +I F + Sbjct: 190 EGGGTLAYIEYPEQVIDEFSRL 211 >gi|77465024|ref|YP_354527.1| hypothetical protein RSP_3006 [Rhodobacter sphaeroides 2.4.1] gi|77389442|gb|ABA80626.1| conserved hypothetical protein containing Von Willebrand factor, type A domain [Rhodobacter sphaeroides 2.4.1] Length = 222 Score = 42.9 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 65/194 (33%), Gaps = 40/194 (20%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + LV+D S SM A + L+ + F + L V Sbjct: 18 ARLPVCLVLDTSASMTGA--------------------PITELQEGVSTFFAQL-LADDV 56 Query: 249 KED-VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 E + ++ + V+ ++ + RQ V T A++ A ++L + Sbjct: 57 AEYSAEVAVVTFGGNVDMAVDFA---AVTRQTVPSLTAG---GMTPMGEAVETALELLHT 110 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENFIKIVT 366 K+ + + +Q +++ +TDG +N + D +E + + Sbjct: 111 RKE---------EYKRAGVDYYQPWLVIMTDGAPTDNISKASRLVD--DLVREKKLAVFA 159 Query: 367 ISINASPNGQRLLK 380 I I + L K Sbjct: 160 IGIGKDADMNELAK 173 >gi|325919992|ref|ZP_08181973.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] gi|325549526|gb|EGD20399.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] Length = 142 Score = 42.9 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 30/141 (21%), Positives = 51/141 (36%), Gaps = 27/141 (19%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 AA K L FLD +E +GL+ + R + VR + + L Sbjct: 17 AAAKAVLSDFLDR-------REGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVVGLA 69 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + T A+ + + L K+ Q+ ++ LTDG N N Sbjct: 70 GRETAIGDAIALSVKRLREQKQG------------------QRVVVLLTDGVNTAGVLNP 111 Query: 349 NTIKICDKAKENFIKIVTISI 369 +K + AK +++ TI+ Sbjct: 112 --LKAAELAKAEGVRVHTIAF 130 >gi|291399641|ref|XP_002716222.1| PREDICTED: collagen, type XXIX, alpha 1 [Oryctolagus cuniculus] Length = 2738 Score = 42.9 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 64/171 (37%), Gaps = 29/171 (16%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--LILKPTDSTPAMK 299 + + + +G+ ++ ++ ++ ++ +D LI T + A+K Sbjct: 655 LAKIQIGADKTQIGVAQFSDYNKEEFPL--NKYFTQKEISDAIDRMLLITGNTLTGSALK 712 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 F + + K +KF+I +TDGE + +V + ++ Sbjct: 713 --------------FIDTYFTQSKGARHGVKKFLILITDGE---AQDDVREPAV--ALRD 753 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + I+++ + + Q L+ ++V N + L + I + ++ Sbjct: 754 KGVIILSVGVYGANRTQ--LEEISGDGSLVFHVENFEDL----KEIERKLI 798 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 68/410 (16%), Positives = 136/410 (33%), Gaps = 74/410 (18%) Query: 31 MSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM---VSNLSRLGDRF-------- 79 + FL+ I ++ KN M++ A I GA K V+ S Sbjct: 629 IMFLVDGSSSIGYANFEKMKNFMQTLL-AKIQIGADKTQIGVAQFSDYNKEEFPLNKYFT 687 Query: 80 -ESISNHAKRALI-------DDAKRFIKNHIKESLSGYSAV------FYNTEIQNIVN-- 123 + IS+ R L+ A +FI + +S V + E Q+ V Sbjct: 688 QKEISDAIDRMLLITGNTLTGSALKFIDTYFTQSKGARHGVKKFLILITDGEAQDDVREP 747 Query: 124 --SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 + R + + + +N T + S + ++ E L + +K++ + L Sbjct: 748 AVALRDKGVIILSVGVYGANRTQLEEISGDGSLVFHVENFEDL--KEIERKLIFLVCTL- 804 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + + V+D SGS +NS+ ++ + L + Sbjct: 805 -HDCKRIQLLDVVFVLDHSGS----INSEQQESM-------------------INLTIHL 840 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + ++ V G + Y+ + T R + + + Sbjct: 841 VKKSDVGRDRVQFGALRYSDDPDILFYL--NTYSNRSAIIEHLRRRRDTGGSTF------ 892 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 T+ T F Q ++ +I +TDGE ++ + +N + K + Sbjct: 893 ----TAKALGRSATLFEEQHGSRIKQNVKQMLIIITDGE-SHDRHLLNDTAL--KLRNKG 945 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I I I++ Q L+ + E +V + D L V+ + + M Sbjct: 946 ITI--IAVGVGKANQEELEAMAGNKENTIHVKDFDKLKDVYLPLQESMCT 993 >gi|226314068|ref|YP_002773964.1| hypothetical protein BBR47_44830 [Brevibacillus brevis NBRC 100599] gi|226097018|dbj|BAH45460.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 677 Score = 42.9 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 45/251 (17%), Positives = 82/251 (32%), Gaps = 54/251 (21%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 A E G +F VVD S SM+ K K A + Sbjct: 36 GANGTAEAGVDAVF----VVDTSNSMN----------------------KTDPGKTAAEV 69 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ID+ + +G + Y R+ + P+ + + R+ ++ + Sbjct: 70 MSMFIDMSEATR--TRIGFVAYNDRIVQAQSPA----SMAEARNREQLKRTIQGLRYSG- 122 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK---------SNV 348 Y L +R + + F+I L+DG + + SN Sbjct: 123 ----YSDLGLGLRRGAEMIEKAK-----DPARKPFLILLSDGGTDLRQNAGGRSVAASNK 173 Query: 349 NTIKICDKAKENFIKIVTISINASPN-GQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI- 405 + + KAK I TI +N + + LK ++ + + D L +F I Sbjct: 174 DVETVISKAKAQGYPIYTIGLNNDGSVQKEQLKKIAEATGGTSFVTQSTDDLPEIFNQIF 233 Query: 406 SQLMVHRKYSV 416 ++ + + SV Sbjct: 234 AKHIQSQLVSV 244 >gi|2707733|gb|AAD03350.1| microneme protein precursor Etmic-1 [Eimeria tenella] Length = 712 Score = 42.9 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 57/187 (30%), Gaps = 37/187 (19%) Query: 174 VSFIPALLRIEMGER---PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 S P ++ + + LVVD SGS+ + Sbjct: 29 SSLQPGATTSSGQDQVCTSLLDVMLVVDESGSIGT-----------------------SN 65 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + D ++ + EDV +GLI + TR + W + S Sbjct: 66 FRKVRQFIEDFVNSMPISPEDVRVGLITFATR----SKVRWNLSDPKATNPSLAISAARS 121 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 + ST Y L KK + TN + K ++ +TDG +N ++ Sbjct: 122 LSYSTGVTYTHY-GLQDAKKLLYDTNAGARN------NVPKLVLVMTDGASNLPSQTRSS 174 Query: 351 IKICDKA 357 A Sbjct: 175 AAALRDA 181 >gi|77735553|ref|NP_001029472.1| calcium-activated chloride channel regulator 4 [Bos taurus] gi|74268230|gb|AAI03388.1| Chloride channel accessory 4 [Bos taurus] gi|296489202|gb|DAA31315.1| chloride channel accessory 4 [Bos taurus] Length = 933 Score = 42.9 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 31/219 (14%), Positives = 73/219 (33%), Gaps = 53/219 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM + ++ + A FL + + + ++ Sbjct: 310 LVLDKSGSMAAS-------------------NRLNRMNQAAQHFL--LQTVENGS---WV 345 Query: 255 GLIGYTTRVEKNIEPSWGTEKV-RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + R+ + + + T ++ +Q + + + Sbjct: 346 GMVHFDSSASIKSNLIQIISSSERRKLLESLPTAASGGTSICSGIESGFQAIRNADFQID 405 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 + I+ LTDGE++ T K C +K K++ I I++ Sbjct: 406 GSE----------------IVLLTDGEDS-------TAKSCIEKVKQSGAIIHFIAL-GP 441 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADS---LIHVFQNISQL 408 Q + + +++ + + LI F ++ Sbjct: 442 DAAQAVKEMSIATGGKYIYASDEGQNNGLIDAFAALASE 480 >gi|296128023|ref|YP_003635273.1| von Willebrand factor type A [Cellulomonas flavigena DSM 20109] gi|296019838|gb|ADG73074.1| von Willebrand factor type A [Cellulomonas flavigena DSM 20109] Length = 500 Score = 42.9 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 40/226 (17%), Positives = 79/226 (34%), Gaps = 47/226 (20%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 L R + ++V + R + LVVD SGSM + Sbjct: 133 LPFRPDHRLVRVGISSAPASAVSRADADLVLVVDCSGSMD-------------------E 173 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 KM K AL + S+ + ++ Y+T + +EP+ R+ V + Sbjct: 174 AGKMETTKYALRTLVSSLRRTD------RVAMVCYSTEADVYLEPT--PVAEREGVLAAI 225 Query: 285 DSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 D L T++ + Y + S + T ++ ++DG N Sbjct: 226 DRLAPRDSTNAAAGLALGYDLAMSMRTEGRLTR----------------VVLVSDGVANV 269 Query: 344 FKSNVNTI--KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 +++ I +I +AK I ++++ + + LL+ + Sbjct: 270 GETDPEGILARISSQAK-AGISLISVGVGITTYNDHLLEQLADQGD 314 >gi|255578117|ref|XP_002529928.1| protein binding protein, putative [Ricinus communis] gi|223530558|gb|EEF32436.1| protein binding protein, putative [Ricinus communis] Length = 731 Score = 42.9 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 31/235 (13%), Positives = 77/235 (32%), Gaps = 47/235 (20%) Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL----LNQRYNQKIVSFIP 178 + ++ L + + + + ++ F + S Sbjct: 208 RNLSTKNDSSDSHSLGTIDVKTYPEVSAVSRSASHDNFCVLIHLKAPVTSIRHNSSSNHM 267 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 L ++ R + V+D+SGSM TK+A LK A+ Sbjct: 268 ELPQMSQNSRAPVDLVTVLDVSGSMAG--------------------TKLALLKRAMGFV 307 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPA 297 + ++ + +I +++ + TE RQ ++SL+ T+ Sbjct: 308 IQNLGPSD------RLSVIAFSSTARRLFPLRCMTEAGRQEALLSVNSLVSNGGTNIAEG 361 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 +++ +++ K ++ + II L+DG++ ++ + + Sbjct: 362 LRKGAKVIVDRKWKNPVAS----------------IILLSDGQDTYTVTSPSGMN 400 >gi|153792263|ref|NP_001093210.1| matrilin 1 [Danio rerio] gi|148726249|emb|CAN88321.1| matrilin 1 [Danio rerio] gi|148726497|emb|CAN88267.1| matrilin 1 [Danio rerio] Length = 489 Score = 42.9 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 78/215 (36%), Gaps = 28/215 (13%) Query: 197 VDLSGSMHCAM-NSDPEDVNSAPICQDK-KRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + + +M + N+ P DV + ++ +K + ID LS + + Sbjct: 21 LRQAAAMAAGLCNTKPTDVVFIVDSSRSVRPSEFEQVK---VFLAKVIDGLSVGPDATRV 77 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 G++ Y +RV+ + K + T + A++ A + S+ + Sbjct: 78 GVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVAFSEAEG--- 134 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 G K P + K I +TDG + N+ I +A+E I+I I + Sbjct: 135 ------GRKSPDIS--KVAIIVTDG---RPQDNIRDIA--ARAREAGIEIFAIGVG--RV 179 Query: 375 GQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 L+ S P ++ V + + L FQ Sbjct: 180 DMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 Score = 36.4 bits (82), Expect = 7.9, Method: Composition-based stats. Identities = 34/239 (14%), Positives = 78/239 (32%), Gaps = 34/239 (14%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 + I P + E ++ S S N+ + V + + Sbjct: 232 EHICISTPGSFKCACREGFTL-----MNDSRSCSACSNAATDVVFLIDGSKSVRPENFEL 286 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLI 288 +K + L +D +D +S V GL+ Y++ V++ +++ V R Sbjct: 287 VKKWINLIIDKLD-VSETNTHV--GLVQYSSTVKQEFPLGRHNSKRSLKEAVKR------ 337 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 M + + SF + + K I TDG + ++ + Sbjct: 338 ---------MDYMERGTMTGHALSFLVDNSFGPNQGARPGVPKVGIVFTDGRSQDYIGDA 388 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 KAK K+ + + + L+ S P ++++ + ++ + + + Sbjct: 389 A-----KKAKALGFKMYAVGVG--NAVEDELREIASEPIADHYFYTADFKTMNQIAKKL 440 >gi|114587340|ref|XP_516522.2| PREDICTED: similar to PK-120 precursor isoform 5 [Pan troglodytes] gi|114587342|ref|XP_001172703.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 4 [Pan troglodytes] Length = 930 Score = 42.9 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 52/160 (32%), Gaps = 35/160 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + AL+ LD + V Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREALIKILDDLSPRDQFNLIV 314 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + R + K R + L T+ AM A Q+L S + Sbjct: 315 FSTEASQ-WRPSLVPASAENVNKARSFAVGIQA---LGGTNINDAMLMAVQLLDSSNQ-- 368 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + S+ II LTDG+ ++N +I+ Sbjct: 369 ------EEQLPEGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|114587346|ref|XP_001172688.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 3 [Pan troglodytes] Length = 900 Score = 42.9 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 52/160 (32%), Gaps = 35/160 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + AL+ LD + V Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREALIKILDDLSPRDQFNLIV 314 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + R + K R + L T+ AM A Q+L S + Sbjct: 315 FSTEASQ-WRPSLVPASAENVNKARSFAVGIQA---LGGTNINDAMLMAVQLLDSSNQ-- 368 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + S+ II LTDG+ ++N +I+ Sbjct: 369 ------EEQLPEGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|114587344|ref|XP_001172675.1| PREDICTED: similar to PK-120 precursor isoform 2 [Pan troglodytes] Length = 914 Score = 42.9 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 52/160 (32%), Gaps = 35/160 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + AL+ LD + V Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREALIKILDDLSPRDQFNLIV 314 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + R + K R + L T+ AM A Q+L S + Sbjct: 315 FSTEASQ-WRPSLVPASAENVNKARSFAVGIQA---LGGTNINDAMLMAVQLLDSSNQ-- 368 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + S+ II LTDG+ ++N +I+ Sbjct: 369 ------EEQLPEGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|260797291|ref|XP_002593637.1| hypothetical protein BRAFLDRAFT_235784 [Branchiostoma floridae] gi|229278863|gb|EEN49648.1| hypothetical protein BRAFLDRAFT_235784 [Branchiostoma floridae] Length = 371 Score = 42.9 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 36/262 (13%), Positives = 84/262 (32%), Gaps = 47/262 (17%) Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 + ++ +I +++ N LD N++ ++ + + ++ N + Sbjct: 140 GNYDLGQLTDIASTNETLGVVDNFNLLDDVRNSLLSSVCSVLNDNFLTTLTIQDCNSDHI 199 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 + L ++ I V+D SGS+ + K+ A Sbjct: 200 SITMPCYTLLEKVTPPCNNPVDIVFVLDGSGSVG-----------------RRNFEKVQA 242 Query: 231 -LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSL 287 +K + F ++D +G++ Y++ V + + + + Sbjct: 243 GVKKIVGDFNIALDS-------TRVGVVQYSSIVRQEFALDTFSNLQGLESGIQSIP--Y 293 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + AM+ A Q + I+ +TDG + + S Sbjct: 294 MAGGTRTGAAMEYAIQNSFTSAN-------------GARPDVGHVIVLVTDGRSYDDVSQ 340 Query: 348 VNTIKICDKAKENFIKIVTISI 369 + KAK+ I + + I Sbjct: 341 AS-----QKAKQAGIVVFAVGI 357 >gi|292491521|ref|YP_003526960.1| hypothetical protein Nhal_1422 [Nitrosococcus halophilus Nc4] gi|291580116|gb|ADE14573.1| conserved hypothetical protein [Nitrosococcus halophilus Nc4] Length = 398 Score = 42.9 bits (99), Expect = 0.098, Method: Composition-based stats. Identities = 12/60 (20%), Positives = 36/60 (60%), Gaps = 1/60 (1%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 S++ ++F + +++ + + G + + + K +++A +AA L+GA K+++++ +G Sbjct: 10 SQRGVTMVLFTIGMVAIIGMAGLALDMGHAYLNKTRLQNALDAAALSGA-KVLNDMHDVG 68 >gi|109900221|ref|YP_663476.1| vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c] gi|109702502|gb|ABG42422.1| Vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c] Length = 701 Score = 42.9 bits (99), Expect = 0.098, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 72/237 (30%), Gaps = 65/237 (27%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + P + ++D SGSM + K A+ Sbjct: 291 STGGAVAQQMPSREVVFLLDTSGSMAGE--------------------SIVQAKRAVDFA 330 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVE-----KNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 L + +V +I + + + ++ R +V T+ Sbjct: 331 LTQLRPEDNVN------IIQFNDAPQALWKRAMPATAKHIQRARNWVASLHAD---GGTE 381 Query: 294 STPAMKQAYQI--LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 PA+ A L D ++ RQ ++F+TDG +N + ++ I Sbjct: 382 MAPALTLALNKPSLHRDDSDLLGSHKLRQ------------VVFITDGSVSNEDALMSLI 429 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 ++K ++ TI I S+P ++ A + F I + Sbjct: 430 ----ESKLADNRLFTIGI-------------GSAPNSYFMTQAAQAGRGTFTYIGDI 469 >gi|83644399|ref|YP_432834.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83632442|gb|ABC28409.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 687 Score = 42.9 bits (99), Expect = 0.098, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 72/221 (32%), Gaps = 65/221 (29%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ + L+ F + + V + Sbjct: 312 LVLDRSGSMSGKFSTL-----------------LEGLRKGFAKF--------NRNDRVRV 346 Query: 255 GLIGYTTRVEKNIEPSWG-----TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 I + + E + G E ++Q V ++ T+ A++ A L +D+ Sbjct: 347 --IMFN---DNATEVTNGWVQATPENLQQVVGAVENAGPSGGTNLMSAIQSALTGLDADR 401 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + I +TDGE N + ++ I++ T I Sbjct: 402 TNA--------------------IWLVTDGEAN--VGETKQKAFIELLEKKDIRLFT-FI 438 Query: 370 NASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLM 409 + + LL+ +V N+D + I QLM Sbjct: 439 MGNSANRPLLEAITKHSNGFAISVSNSDDI------IGQLM 473 >gi|222528069|ref|YP_002571951.1| YD repeat-containing protein [Caldicellulosiruptor bescii DSM 6725] gi|222454916|gb|ACM59178.1| YD repeat protein [Caldicellulosiruptor bescii DSM 6725] Length = 3027 Score = 42.9 bits (99), Expect = 0.099, Method: Composition-based stats. Identities = 52/350 (14%), Positives = 107/350 (30%), Gaps = 56/350 (16%) Query: 50 KNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY 109 KN + SA A+ V LS I+ + + +K + +++ Sbjct: 632 KNPLTSAEKYAVSEDGKVFVRALSDANIL---IAPLQVKRSDNVFINSLKGIVGKAIEIT 688 Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 + F + + +VN + + N L + + + Sbjct: 689 AGGFDIKKAEIVVNYDEAELNGVEENNLMLYYVNYDKKILEPLEDVVVDTVYNRVSGKT- 747 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + +F+ + + I V+D SGSM D ++ Sbjct: 748 -EHFSTFLLGD-KNMPVDLSKVDIVFVLDNSGSMSS---------------NDPNYYRIE 790 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 A K F+ +ID L++ +GL+ + + V + K+ Q + Sbjct: 791 ATKK----FIQNIDELNN-----RVGLVDFDSSVSVRSNLTSDKSKLLQALNAMR--WTG 839 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T+ +K A +K I+ L+DG +N + Sbjct: 840 GSTNIGGGLKAAL-------------------GLFDQEQSKKIIVLLSDGYHNTGIHPND 880 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSL 398 + + + I + TI++ + LL + ++ V N L Sbjct: 881 VLP---ELIKQEIVVNTIAL-GKDCDRELLHDIADKTKGGYFYVDNTGGL 926 >gi|311273682|ref|XP_003133985.1| PREDICTED: integrin alpha-1-like, partial [Sus scrofa] Length = 506 Score = 42.9 bits (99), Expect = 0.099, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 71/184 (38%), Gaps = 25/184 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D + + + +G++ Y V + V + ++ + T Sbjct: 213 AFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNL--NKYSSTEEVLVAANEIVQRGGRQT 270 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A I T+ K+ R+G +K ++ +TDGE+++ I+ C+ Sbjct: 271 ---MTALGIDTARKEAFTEARGARRG-------VKKVMVIVTDGESHDNHRLNKVIQDCE 320 Query: 356 KAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + I+ +I+I S N L +K+ S P ++ +NV + +L+ + + + Sbjct: 321 ---DESIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEAL 377 Query: 406 SQLM 409 + + Sbjct: 378 GERI 381 >gi|304312981|ref|YP_003812579.1| hypothetical protein HDN1F_33640 [gamma proteobacterium HdN1] gi|301798714|emb|CBL46947.1| Hypothetical protein HDN1F_33640 [gamma proteobacterium HdN1] Length = 979 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 39/246 (15%), Positives = 86/246 (34%), Gaps = 53/246 (21%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 P + +++D+SGSM + P TK+ ++ Sbjct: 80 AAAVEPRPDMRVLIDISGSM----------KKTDPQNLRVPATKL------------LLN 117 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQA 301 L MG+ + V++ + + K +Q R+ + + T A+ A Sbjct: 118 LAKPGS---RMGIWTFGQHVDRLVPLATVDAKWKQAAAREANRISSSSLYTAIGDALDAA 174 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE---NNNFKSNVNTIK-----I 353 QG P +++ ++ L+DG + N N + + Sbjct: 175 -----------------IQGDLKPDPAWERSVVLLSDGMVDISKNPADNQREQQRIFQEV 217 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + K+ ++++ + + L + ++ + +AD L+HVF + + V++ Sbjct: 218 VPRLVAGGYKVHAVALSEQADIEFLKRLAEATKGHFSIAHSADQLMHVFVD-ASDRVNQP 276 Query: 414 YSVILK 419 V LK Sbjct: 277 LQVPLK 282 >gi|158891|gb|AAA29076.1| em100 gene is homologous the Eimeria tenella gene et100 (accession number M73495) encoding the microneme protein Etp100 [Eimeria maxima] Length = 724 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 54/180 (30%), Gaps = 39/180 (21%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGS+ + +++ + F + + Sbjct: 46 LDVMLVVDESGSIGT--------------------SNYGKVRSFISNFAG---TMPLSPD 82 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 DV +GL+ + GT V ++ ++ + + A + Sbjct: 83 DVRVGLVTF------------GTSAVTRWDLSDSRAQNADLLAAAAKKLPYAAGSTYTHL 130 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + K K I+ +TDG T+ +K + + IV + + Sbjct: 131 GLAKAEEILFSFQKGGRDNAPKMILVMTDG---ASSRRSQTLSAAEKLRNRGVIIVVLGV 187 >gi|327271798|ref|XP_003220674.1| PREDICTED: matrilin-4-like [Anolis carolinensis] Length = 592 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 30/209 (14%), Positives = 76/209 (36%), Gaps = 26/209 (12%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 +GS + P D+ + + + + +D I L +G+I Y Sbjct: 33 TGSQQSKCKTGPLDIVF--VIDSSRSVRPFEFETMRRFMIDIIHNLDIGPNATRVGVIQY 90 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 +++V+ + R + + +++++ + T + A++ + + ++ + Sbjct: 91 SSQVQNVFSLK--SFFTRAEMEKAINNIVPLAQGTMTGLAIQYVMNVAFTTQEGARPL-- 146 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 KIP + + +TDG + ++ +A+ I+I + + Sbjct: 147 ---HKKIPRVA-----VIVTDG---RPQD--RVTEVSAQARAAGIEIYAVGVQ--RADMN 191 Query: 378 LLKTCVSSP--EYHYNVVNADSLIHVFQN 404 L+ S E+ + V + LI F Sbjct: 192 SLRAMASPALEEHVFLVESF-DLIQQFGK 219 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 33/234 (14%), Positives = 78/234 (33%), Gaps = 42/234 (17%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL-----KNALL----LFLDS-----IDLL 245 SGS HC + C K + + ++ + +D L Sbjct: 328 SGSYHCVCPEGQQLQADKKTCNKCKAGHIDLVLVIDGSKSVRPQNFELVKQFVNQIVDFL 387 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +GL+ Y++RV + T + K T + A+K + Sbjct: 388 DVSPHGTRVGLVQYSSRVRTEFPLNKFTTAADLKKAVQRVQYMEKGTMTGLALKHMLEHS 447 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 ++ + G + S + + TDG + +++ + ++KE I + Sbjct: 448 FTEAE----------GARPLSQNVPRIGLVFTDG---RSQDDIS--EWARRSKEAGIIMF 492 Query: 366 TISINASPNGQRLLKTCVSSP--EYH-YNVVNADSLIHVFQNISQLMVHRKYSV 416 + + + + L+ S P ++ Y+ F ++ ++ + K ++ Sbjct: 493 AVGVGKAVESE--LREIASEPVDKHFSYSAD--------FNTMTNIVENLKINI 536 >gi|268608768|ref|ZP_06142495.1| hypothetical protein RflaF_04637 [Ruminococcus flavefaciens FD-1] Length = 453 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 48/297 (16%), Positives = 90/297 (30%), Gaps = 35/297 (11%) Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 +I + N D + + + + D H + I + ++ Sbjct: 117 KKFKIEIKKDPLNATDELVLRYISSDESIATVDENGIVTAHKPGKVTVTVIADAVIPVVN 176 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 E L++D SGSM A + ++ K + K A++ Sbjct: 177 DNSLET---YTVLILDASGSMQGAPMTAQKEAAKKFCEDTIGTNKNSNHKFAVITLDSGS 233 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 L +T ++ E K Y T+ + A++ A Sbjct: 234 KT-----------LTDFTNDID---ELDSAIAKTTAY----------GSTNYSAALRNA- 268 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 L S N P +K T + ++++ I + K++ Sbjct: 269 AELLSKVSADAVRNIVLCSDGNPYGGEEKSTGKYTLSDYSDYEYANAAYDIAQEIKKD-Y 327 Query: 363 KIVTISINASPNGQRL------LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 +I T+ S +G+ L LK S + V D L VF +++ V K Sbjct: 328 EIYTLGFFHSLSGEDLDFGRTYLKDVASYDSNYAEVNKVDDLQKVFADVAGNAVSYK 384 >gi|239833540|ref|ZP_04681868.1| Hypothetical protein OINT_2000308 [Ochrobactrum intermedium LMG 3301] gi|239821603|gb|EEQ93172.1| Hypothetical protein OINT_2000308 [Ochrobactrum intermedium LMG 3301] Length = 637 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 31/83 (37%), Gaps = 15/83 (18%) Query: 347 NVNTIKICDKAKENFIKIVTISI------NASPNGQRLLKTCVSS---------PEYHYN 391 N K+CD AK I ++T+++ + LL++C S+ P + Sbjct: 553 NARFAKLCDNAKAANIIVMTVALDLNEANSTEKAQIDLLRSCSSNSRVRMEGGKPAKLFW 612 Query: 392 VVNADSLIHVFQNISQLMVHRKY 414 L F+ I + + + Sbjct: 613 NSTGGELSETFRQIGDELSNLRL 635 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 33/237 (13%), Positives = 77/237 (32%), Gaps = 50/237 (21%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 E+ NF++I AL ++ LL + + +N+++++ +AA LA + + Sbjct: 39 KDERGNFAMIAALVLVPLLLAGMVAVDTANLMRVRNNVQASLDAAALAVGKRFST----- 93 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 G+ + ++ R + +LS + F Q+ ++ T Sbjct: 94 GESHTVVQDYGARIFYANVT---------ALSADAINFQIAFPQDKTTDQQVQATAAFTY 144 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + L +++ + ++ E + L Sbjct: 145 KSLFGVVA------------------SRLTGDNWDKHQYTLTASVRLKNTIE-----VAL 181 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 V+D S SM + ++ LK+A ++++ S + V Sbjct: 182 VLDNSKSMD-------------ETRSGSSKKRIDLLKDAASQLVETMASQSALITYV 225 >gi|13591884|ref|NP_112256.1| integrin alpha-1 precursor [Rattus norvegicus] gi|124941|sp|P18614|ITA1_RAT RecName: Full=Integrin alpha-1; AltName: Full=CD49 antigen-like family member A; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1; AltName: CD_antigen=CD49a; Flags: Precursor gi|56494|emb|CAA36384.1| unnamed protein product [Rattus norvegicus] gi|149059385|gb|EDM10392.1| integrin alpha 1, isoform CRA_b [Rattus norvegicus] Length = 1180 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 66/185 (35%), Gaps = 25/185 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + D + + + +G++ Y V K + + + Sbjct: 190 IAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNL----NKYSSTEEVLVAANKIGRQGG 245 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 M A I T+ K+ R+G +K ++ +TDGE N ++ Sbjct: 246 LQTM-TALGIDTARKEAFTEARGARRG-------VKKVMVIVTDGE---SHDNYRLKQVI 294 Query: 355 DKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQN 404 ++ I+ +I+I N L +K+ S P ++ +NV + +L+ + + Sbjct: 295 QDCEDENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKA 354 Query: 405 ISQLM 409 + + + Sbjct: 355 LGERI 359 >gi|261880541|ref|ZP_06006968.1| BatA protein [Prevotella bergensis DSM 17361] gi|270332764|gb|EFA43550.1| BatA protein [Prevotella bergensis DSM 17361] Length = 332 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 42/246 (17%), Positives = 67/246 (27%), Gaps = 72/246 (29%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D K ++ A K+ F+ Sbjct: 88 IDIMLAMDVSTSMLAE---------------DLKPNRLEAAKDVASEFV----SGRPSDN 128 Query: 251 DVYMGLIGYTTRVEKNIEPS-------WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 +GL + + VR + LI T + A Sbjct: 129 ---IGLTIFAGESFTQCPLTIDHGSLLNLLNNVRTDIAAR--GLIQDGTAVGMGLANAVS 183 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K +S K +I LTDG NN + T A+ I+ Sbjct: 184 RLKGSKAKS------------------KVVILLTDGSNNMGDISPLTAAQ--IARSLGIR 223 Query: 364 IVTISI---------NASPNG------------QRLLKTCVSSPEYHYNVVNADSLIHVF 402 + TI + G + L + Y N L ++ Sbjct: 224 VYTIGVGTNKVAPYPMPVAGGVQYVNMPVEIDTKTLSDIAAITEGNFYRATNNRELKQIY 283 Query: 403 QNISQL 408 ++I +L Sbjct: 284 RDIDKL 289 >gi|213965586|ref|ZP_03393780.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] gi|213951745|gb|EEB63133.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] Length = 330 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 36/255 (14%), Positives = 83/255 (32%), Gaps = 50/255 (19%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 ++ + + R + LVVD+S SM D ++ A Sbjct: 72 LVIISLAGPISETKVARNRATVMLVVDVSLSMSA---------------TDVAPDRITAA 116 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K A F++++ +D+ +GL+ ++ R + P+ T V R + + L Sbjct: 117 KEAGQEFVENL------PDDLNIGLVTFSGRARTAVSPT--TNH--DTVNRALQAAELDQ 166 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVN 349 +T + + + QG + P I+ L+DG+ + + Sbjct: 167 ATATG------DAIAAALDAINQFSDSVQGGGEGAPPAT--IVLLSDGKQTVPQELDDPR 218 Query: 350 -TIKICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVN 394 +A + + + TIS + L + + ++ + Sbjct: 219 GAYTAAAEAAKAGVPVNTISFGTAQGAITVQGELIPVPNDDDSLREIARRTKGEFFSAGS 278 Query: 395 ADSLIHVFQNISQLM 409 + L + ++ + Sbjct: 279 LEQLRDAYGSLEDDI 293 >gi|126653689|ref|ZP_01725608.1| BatA [Bacillus sp. B14905] gi|126589726|gb|EAZ83861.1| BatA [Bacillus sp. B14905] Length = 973 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 45/297 (15%), Positives = 95/297 (31%), Gaps = 62/297 (20%) Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 +S I+ T N S + Y ++ + + ++ RY + + Sbjct: 622 ASFITNTSDPLNNTGSPGTAVAYFDSIIYGKSKVEATLVNPIDPRYATILKGLKDKTVSK 681 Query: 184 EMGERPIFL---------IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 ++ P F + VVD S SM ++ Sbjct: 682 DIFTNPYFSKNSCSLATEVAYVVDYSSSMKAVDPTNYRGKK------------------- 722 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 ++ F++ + +++ + K GT S TD Sbjct: 723 MIEFINQLKAKNNIVIETNT----------KATILGEGTTDAVLKKDLYKASKEKGATDI 772 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + A ++D K + K I+ ++DG+ + K K+ Sbjct: 773 FAGIDIALTKFSNDTKTA------------------KAIVVVSDGKTSKSK----MTKVI 810 Query: 355 DKAKENFIKIVTISI--NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 ++AK+ +KI T+S+ + N L++ + +Y ++ L VFQ + + Sbjct: 811 NEAKKQGVKIYTVSMGKKSQINDATLMQVSTETGGAYYYALDNLQLHQVFQKLIDAI 867 >gi|73541336|ref|YP_295856.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72118749|gb|AAZ61012.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 354 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 36/265 (13%), Positives = 72/265 (27%), Gaps = 64/265 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L +D+S SM D T++ A + A DL+ + Sbjct: 87 TLVLAMDVSRSMAA---------------SDVAPTRIGAAQQAAR------DLIVGLPPG 125 Query: 252 VYMGLIGYTT----------------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 V +G++ + E GT + D Sbjct: 126 VRLGIVSFAATAIVVLPPTDNRLRMLDAIDRFELQNGTATGSGLIQSLAVLFPDDGIDLE 185 Query: 296 PAM--------KQAYQILTSDKKRSFFTN-FFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + LT Q P +I L+DG Sbjct: 186 GILFGGESLAPGTGGRSLTEAAAADAVRKRDLEQPGAAPGSYRHGAVILLSDGRRTTGPD 245 Query: 347 NVNTIKICDKAKENFIKIVTISINASP----NGQRL----------LKTCVS-SPEYHYN 391 ++ ++ A + +++ T+ A G L L+ + + ++ Sbjct: 246 PLDAARM---AAQRGLRVYTVGFGAVQDAGTEGSSLSYEMQVDEPTLRQIATLTDGEYFQ 302 Query: 392 VVNADSLIHVFQNISQLMVHRKYSV 416 +A L V++ +S + V Sbjct: 303 AGSAADLTRVYRQLSGRFALERSEV 327 >gi|125548980|gb|EAY94802.1| hypothetical protein OsI_16587 [Oryza sativa Indica Group] Length = 708 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 40/273 (14%), Positives = 84/273 (30%), Gaps = 56/273 (20%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + D+ + + +Q R + V+D+SGSM Sbjct: 234 SSKDDFAILIHLKAPSANPDQGTGKLANESSAGSSRNRAPVDLVTVLDVSGSMAG----- 288 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 TK+A LK A+ + + + +I +++ + Sbjct: 289 ---------------TKLALLKRAMGFVVQHLGPSD------RLSVIAFSSSARRLFHLQ 327 Query: 271 WGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + RQ + ++SL T+ A+K+A +++ ++ + Sbjct: 328 RISHHGRQQALQAINSLGASGGTNIADALKKAMKVIEDRSYKNSVCS------------- 374 Query: 330 QKFIIFLTDG-ENNNFKSNVNTI----------KICDKAKENFIKIVTISINASPNGQRL 378 II L+DG + N S+V I + A+ + + A + L Sbjct: 375 ---IILLSDGQDTYNISSSVQGASPDYKSLVPSSIINDARHT-VPLHAFGFGADHDSDSL 430 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV 410 +S + + + F I L+ Sbjct: 431 HSIAQASGGTFSFIEDEGVMQDAFAQCIGGLLS 463 >gi|317057468|ref|YP_004105935.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315449737|gb|ADU23301.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 782 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 39/240 (16%), Positives = 84/240 (35%), Gaps = 41/240 (17%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 A + R FLI D SGSM P+++ D ++ + Sbjct: 276 ASVNEAATTRVAFLI----DNSGSM------YPKELCPTSSENDVDFKRLDFTQ----SL 321 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPA 297 +D D D +G+ +T + + + + R + I T + A Sbjct: 322 IDKFDS------DFRIGISKFTGTYTRMCGFTDDRTALSDVIKRIRTEDEIFDGTHNQTA 375 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 +K+ + T+ + I+ L+DGE++ + + A Sbjct: 376 LKRCIEEFTA----------------TGDGKYVNIIVMLSDGESDETNAESI-KNLARLA 418 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 E + ++T+ + + L + S+ +Y+ A++L V++ I + Y ++ Sbjct: 419 NEKSVIVLTVGLGREIDRAWLQEMAYSTGGKYYSASEANALDDVYKQI---VTTLNYDIV 475 >gi|298491708|ref|YP_003721885.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233626|gb|ADI64762.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 418 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 78/242 (32%), Gaps = 51/242 (21%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 Q +Q+ ++ + + + R + L++D SGSM Sbjct: 18 QPSSQRQLAVSISAVGETLDRRVPLNLCLILDHSGSM--------------------NGR 57 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR--VEKNIEPSWGTEKVRQYVTRDM 284 + +K A+ L +D + + ++ + R + + +++Q + R Sbjct: 58 ALETVKKAVSLLVDQLSSED------RLSIVVFDHRAKILVPNQIISDRNQIKQQINRL- 110 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T ++ + L KK + F LTDGEN + Sbjct: 111 --TADGGTAIDEGLRLGIEELAKGKKDTISQAF-----------------LLTDGENEHG 151 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQ 403 +N +K A + + T+ Q +L+ + ++ + + + F Sbjct: 152 DNN-RCLKFAQLAASYNLTLNTLGF-GDNWNQDILEKIADAGLGNLSHIEHPNQAVDKFS 209 Query: 404 NI 405 + Sbjct: 210 RL 211 >gi|126344397|ref|XP_001365113.1| PREDICTED: similar to calcium-dependent chloride channel-1, partial [Monodelphis domestica] Length = 660 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 45/233 (19%), Positives = 82/233 (35%), Gaps = 55/233 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ L A LFL I +++ + Sbjct: 65 LVLDKSGSMAGG-------------------DRLNRLNQASQLFLLQI-----IEKGSWT 100 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + T+ R + + + T ++ A+ ++ K F Sbjct: 101 GMVTFDSSATIQSALIQIETDAQRNSLISRLPTAAGGGTSICSGLRTAFTVI----KNKF 156 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISI--N 370 T+ I+ LTDGE++ TI C D+ K++ I T+++ + Sbjct: 157 STDGSE-------------IVLLTDGEDS-------TISSCFDEVKQSGAIIHTVALGPS 196 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 A P + L K + + LI F +S + R + KG Sbjct: 197 ADPGLEELAKMTGGMKTSPTDNAQNNGLIDAFSALSSGNGAITQRSIQLESKG 249 >gi|14042797|dbj|BAB55397.1| unnamed protein product [Homo sapiens] Length = 397 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 40/189 (21%), Positives = 69/189 (36%), Gaps = 28/189 (14%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRD 283 TK+ K+AL L + +IG++ R++ K+ S + +R Sbjct: 4 TKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYI 57 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 TD A+++A ++L N + I I+FLTDG+ Sbjct: 58 HHMSPTGGTDINGALQRAIRLL----------NKYVAHSGIGDRSVS-LIVFLTDGK--P 104 Query: 344 FKSNVNTIKICDKAKENF---IKIVTISINASPNGQRL----LKTCVSSPEYHYNVVNAD 396 +T+KI + +E + I TI I + + L L+ C + H Sbjct: 105 TVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGS 164 Query: 397 SLIHVFQNI 405 LI + I Sbjct: 165 QLIGFYDEI 173 >gi|14042827|dbj|BAB55409.1| unnamed protein product [Homo sapiens] Length = 397 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 40/189 (21%), Positives = 69/189 (36%), Gaps = 28/189 (14%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRD 283 TK+ K+AL L + +IG++ R++ K+ S + +R Sbjct: 4 TKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYI 57 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 TD A+++A ++L N + I I+FLTDG+ Sbjct: 58 HHMSPTGGTDINGALQRAIRLL----------NKYVAHSGIGDRSVS-LIVFLTDGK--P 104 Query: 344 FKSNVNTIKICDKAKENF---IKIVTISINASPNGQRL----LKTCVSSPEYHYNVVNAD 396 +T+KI + +E + I TI I + + L L+ C + H Sbjct: 105 TVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKPSLENCGLTRRVHEEEDAGS 164 Query: 397 SLIHVFQNI 405 LI + I Sbjct: 165 QLIGFYDEI 173 >gi|56421171|ref|YP_148489.1| hypothetical protein GK2636 [Geobacillus kaustophilus HTA426] gi|56381013|dbj|BAD76921.1| hypothetical protein [Geobacillus kaustophilus HTA426] Length = 960 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 53/157 (33%), Gaps = 46/157 (29%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP + V+D+SGSM K+ + K+AL ++ + Sbjct: 78 RPPIDVVFVMDVSGSM--------------------TAMKLQSAKSALQAAVNYFKSNYN 117 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDS----LILKPTDSTPAMKQAY 302 + LI ++ V + +G V + +++ T+ + A+ A Sbjct: 118 QND--RFALIPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAK 175 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T + +K+IIFLTDG Sbjct: 176 SYFTDPTR-------------------KKYIIFLTDG 193 >gi|291398581|ref|XP_002715573.1| PREDICTED: chloride channel accessory 4 [Oryctolagus cuniculus] Length = 874 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 51/362 (14%), Positives = 119/362 (32%), Gaps = 66/362 (18%) Query: 51 NSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS 110 N +E+ +A + G +++ + N + D + F K S Sbjct: 129 NKIEATRCSAGITGINRVYTCRGDSCVTRNCRMNTTTKLYEKDCQFFPD---KVQTEKAS 185 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSS-NNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 ++ + + + + A N ++ N + + ++ +E Sbjct: 186 IMYMQSIESTVEFCNEKNHNREAPNLQNTKCNFRSTWEVISISEDFQSTTPMEGPP---- 241 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 PA + +R + LV+D SGSM + ++ Sbjct: 242 ------PTPAFSLLRPRQR---TVCLVLDKSGSMASS-------------------DRLN 273 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLI 288 + A FL + V+ ++G++ + + + + R + + + ++ Sbjct: 274 RMNQAAKYFLLQV-----VENGSWVGMVHFDSTASVKSQLIQIKSNSERNQLLQSLPTVA 328 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T ++ A+Q+ + I+ LTDGE++ S V Sbjct: 329 GGGTQICLGIRAAFQVFKQQNSEIDGSE----------------IVLLTDGEDSTASSCV 372 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA--DSLIHVFQNIS 406 N + E+ I I++ S + + + ++ ++Y NA + LI F ++ Sbjct: 373 NEVI------ESGTIIHFIALGPSADASVIQMSNLTGGSHYYASDNAQNNGLIDAFGALT 426 Query: 407 QL 408 Sbjct: 427 SE 428 >gi|124004754|ref|ZP_01689598.1| von Willebrand factor, type A, putative [Microscilla marina ATCC 23134] gi|123989877|gb|EAY29406.1| von Willebrand factor, type A, putative [Microscilla marina ATCC 23134] Length = 354 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 67/188 (35%), Gaps = 39/188 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I + VDLS SM D +++ +K L ++++ Sbjct: 106 DIFIAVDLSLSM---------------KATDIPPSRLEKIKYELSNIINTLKSD------ 144 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKV---RQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 +GL+ +++ + ++ + Q + ++ + TD ++ + Sbjct: 145 -RIGLVIFSSSAFMHCPLTYDKGALNLFTQILNTNLMPIGNAGTDFYAPLELVLKKYQEA 203 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K + + + K ++ +DGE + I D+ K+N I++ T+ Sbjct: 204 NKSN----------RKQQNEYAKVVVLFSDGEEFGDRYT----AIVDQYKQNNIRVFTVG 249 Query: 369 INASPNGQ 376 + + G+ Sbjct: 250 VGSLQGGK 257 >gi|108758240|ref|YP_629592.1| putative lipoprotein [Myxococcus xanthus DK 1622] gi|108462120|gb|ABF87305.1| putative lipoprotein [Myxococcus xanthus DK 1622] Length = 659 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 63/162 (38%), Gaps = 25/162 (15%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM-------AALKNALLLFLDSIDLL 245 + +V+D SGSM ++ P CQ ++ + A AL + + Sbjct: 61 VVVVIDESGSM--CVSDPPGAQLDNGFCQRREILDIIPEGVTEPARVRALKRLVQQFREV 118 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWG-------TEKVRQYVTRDMDSLILKPTDSTPAM 298 + +V + + + T V P+ + Y+ + S + K TD A+ Sbjct: 119 NAQGGNVQVSVAPFETNVRNVWPPTTTGDRFARPDNNIDSYI-EGLQSQLGKGTDYQGAL 177 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF-IIFLTDG 339 AY +++SD +N LP ++ ++FLTDG Sbjct: 178 SYAYSLISSDINAVAQSNPEL-------LPRTRYVVVFLTDG 212 >gi|254453558|ref|ZP_05066995.1| von Willebrand factor, type A [Octadecabacter antarcticus 238] gi|198267964|gb|EDY92234.1| von Willebrand factor, type A [Octadecabacter antarcticus 238] Length = 676 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 42/303 (13%), Positives = 90/303 (29%), Gaps = 49/303 (16%) Query: 54 ESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 +A ++A + +S I + +D +K +E +S S Sbjct: 182 RAAPTNDLVAEGMALGQVVSAPEPNVAGIGELDTESFANDTPNPLKITTEEPVSTLSIDV 241 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL---LNQRYN 170 I +S + + ++ Q ++ + Sbjct: 242 DTAAYALIRSSLTAG-QLPPADAVRIEEMINYFPYAYPAPDGQPFQPTINVFETPW-NAD 299 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 ++V + +RP + ++D SGSM + K+ Sbjct: 300 TQLVHIGLQGEMPSIQDRPALNLVFLIDTSGSM-------------------ESADKLPL 340 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 L+ + L LD++ V Y G + + E R + +++L Sbjct: 341 LRQSFRLMLDNLAPEDEVAIVTYAGSTSIALQPTQASE--------RTTILAALNALNAG 392 Query: 291 P-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T+ ++QAY + + K + +I TDG+ N S+ Sbjct: 393 GSTNGQGGLEQAYALAETMKTDGDVSR----------------VILATDGDFNVGLSDPR 436 Query: 350 TIK 352 ++ Sbjct: 437 GLQ 439 >gi|118617151|ref|YP_905483.1| hypothetical protein MUL_1490 [Mycobacterium ulcerans Agy99] gi|166979868|sp|A0PNU3|Y1490_MYCUA RecName: Full=UPF0353 protein MUL_1490 gi|118569261|gb|ABL04012.1| membrane protein [Mycobacterium ulcerans Agy99] Length = 335 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 71/234 (30%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D + +M A + A F D + + Sbjct: 99 VMLVIDVSQSM---------------RATDVEPNRMVAAQEAAKQFADELTPG------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ E + + + + T + A+ A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNREATKAALDKLQFADR---TATGEAIFTALQAIATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICD-KAKENFIKIVTISI 369 + P I+ +DG+ N AK+ + I TIS Sbjct: 194 ---------IGGGDTPPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISF 244 Query: 370 NAS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P +K S YN L V+ ++ Q + Sbjct: 245 GTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELNSVYVSLQQQI 298 >gi|56797849|emb|CAF33009.1| matrilin-1 [Danio rerio] Length = 320 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 78/215 (36%), Gaps = 28/215 (13%) Query: 197 VDLSGSMHCAM-NSDPEDVNSAPICQDK-KRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + + +M + N+ P DV + ++ +K + ID LS + + Sbjct: 21 LRQAAAMAAGLCNTKPTDVVFIVDSSRSVRPSEFEQVK---VFLAKVIDGLSVGPDATRV 77 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 G++ Y +RV+ + K + T + A++ A + S+ + Sbjct: 78 GVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVAFSEAEG--- 134 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 G K P + K I +TDG + N+ I +A+E I+I I + Sbjct: 135 ------GRKSPDIS--KVAIIVTDG---RPQDNIRDIA--ARAREAGIEIFAIGVG--RV 179 Query: 375 GQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 L+ S P ++ V + + L FQ Sbjct: 180 DMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 >gi|254255255|ref|ZP_04948571.1| hypothetical protein BDAG_04588 [Burkholderia dolosa AUO158] gi|124900992|gb|EAY71742.1| hypothetical protein BDAG_04588 [Burkholderia dolosa AUO158] Length = 511 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 15/124 (12%), Positives = 49/124 (39%), Gaps = 6/124 (4%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++G+ ++ +II L++ + +G + + + ++ ++++ +A L+ A + S Sbjct: 99 VRRGLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTS 158 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +S I A F + + L+ + F ++ ++ + ++ Sbjct: 159 AISLSVAEANGI------AAGHMNFAFFQQKPVQMLTDSNVTFSDSLTNPFLSKTAVATP 212 Query: 131 HMAN 134 Sbjct: 213 ANVK 216 >gi|12583699|dbj|BAB21479.1| integrin alpha Hr1 precursor [Halocynthia roretzi] Length = 1332 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 84/233 (36%), Gaps = 44/233 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D SGS+ +K+ + +D+ KE Sbjct: 206 DVLFVLDGSGSVGKNF---------------------DKVKDWVKNITAKLDI---GKEI 241 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V +G++ Y+ VE ++Y+T ++ K D+ + A + Sbjct: 242 VRVGVVQYSHYVEGKS------INKQKYITTEISIGEFKLLDN---FENAVDRIQLQGYT 292 Query: 312 SFFTNFFRQGVKIPSLPF---QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 ++ ++ ++ + ++ ++ LTDG+ K N + ++ + I + Sbjct: 293 TYTGRALQKVIRDFDDAYIGNKQVLLLLTDGQ---AKDNKLILPNANRLRNKGIATFAVG 349 Query: 369 INASP-NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 + + +L+ + S + + V + L +I + + S +L+G Sbjct: 350 VGEYDISELKLIASGTDSTDRVFTVTDFGEL----DSIVKSLQTEIQSFVLEG 398 >gi|224046544|ref|XP_002198814.1| PREDICTED: matrilin 2 [Taeniopygia guttata] Length = 902 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 38/174 (21%), Positives = 67/174 (38%), Gaps = 26/174 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 L + L + ++GLI Y + V++ T + +Q + R + ++ T + Sbjct: 38 ILTILQFLDISPDATHVGLIQYGSTVKQEFSLK--TFRRKQDIERAVKRMMHLGTGTMTG 95 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A I S+ + RQ V + I+ +TDG + + I Sbjct: 96 LALQYAVNIAFSETE---GARPLRQNVP-------RIIMIVTDGRPQDPVAE-----IAA 140 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADS---LIHVFQN 404 KA+ + I I I + LK+ S P E+ + V N L FQ Sbjct: 141 KARNSGILIFAIGVG--RVDMNTLKSIGSEPHEEHVFLVANFSQIETLTSAFQT 192 >gi|119596272|gb|EAW75866.1| matrilin 4, isoform CRA_a [Homo sapiens] Length = 391 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 182 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 239 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 240 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 284 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 285 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 334 >gi|56417742|emb|CAI21077.1| matrilin 4 [Homo sapiens] Length = 432 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV G V + + + T + A++ Sbjct: 223 VDFLDVSPEGTRVGLVQFSSRVRTEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALR 280 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 281 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 325 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 326 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 375 >gi|3928084|gb|AAC79610.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 689 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 40/296 (13%), Positives = 103/296 (34%), Gaps = 60/296 (20%) Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 E Q + + S N+ + + I+ + + D R +F + + + Sbjct: 175 EQQLVFSGKSYSDALENNHPVRMMDLKIYPEVSAVPRADSREKFDVLVHLRAAAMVTGNA 234 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 +I R + V+D+SGSM TK+A LK A+ Sbjct: 235 NSLNNQISRYPRAPVDLVTVLDISGSMAG--------------------TKLALLKRAMG 274 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTRDMDSLILKPT 292 + ++ + +I +++ + + G ++ Q V + + T Sbjct: 275 FVIQNLGSND------RLSVIAFSSTARRLFPLTKMSDAGRQRALQAVNSVVAN---GGT 325 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVN- 349 + +++ +++ + ++ + II L+DG + +++ N Sbjct: 326 NIAEGLRKGVKVMEDRRDKNPVAS----------------IILLSDGRDTYTMNQADPNY 369 Query: 350 ------TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 ++ C+ K I + + S + L+ + + ++ + ++S+I Sbjct: 370 KLLLPLSMHGCES-KRFQIPVHSFGF-GSDHDASLMHSVSETSGGTFSFIESESVI 423 >gi|284030499|ref|YP_003380430.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283809792|gb|ADB31631.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 317 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 70/237 (29%), Gaps = 53/237 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R I + +D+S SM D ++ A K + F++ + + Sbjct: 83 RERATIVVAIDVSLSMMA---------------TDVDPNRLEAAKKSAKNFVNQLPSKFN 127 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILT 306 V + G + R V R +D L L T + + + Q LT Sbjct: 128 VALVNFAGTAS----IIVPPTTD------RATVQRSIDGLELAESTATGEGIFTSLQALT 177 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 N I+ L+DG+ ++ + AKE I T Sbjct: 178 QVPPDPEHPNDPAPAR----------IVLLSDGKRTVGRTAQEGAQ---AAKEKNTPIYT 224 Query: 367 ISI--------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I+ P+ L + Y +A L V+++I + Sbjct: 225 ITFGTDSGFIEMDGIRQRVPPDRAELRSVAEITGGEAYTAESAGELEDVYKDIGSSV 281 >gi|47214246|emb|CAG12465.1| unnamed protein product [Tetraodon nigroviridis] Length = 467 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 33/244 (13%), Positives = 75/244 (30%), Gaps = 69/244 (28%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 +R + S + + +VD S S+ + Sbjct: 28 GRRNHAIDPSSTVSPASGSSCRNGPIDLVFIVDSSRSV--------------------RP 67 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 T+ K L +DS+++ + +GL+ Y + + Sbjct: 68 TEFEKAKEFLQDLVDSLEV---GLDSTRVGLVNY-----ASTPLA--------------- 104 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A++ A + + + G ++ S + + +TDG + Sbjct: 105 ----SGTMTGLAIRTAVEKAFAAEA----------GARLNSTKVARVAVVVTDG---RPQ 147 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIH 400 V + A+E+ I+I + + + L+ S P ++ + V + L Sbjct: 148 DEVERVS--AAARESGIEIYAVGV--DRADRTSLRLMASQPHEDHVFYVETYGVIEKLTS 203 Query: 401 VFQN 404 F+ Sbjct: 204 RFRE 207 >gi|42524419|ref|NP_969799.1| putative secreted protein [Bdellovibrio bacteriovorus HD100] gi|39576628|emb|CAE80792.1| putative secreted protein [Bdellovibrio bacteriovorus HD100] Length = 469 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 47/247 (19%), Positives = 83/247 (33%), Gaps = 59/247 (23%) Query: 176 FIPALLRIEMGERPIFLIELV---VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 A ++ + P I LV +D SGSM M K+TK+ LK Sbjct: 50 PANASGEVKAEDLPSTAIPLVEYVIDSSGSMGQLMGP--------------KKTKIYVLK 95 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK-------NIEPSWGTEKVRQYVTRDMD 285 L +L +S E GL +R +K IEP+ + + + + Sbjct: 96 KLLARYL-----MSQWTEKTSSGLRVIGSRRKKDCKDNYLAIEPAQSKLGAIEGIVKGFE 150 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T A+K AY+ + + K P K ++ TDGE + Sbjct: 151 --PVGMTPIGQALKDAYKDV--------------EHYKGP-----KRVVLFTDGEETCGQ 189 Query: 346 SNVNTIKICDKAKENFIKIVTISI---NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 K+ +K ++ N +L C+ + + L +F Sbjct: 190 DPCKIAAEL-SGKDVDLKFFVVAFGLQNQPDVLDKL--ACIGDMS---QADDEEKLEELF 243 Query: 403 QNISQLM 409 Q++ + + Sbjct: 244 QDLDKQL 250 >gi|14042009|dbj|BAB55070.1| unnamed protein product [Homo sapiens] Length = 942 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 75/222 (33%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + Sbjct: 296 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQD------ 329 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG+ R++ K+ S + +R TD A+++A ++L Sbjct: 330 RFSIIGFPNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 384 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 385 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|118090156|ref|XP_420539.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 606 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 36/238 (15%), Positives = 75/238 (31%), Gaps = 44/238 (18%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + F + V+D SGS+ + + VN + T+ Sbjct: 146 EQPSCHGAFDMYFVLDKSGSVAQNWHEIFDFVNQLDGSGFVRLTE--------------- 190 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 +S + + I ++++ + + EK+++ + + T +KQA Sbjct: 191 RFVSP---KMRLSFIVFSSQAHVIMPLTGDREKIKEGLKNLSEVKPAGDTYIHEGLKQA- 246 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKEN 360 Q K + F II LTDG + KI ++E Sbjct: 247 ---------------NMQIEKQGASRFSSIIIALTDGKLDGQIPLYAEKEAKI---SREL 288 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 ++ + + Q L+ + E + V + I ++ + + IL Sbjct: 289 GARVYCVGVL--DFVQAQLERIADTKEQVFPVTGGFQ---ALKGIINSVLKQSCTEIL 341 >gi|73950493|ref|XP_544451.2| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Canis familiaris] Length = 562 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 83/223 (37%), Gaps = 40/223 (17%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKED 251 +NSD + N T + L + + F++ I D L + Sbjct: 207 GFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKL 266 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDK 309 +GL+ Y++ V + G ++ + + S + K T + A+K D Sbjct: 267 AQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----LIDN 319 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + K+ I+F TDG + ++ ++ KAK+ K+ + + Sbjct: 320 SFTVSSGARPGAQKVG-------IVF-TDGRSQDYINDAA-----KKAKDLGFKMFAVGV 366 Query: 370 -NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 NA + L+ S P E+++ + ++ + + + + + Sbjct: 367 GNAVEDE---LREIASEPVAEHYFYTADFKTINQIGKRLQKKI 406 >gi|186681467|ref|YP_001864663.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186463919|gb|ACC79720.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 418 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 36/242 (14%), Positives = 73/242 (30%), Gaps = 49/242 (20%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 NQ +Q+ + + + + L++D SGSM Sbjct: 17 NQLNSQRQLGISISAIAETQDRHVPLNLCLILDHSGSM--------------------NG 56 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR--VEKNIEPSWGTEKVRQYVTRD 283 + +K A +D ++ + ++ + R V + EK++ + R Sbjct: 57 RSLETVKKAANRLVDRLNPSD------RLSVVVFDHRAKVLVPSQSVEDPEKIKNQINRL 110 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T ++ + L KK + F LTDGEN + Sbjct: 111 AAD---GGTAIDEGLRLGIEELAKGKKDTVSQAF-----------------LLTDGENEH 150 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +N +K A + + T+ + N L K + + + + F Sbjct: 151 GDNN-RCLKFAQLAASYNLTLNTLGFGDNWNQDVLEKIADAGLGTLSYIQKPEEAVDEFN 209 Query: 404 NI 405 + Sbjct: 210 RL 211 >gi|115459346|ref|NP_001053273.1| Os04g0508800 [Oryza sativa Japonica Group] gi|32489531|emb|CAE04734.1| OSJNBa0043L24.22 [Oryza sativa Japonica Group] gi|113564844|dbj|BAF15187.1| Os04g0508800 [Oryza sativa Japonica Group] gi|116310776|emb|CAH67569.1| OSIGBa0101P20.12 [Oryza sativa Indica Group] gi|125590953|gb|EAZ31303.1| hypothetical protein OsJ_15416 [Oryza sativa Japonica Group] Length = 708 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 40/273 (14%), Positives = 84/273 (30%), Gaps = 56/273 (20%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + D+ + + +Q R + V+D+SGSM Sbjct: 234 SSKDDFAILIHLKAPSANPDQGTGKLANESSAGSSRNRAPVDLVTVLDVSGSMAG----- 288 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 TK+A LK A+ + + + +I +++ + Sbjct: 289 ---------------TKLALLKRAMGFVVQHLGPSD------RLSVIAFSSSARRLFHLQ 327 Query: 271 WGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + RQ + ++SL T+ A+K+A +++ ++ + Sbjct: 328 RISHHGRQQALQAINSLGASGGTNIADALKKAMKVIEDRSYKNSVCS------------- 374 Query: 330 QKFIIFLTDG-ENNNFKSNVNTI----------KICDKAKENFIKIVTISINASPNGQRL 378 II L+DG + N S+V I + A+ + + A + L Sbjct: 375 ---IILLSDGQDTYNISSSVQGASPDYKSLVPSSIINDARHT-VPLHAFGFGADHDSDSL 430 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV 410 +S + + + F I L+ Sbjct: 431 HSIAQASGGTFSFIEDEGVMQDAFAQCIGGLLS 463 >gi|312072922|ref|XP_003139287.1| von Willebrand factor domain-containing protein [Loa loa] gi|307765555|gb|EFO24789.1| von Willebrand factor domain-containing protein [Loa loa] Length = 2142 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 41/332 (12%), Positives = 97/332 (29%), Gaps = 30/332 (9%) Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 D ++I+ A + DA+ ++ I E+ S Y + ++ + AN Sbjct: 763 DSMQTITTAAAKLRQTDAEVYVVP-ITENYSKDELSLYVGDQGILITHPKAYKKKFANYA 821 Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS---FIPALLRIEMGERPIFLI 193 + NNT+ + + D I + + I + P+F Sbjct: 822 PLNDNNTVVKDFVEIARMDEPFTQIIVTTLFNDQSHVTNKPDMIVKSIAENETRHPLFDF 881 Query: 194 ELVVDLSGSMHCAMN-SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV--KE 250 +L S + DP + D ++ + L + I + Sbjct: 882 DL---SSAEIKSKAQIQDPNCLVDLIFIVDTSQSVEKTFQKQLQFAVTLIKQIPPSAFNN 938 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDK 309 + + I +++ + N + + + + T S + A + + Sbjct: 939 RIRVAAISFSSEAQINFQF-NEFNNRTEILNALLSLTHSGGNTSSVSGINLAIKEIL--- 994 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + ++ I+ ++DG N+ + + D+ + I+ Sbjct: 995 -------------ERGREDVRRMIVLMSDG--NSQDCWEDLLDASDRLHATNTIVYAIAA 1039 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 N + L S + + L Sbjct: 1040 NPDYYFRELEAYTRSKWLVYVDGREGRFLNDA 1071 >gi|316931543|ref|YP_004106525.1| hypothetical protein Rpdx1_0148 [Rhodopseudomonas palustris DX-1] gi|315599257|gb|ADU41792.1| Protein of unknown function DUF2134, membrane [Rhodopseudomonas palustris DX-1] Length = 443 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 55/440 (12%), Positives = 135/440 (30%), Gaps = 68/440 (15%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 + N ++IFAL ++ + IG + + + +++A +AA + S+ Sbjct: 12 RDRRGNIAVIFALVLIPLISAIGCAVDYSRANALRTKLQAAADAASVGAVSRTSPAYIAA 71 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 G ++ A + DDA R +L+ + ++ + S + + + Sbjct: 72 GSMS---TDGAITSGADDALRIFNG----NLANLTGYTLDSVAATVSKSGEAVTSKVTFS 124 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 S+ + MT + ++ P++ ++ Sbjct: 125 AQISTMFMKAVAVSSMTVGGVS-TATASMPKYIDFYLLLDNSPSMGVGATPTDVAAMVAA 183 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKR-----TKMAALKNALLLFLDSIDLLSHVKE 250 + S HCA DVN+ + + T++ L++A +D+ + Sbjct: 184 TANKSSDDHCAFACH--DVNNKNNYYNLAKALGITTRIDVLRSATQQLMDTAAATATYTN 241 Query: 251 DVYM-----GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 M G T + S + +D + +K + Y + Sbjct: 242 QFRMAIYDFGASAQTAGLRNLFALSASLSS-AKTAAGAIDLMTVKGQNDNNDQDTQYTAI 300 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------------SNVNTIKI 353 + K++ F++DG + + + + Sbjct: 301 LPAIDKLIAAPGTGAAG-----SPLKYLFFVSDGVADEYNPACLKPKTGNRCQSPINPAL 355 Query: 354 CDKAKENFIKI---VTISINASPNG----------------------QRLLKTCVSSPEY 388 C K+ +++ T ++ N + ++ C SP + Sbjct: 356 CKTLKDRGVRVAVLYTTYLDLPSNDWYKKWIAPFNEGPYGPSPNSEIAKNMEACA-SPGF 414 Query: 389 HYNVVN----ADSLIHVFQN 404 ++ V A+++ +F+ Sbjct: 415 YFEVSPTQGIAEAMNALFKR 434 >gi|308462096|ref|XP_003093334.1| hypothetical protein CRE_03436 [Caenorhabditis remanei] gi|308250345|gb|EFO94297.1| hypothetical protein CRE_03436 [Caenorhabditis remanei] Length = 384 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 70/213 (32%), Gaps = 35/213 (16%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + VVD S M + ++A L + + + + Sbjct: 37 LDVVAVVDNSKVMG-----------------NGDLAQIAVLITEIFAESRIGTSIPNQPK 79 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ Y N G +K + +D+ I +S + ++Y Sbjct: 80 TTRLGLVTYNW----NATIQAGLDKFQS--QQDVFENIFNALNSVSSTSESY----LANG 129 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 N +G + +QK I+ ++ S+ N I I D+ K+ I I+T+ N Sbjct: 130 LVAAENVLARGPNRGN-NYQKVIVLFA----ASYSSHSNPIAIADRLKQAGITIITMGYN 184 Query: 371 ASPNGQRLLKTCV--SSPEYHYNVVNADSLIHV 401 + + +SP + + + + Sbjct: 185 -NVGDPNFYQNLAKIASPNKSFTEKSLSQIGDI 216 >gi|258516146|ref|YP_003192368.1| von Willebrand factor type A [Desulfotomaculum acetoxidans DSM 771] gi|257779851|gb|ACV63745.1| von Willebrand factor type A [Desulfotomaculum acetoxidans DSM 771] Length = 219 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 50/158 (31%), Gaps = 35/158 (22%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + L++D SGSM + A+K + + + Sbjct: 11 RLPVYLLLDRSGSMFGE--------------------PIEAVKQGVKYMISELKKEPQAI 50 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 E Y+ +I + + ++++ + ++ + +++ T S A Sbjct: 51 ETAYISVITFGSDARQDVQLT----ELAAFKEPQIEAN---GTTSLGAALHILNNCFD-- 101 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 N R+ ++ + +TDGE + N Sbjct: 102 ------NEVRKSTPTQKGDYKPLVFIMTDGEPTDDWEN 133 >gi|153954292|ref|YP_001395057.1| hypothetical protein CKL_1667 [Clostridium kluyveri DSM 555] gi|219854893|ref|YP_002472015.1| hypothetical protein CKR_1550 [Clostridium kluyveri NBRC 12016] gi|146347173|gb|EDK33709.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555] gi|219568617|dbj|BAH06601.1| hypothetical protein [Clostridium kluyveri NBRC 12016] Length = 580 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 46/250 (18%), Positives = 82/250 (32%), Gaps = 48/250 (19%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 V+FIP + + V+D SGSM SDPE + A+K Sbjct: 19 VAFIPFSGVLAAQSGTSMDVIFVLDSSGSMA---ESDPEKIRE------------EAIKM 63 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKP- 291 L + + GL+ Y+ + + S + + + + + L Sbjct: 64 FLDMGQIQGNKA---------GLVAYSDSIVREHNLDSINSSEDKDRIKSMASDISLGQR 114 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 TD+ + +A +++ S K II L+DG+N+ +S ++ Sbjct: 115 TDTGRGLLEAVKLMESGHKSGNNP----------------VIILLSDGKNDPERSQDESL 158 Query: 352 KICDKA----KENFIKIVTIS--INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 A K + TI N + + +L S+ Y A L + +I Sbjct: 159 NDLKNALQICKSKGYPVYTIGLNYNGTVDKTQLGDISSSTGGKDYITNTASDLPAILTDI 218 Query: 406 SQLMVHRKYS 415 K Sbjct: 219 YADNSKLKVQ 228 >gi|56797851|emb|CAF33338.1| matrilin-3a [Danio rerio] Length = 295 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 59/173 (34%), Gaps = 24/173 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 D +D L + + ++ Y + V+ K + T + A Sbjct: 87 LADMVDTLDVGPDATRVAVVNYASTVKIEFLLKSHLTKDTIKQAITRIEPLAAGTMTGMA 146 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 +K+A ++K G + S K I +TDG + V + A Sbjct: 147 IKKAMDEAFTEK----------SGARPKSKNISKVAIIVTDG---RPQDQVEEVS--AAA 191 Query: 358 KENFIKIVTISI-NASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 + + I+I + + A +L+ S+P ++ + V + L F+ Sbjct: 192 RASGIEIYAVGVDRADMRSLKLM---ASNPLEDHVFYVETYGVIEKLTSKFRE 241 >gi|297565996|ref|YP_003684968.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296850445|gb|ADH63460.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 717 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 40/238 (16%), Positives = 81/238 (34%), Gaps = 52/238 (21%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 S A+ + E + LV+D+SGSM + L Sbjct: 288 SSLADAIPLEPLQEDGGVGLVLVLDVSGSML----------------------EADKLGL 325 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTR---VEKNIEPSWGTEKVRQYVTRDMDSLILK 290 A+ L+ I ++ Y+G++ +++ V + + + R+ + + Sbjct: 326 AVAGSLELIRSA---RDQDYIGVVAFSSSARWVFRPRPMT--PQGRREAESLLLSVRAGG 380 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T+ A +A Q L K K ++ LTDG + + T Sbjct: 381 GTEIGEAYAEALQALRGLKTE------------------DKQVLVLTDGLVQDP--TLPT 420 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLIHVFQNISQ 407 ++ +A+ N I+ +++ S + L+ Y+V + L F +Q Sbjct: 421 LQAAHQAQANKIRTNAVAL-GSDADRAFLRELAKQGGGTFYDVPSPKDLPRFFLEEAQ 477 >gi|319425442|gb|ADV53516.1| lipoprotein with VWA and DUF3520 domains [Shewanella putrefaciens 200] Length = 638 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 72/234 (30%), Gaps = 52/234 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 E + ++D+SGSM K+ L+ AL + + Sbjct: 245 SELGASNLVFLLDVSGSMA-------------------SADKLPLLQTALKMLTQQLSAQ 285 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 V VY G G ++ + ++ T+ + + QAYQ Sbjct: 286 DKVSIVVYAGAAG--VVLDGASG-----DDIQALTYALEQLRAGGSTNGSQGILQAYQ-- 336 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKI 364 ++ + +I TDG+ N +N I + +K K+ I + Sbjct: 337 -----------LAQKHFIQGGINR---VILATDGDFNVGVTNFDQLISLIEKEKQRGIGL 382 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNA---------DSLIHVFQNISQLM 409 T+ +L++ HY ++ D L I++ + Sbjct: 383 TTLGFGMGNYNDQLMEQLADKGNGHYAYIDTLNEARKVLVDELSSTLLTIAKDV 436 >gi|18700173|gb|AAL77698.1| At2g38970/T7F6.14 [Arabidopsis thaliana] Length = 692 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 43/296 (14%), Positives = 101/296 (34%), Gaps = 60/296 (20%) Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 E Q + + S N+ + + I+ + + D R +F + + + Sbjct: 178 EQQLVFSGKSYSGALENNHPVRMMDLKIYPEVSAVPRADSREKFDVLVHLRAAAMVTGNA 237 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 +I R + V+D+SGSM TK+A LK A+ Sbjct: 238 NSLNNQISRYPRAPVDLVTVLDISGSMAG--------------------TKLALLKRAMG 277 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTRDMDSLILKPT 292 + ++ + +I +++ + + G ++ Q V + + T Sbjct: 278 FVIQNLGSND------RLSVIAFSSTARRLFPLTKMSDAGRQRALQAVNSVVAN---GGT 328 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVN- 349 + +++ +++ Q K P II L+DG +++ N Sbjct: 329 NIAEGLRKGVKVM------------EDQRDKNPVAS----IILLSDGRATYTMNQADPNY 372 Query: 350 ------TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 ++ C+ K I + + S + L+ + + ++ + ++S+I Sbjct: 373 KLLLPLSMHGCES-KRFQIPVHSFGF-GSDHDASLMHSVSETSGGTFSFIESESVI 426 >gi|300853770|ref|YP_003778754.1| hypothetical protein CLJU_c05700 [Clostridium ljungdahlii DSM 13528] gi|300433885|gb|ADK13652.1| hypothetical protein CLJU_c05700 [Clostridium ljungdahlii DSM 13528] Length = 484 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 46/248 (18%), Positives = 90/248 (36%), Gaps = 48/248 (19%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 FIP+ P + V+D SGSM SDPE++ + A+K L Sbjct: 21 FIPSQKVYASTSSPNVDVVFVLDSSGSMK---QSDPEEIRT------------EAIKMFL 65 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKP-TD 293 + + GL+ Y+ + K + +E ++ + ++ L TD Sbjct: 66 DMGQVQGNKA---------GLVAYSDTIIKEHNLDAINSESDKERIEEMASNIPLGQKTD 116 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + + ++L S + II L+DG+N++ +S ++K Sbjct: 117 IGRGLLEGAKVLDSGHDSNNRP----------------LIILLSDGKNDSQRSASESLKD 160 Query: 354 CDKA----KENFIKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A K + TI +N + + +L + + +Y A L + ++I Sbjct: 161 LNSAISTCKSKGYPVYTIGLNYDGTVDKAQLTQIASETKGKNYITNKASDLTDILKDIYG 220 Query: 408 LMVHRKYS 415 + K Sbjct: 221 DSANVKVQ 228 >gi|207028763|ref|NP_001124799.1| inter-alpha-trypsin inhibitor heavy chain H5 [Pongo abelii] Length = 942 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 46/222 (20%), Positives = 77/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + H Sbjct: 296 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQDHFS--- 332 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 333 ---IIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 384 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 385 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|56403909|emb|CAI29739.1| hypothetical protein [Pongo abelii] Length = 694 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 46/222 (20%), Positives = 77/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + H Sbjct: 48 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQDHFS--- 84 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + +R TD A+++A ++L Sbjct: 85 ---IIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL----- 136 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 N + I I+FLTDG+ +T+KI + +E + I TI Sbjct: 137 -----NKYVAHSGIGDRSVS-LIVFLTDGK--PTVGETHTLKILNNTREAARGQVCIFTI 188 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 189 GIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 230 >gi|332295408|ref|YP_004437331.1| von Willebrand factor type A [Thermodesulfobium narugense DSM 14796] gi|332178511|gb|AEE14200.1| von Willebrand factor type A [Thermodesulfobium narugense DSM 14796] Length = 438 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 19/147 (12%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 LL QK+ + + E+ ER + V+D SGSM +N Sbjct: 12 RLLADTSGQKLFVLLSIEVSKEVEERGKLFVSFVLDTSGSMSETVN-------------- 57 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 ++K+ + +L L+S + +K+D + ++ + V+ + + TEK + + + Sbjct: 58 -DKSKIEIVIESLKKILES----NILKDDDEISIVTFDDEVKIVLPFTAATEKEKIFSSF 112 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDK 309 + T+ MK + +L Sbjct: 113 EQIRTGTVGTNLGAGMKVSLDLLKDKA 139 >gi|328865949|gb|EGG14335.1| Ubiquitin-conjugating enzyme E2 4 [Dictyostelium fasciculatum] Length = 563 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 41/250 (16%), Positives = 91/250 (36%), Gaps = 28/250 (11%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 L S P L +++ G R + +V+DLSGSM + + + + Sbjct: 121 LQINDTTSAGSPPPPLSQVDDGIRQ-LDV-IVLDLSGSMQ------QPAYIGSRVPGELE 172 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 T++ A + F+D + V +GL+ + ++E S + Sbjct: 173 MTRIEAAQATFQTFIDRFVSY---RYPVAVGLVCFGQKIEATFPISSNFDSF-------- 221 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 S L ++ + + ++ + + P++ + I LTDGE+N Sbjct: 222 -SNELGEVEAHQSQTRLWEAIKRAAEVIVEFKKSPTLKLAPNVRSR--IFCLTDGEDNGS 278 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT-CVSSPEYHYNVVNADSLIHVFQ 403 + D + + I + +I I G+ L ++ + ++ + +F+ Sbjct: 279 ---TPVFTVFDYLRTHGIILDSIPIG--QQGRATLSAFSKATGGTCFVANSSIECVQLFE 333 Query: 404 NISQLMVHRK 413 + LM+ + Sbjct: 334 REALLMLEHR 343 >gi|327458562|gb|EGF04912.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1] Length = 463 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 43/320 (13%), Positives = 98/320 (30%), Gaps = 52/320 (16%) Query: 49 KKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAK----RALIDDAKRFIKNHIKE 104 + SM+ + + S V + S+ + +S A ++ N K+ Sbjct: 52 MRTSMQ-YVDRTVGKATSIFVLDDSKFKGSKQGLSREWSYIGLSADGKKVLNYVWNKKKQ 110 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMD---VMTSYDYRLQFI 161 + + N+ + N I YN+ T+ + Sbjct: 111 DW--DVSELGTKSLYNMKLDLEFKTEGAYQD-----NRLISYNLSGKYPDTNNKLSIDTA 163 Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPI-----FLIELVVDLSGSMHCAMNSDPEDVNS 216 LN + V+ + I PI + V D SGSM + + NS Sbjct: 164 ISALNTKQVFSKVAKGKKGIAIAYRTDPIQGQMNIAVSFVFDTSGSMDWDLQGRNVNPNS 223 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR----VEKNIEPSWG 272 ++M L+ + + + + ++ + L+G+++ ++ G Sbjct: 224 G------TESRMTILRKKAEIMIKDLKGIGNIS----VNLVGFSSSGKYIQKEFSNLDNG 273 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + + + T+ ++ L S + K+ Sbjct: 274 ADTIIGTIKDPKKLVPDGVTNPGDGLRYGLISLQSQPAQL------------------KY 315 Query: 333 IIFLTDGENNNFKSNVNTIK 352 ++ LTDG N + + + + Sbjct: 316 VVLLTDGIPNTYIVDPSALY 335 >gi|242097658|emb|CAY86115.1| truncated collagen type VI alpha 4 precursor [Homo sapiens] Length = 385 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 20/122 (16%), Positives = 45/122 (36%), Gaps = 20/122 (16%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMKQAY 302 L + V +GL Y + + ++ + + +L T++ A++ Sbjct: 264 LDISSDRVRVGLAQYNDNIYPAFQL--NQHPLKSMILEQIQNLPYRTGGTNTGSALEF-- 319 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + ++ T K + +I +TDGE+N+ V D+ KE+ + Sbjct: 320 ------IRTNYLTEESGSRAKD---RVPQIVILVTDGESNDEVQEVA-----DRLKEDGV 365 Query: 363 KI 364 + Sbjct: 366 VV 367 >gi|240172225|ref|ZP_04750884.1| hypothetical protein MkanA1_23119 [Mycobacterium kansasii ATCC 12478] Length = 335 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 43/234 (18%), Positives = 74/234 (31%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D + +M A + A F D + + Sbjct: 99 VMLVIDVSQSM---------------RATDVEPNRMVAAQEAAKQFADELTPG------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ + + + + + T + A+ A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNRDATKNALDKLQFADR---TATGEAIFTALQAIATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNTIKICDKAKENFIKIVTISI 369 + P I+ +DG+ N + AK+ + I TIS Sbjct: 194 ---------IGGGDTPPPARIVLFSDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISF 244 Query: 370 NAS-----PNGQ--------RLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 NGQ LK S YN L V+ ++ Q + Sbjct: 245 GTPYGFVEINGQRQPVPVDDETLKKVAQLSGGNAYNAATLAELKSVYASLQQQI 298 >gi|224081306|ref|XP_002190595.1| PREDICTED: matrilin 1, cartilage matrix protein [Taeniopygia guttata] Length = 493 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 34/169 (20%), Positives = 59/169 (34%), Gaps = 24/169 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQ 300 I+ L +G+I Y + V + G + L T + A++ Sbjct: 66 IEALDVGPNATRVGVINYASAVRNELSLQ-GPHSKAALLQAVRRIQPLSTGTMTGLAIQF 124 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A S + G + + F+K I +TDG + +V+ +A+ Sbjct: 125 AISRAFSAAE----------GGRGSAPNFKKVAIVVTDGRPQDGVQDVS-----ARARAA 169 Query: 361 FIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 I+I I + G L+ S P E+ V + + L H FQ Sbjct: 170 GIEIFAIGVGRVDMGT--LRQMASEPLDEHVDYVESYSVIEKLTHKFQE 216 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 76/199 (38%), Gaps = 34/199 (17%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K + +DS+++ + +GL+ Y++ V + K Sbjct: 284 RPENFELVKKFINQIVDSLEVSD---KQAQVGLVQYSSSVRQEFPLGQFKSKKDIKAAVK 340 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 S + K T + A+K S N R GV I+F TDG + + Sbjct: 341 KMSYMEKGTMTGQALKYLVDSSFSA------INGARPGVPKVG------IVF-TDGRSQD 387 Query: 344 FKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 + S+ KAK++ ++ + + NA + L+ S P E+++ + Sbjct: 388 YISDAA-----KKAKDSGFRMFAVGVGNAVEDE---LREIASEPVAEHYFYTAD------ 433 Query: 401 VFQNISQLMVHRKYSVILK 419 F+ IS++ + + ++ Sbjct: 434 -FRTISKIGKKLQMKICVE 451 >gi|161408065|dbj|BAF94136.1| Dual Intracellular Von Willebrand factor domain A [Homo sapiens] Length = 276 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 20/122 (16%), Positives = 45/122 (36%), Gaps = 20/122 (16%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMKQAY 302 L + V +GL Y + + ++ + + +L T++ A++ Sbjct: 155 LDISSDRVRVGLAQYNDNIYPAFQL--NQHPLKSMILEQIQNLPYRTGGTNTGSALEF-- 210 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + ++ T K + +I +TDGE+N+ V D+ KE+ + Sbjct: 211 ------IRTNYLTEESGSRAKD---RVPQIVILVTDGESNDEVQEVA-----DRLKEDGV 256 Query: 363 KI 364 + Sbjct: 257 VV 258 >gi|239831900|ref|ZP_04680229.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301] gi|239824167|gb|EEQ95735.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301] Length = 777 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 39/229 (17%), Positives = 66/229 (28%), Gaps = 70/229 (30%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + ++ + V+D SGSM T + K +L L + Sbjct: 371 VASAKKAQREVVFVIDNSGSMGG--------------------TSIEQAKASLDYALSHL 410 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPS-----WGTEKVRQYVTRDMDSLILKPTDSTPA 297 +I + + + E S R +V T PA Sbjct: 411 QPGD------RFNVIRFDDTLTRFFEVSVEASQQNIASARHFVMSLEAQ---GGTAMLPA 461 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A S N RQ I+FLTDGE +N + ++ I A Sbjct: 462 LHAALD-------DSHQGNGLRQ------------IVFLTDGEISNEQQLLDAIA----A 498 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + +I + I ++P + A+ F +I Sbjct: 499 RRGRSRIFMVGI-------------GTAPNSYLMNHAAELGRGTFTHIG 534 >gi|294141918|ref|YP_003557896.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578715|dbj|BAG66042.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328387|dbj|BAJ03118.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 689 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 40/278 (14%), Positives = 86/278 (30%), Gaps = 58/278 (20%) Query: 146 YNMDVMTSYDYRLQFIEHL-LNQRYNQKIVSFIP-ALLRIEMGERPIFLIELVVDLSGSM 203 Y ++ + L + + K++ I ++ + + ++D+SGSM Sbjct: 221 YQYSTPSTVEQPFSVNTELAPSPYNDHKMLLRIGLKGYEVDKSQLGASNLVFLLDVSGSM 280 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 R K+ LK +L + + HV VY G G Sbjct: 281 -------------------NSRDKLPLLKTSLKMLSQQLSEQDHVSIVVYAGASGVVLD- 320 Query: 264 EKNIEPSWGTEKVRQY-VTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 G + Y + + +++L T+ ++QAY ++ Sbjct: 321 --------GVKGNDIYAINQALNNLKAGGSTNGGAGIQQAY-------------GLAQKH 359 Query: 322 VKIPSLPFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 + +I TDG+ N + + + ++ I + T+ L++ Sbjct: 360 FIQGGVNR---VILATDGDFNVGTTDHQALMDLIASKRDQGIALTTLGFGQGNYNDHLME 416 Query: 381 TCVSSPEYHYNVVNA---------DSLIHVFQNISQLM 409 HY ++ D L I++ + Sbjct: 417 QLADKGNGHYAYIDTLNEARKVLVDELSSTLLTIAKDV 454 >gi|268579981|ref|XP_002644973.1| Hypothetical protein CBG10938 [Caenorhabditis briggsae] Length = 548 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 38/77 (49%), Gaps = 9/77 (11%) Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRL--LKTCVSSPE 387 K ++F TDG S +T + + K I + T+ I+ N G + LK +SP Sbjct: 474 KLLVF-TDG-----YSAEDTAEGIEALKRQGITVYTVGISTDKNAGLNVSELKGMATSPS 527 Query: 388 YHYNVVNADSLIHVFQN 404 ++++ + D+L+ F + Sbjct: 528 HYFDSSDFDNLLKHFPS 544 >gi|282879637|ref|ZP_06288368.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] gi|281306585|gb|EFA98614.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] Length = 332 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 45/246 (18%), Positives = 72/246 (29%), Gaps = 72/246 (29%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D K ++ A KN F+ Sbjct: 88 IDIMLAMDVSTSMLAE---------------DLKPNRLEAAKNVAAEFI----SGRPNDN 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVR-----QYVTRDMDS--LILKPTDSTPAMKQAYQ 303 +GL + + T+ + V D+ + LI T + A Sbjct: 129 ---IGLTIFAGEAFTQCPMT--TDHTSLLNLLRNVRTDIAARGLISDGTAVGMGLANAVS 183 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K +S K +I LTDG NN + T AK I+ Sbjct: 184 RLKDSKTKS------------------KVVILLTDGSNNMGDISPMTSAQ--IAKSLDIR 223 Query: 364 IVTISI----NAS-----------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + TI + A + + L + Y N L ++ Sbjct: 224 VYTIGVGTNKVAPYPMSVGGGTQYINIPVEIDSKTLSDIAAVTEGNFYRATNNQQLKQIY 283 Query: 403 QNISQL 408 ++I +L Sbjct: 284 KDIDKL 289 >gi|262050538|ref|NP_001159921.1| inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 precursor [Homo sapiens] Length = 900 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + K R + L T+ AM A Q+L S Sbjct: 311 --NLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDS 365 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 366 SNQ--------EERLPEGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|221042206|dbj|BAH12780.1| unnamed protein product [Homo sapiens] Length = 888 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 263 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 298 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + K R + L T+ AM A Q+L S Sbjct: 299 --NLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDS 353 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 354 SNQ--------EERLPEGSVSL---IILLTDGDPTVGETNPRSIQ 387 >gi|219517748|gb|AAI36393.1| ITIH4 protein [Homo sapiens] Length = 935 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + K R + L T+ AM A Q+L S Sbjct: 311 --NLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDS 365 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 366 SNQ--------EERLPEGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|187950343|gb|AAI36394.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Homo sapiens] Length = 930 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + K R + L T+ AM A Q+L S Sbjct: 311 --NLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDS 365 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 366 SNQ--------EERLPEGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|119585669|gb|EAW65265.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein), isoform CRA_b [Homo sapiens] Length = 914 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + K R + L T+ AM A Q+L S Sbjct: 311 --NLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDS 365 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 366 SNQ--------EERLPEGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|119585668|gb|EAW65264.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein), isoform CRA_a [Homo sapiens] Length = 930 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + K R + L T+ AM A Q+L S Sbjct: 311 --NLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDS 365 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 366 SNQ--------EERLPEGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|4096840|gb|AAD05198.1| inter-alpha-trypsin inhibitor family heavy chain-related protein [Homo sapiens] Length = 930 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + K R + L T+ AM A Q+L S Sbjct: 311 --NLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDS 365 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 366 SNQ--------EERLPEGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|7770149|gb|AAF69610.1|AF119917_18 PRO1851 [Homo sapiens] Length = 644 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 5 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 40 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + K R + L T+ AM A Q+L S Sbjct: 41 --NLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDS 95 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 96 SNQ--------EERLPEGSVSL---IILLTDGDPTVGETNPRSIQ 129 >gi|31542984|ref|NP_002209.2| inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 precursor [Homo sapiens] gi|229463048|sp|Q14624|ITIH4_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; AltName: Full=Inter-alpha-trypsin inhibitor family heavy chain-related protein; Short=IHRP; AltName: Full=Plasma kallikrein sensitive glycoprotein 120; Short=Gp120; Short=PK-120; Contains: RecName: Full=70 kDa inter-alpha-trypsin inhibitor heavy chain H4; Contains: RecName: Full=35 kDa inter-alpha-trypsin inhibitor heavy chain H4; Flags: Precursor gi|1402590|dbj|BAA07536.1| PK-120 precursor [Homo sapiens] Length = 930 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + K R + L T+ AM A Q+L S Sbjct: 311 --NLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDS 365 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 366 SNQ--------EERLPEGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|1483187|dbj|BAA07602.1| inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP) [Homo sapiens] Length = 930 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + K R + L T+ AM A Q+L S Sbjct: 311 --NLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDS 365 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 366 SNQ--------EERLPEGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|241672093|ref|XP_002411438.1| neurogenic locus notch, putative [Ixodes scapularis] gi|215504089|gb|EEC13583.1| neurogenic locus notch, putative [Ixodes scapularis] Length = 1597 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 35/241 (14%), Positives = 81/241 (33%), Gaps = 58/241 (24%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 F+ + R + I V+D SGS+ + + A Sbjct: 31 FVSTIERYSTTKN---DIVFVLDESGSIGADVFP---------------------AELAF 66 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTR--VEKNIEPSWGTEKVRQYVTRD-MDSLILKPT 292 + + ++S E + ++ ++ V + S G + ++V T Sbjct: 67 TEMVARLLVVSP--EYSRLTVMTFSNDNLVHIDQVGSSGDTNMCKFVHELNQIPYRSGGT 124 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ A +IL + ++ + + ++ ++DG+ N+ + Sbjct: 125 RTREALGYAGEILWNARQEAN-----------------RIVVLISDGQANSGSEPSEIAR 167 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + I I + + A N LL SSP + Y ++ F+ I ++ Sbjct: 168 L---LRVKGIVIFGVGV-AHINKDELL-DVASSPAHTY-------MLRNFEYIKKVNKDL 215 Query: 413 K 413 + Sbjct: 216 R 216 >gi|121606137|ref|YP_983466.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120595106|gb|ABM38545.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 354 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 95/304 (31%), Gaps = 68/304 (22%) Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM 207 + S + + LL+ S A++ + I L +D+SGSM Sbjct: 50 IGTGQSVRRHIPPLLFLLSLAVMLVAASRPLAVITLPSQNE---TIILAMDVSGSM---- 102 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 D ++ A +NA FL + +V +G++ + Sbjct: 103 -----------RATDVLPNRLVASQNAAKAFLADL------PRNVRVGVVAFAGTAAVVQ 145 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT----------SDKKRSFFTNF 317 P+ E + + + + + T + + L + + Sbjct: 146 PPTVSREDLTAAIDKFQ---LQRGTAIGNGIIVSLAELFPEAGIDLESMENNRERKHGLS 202 Query: 318 FRQGVKIPSLPFQKF------------IIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 Q K + F II LTDG+ +++ K+ A + I++ Sbjct: 203 LDQAGKDDGNGKKAFTPVAPGSYTSAAIILLTDGQRTTGIDSLDAAKV---AADRGIRVY 259 Query: 366 TISI--------------NASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI-SQLM 409 T+ + + LK ++ ++ A L V+Q + S+L Sbjct: 260 TVGVGTVEGETIGFEGWSMRVKLDEETLKGIARATQAEYFYAGTATDLKKVYQTLSSRLT 319 Query: 410 VHRK 413 V +K Sbjct: 320 VEKK 323 >gi|320450208|ref|YP_004202304.1| von Willebrand factor, type A [Thermus scotoductus SA-01] gi|320150377|gb|ADW21755.1| von Willebrand factor, type A [Thermus scotoductus SA-01] Length = 414 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 34/250 (13%), Positives = 76/250 (30%), Gaps = 49/250 (19%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 L+ ++ ++ E+ ERP + V+D SGSM Sbjct: 20 LPLKPGVRATGPTRMPMLLRVKIPPVQAEV-ERPPLNLAFVLDRSGSMAGD--------- 69 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGT 273 K+ K A+ ++++ + ++ Y +VE + Sbjct: 70 -----------KLKFAKKAVAYAVENLRPHD------RVAVVIYDHQVEVVVPSTLAENK 112 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 E++ R T+ + + ++ + I Sbjct: 113 EEIL---RRLRPVRPRGSTN----LHAGWLEGSTQVAAHLDAK-----------RLNRVI 154 Query: 334 IFLTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 + L+DG N ++N N I + + T+ + N ++ + +Y + Sbjct: 155 V-LSDGLANTGETNPNVIAEQVRGLSQRGVSTSTLGVGLDYNEDLMMAMAEAGQGNYYFI 213 Query: 393 VNADSLIHVF 402 + D L +F Sbjct: 214 ESPDDLPGIF 223 >gi|293334601|ref|NP_001168718.1| hypothetical protein LOC100382510 [Zea mays] gi|223950381|gb|ACN29274.1| unknown [Zea mays] Length = 629 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 41/317 (12%), Positives = 83/317 (26%), Gaps = 57/317 (17%) Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ 167 F + E ++ ++ F + TS + + HL Sbjct: 100 ADPGTFNDDEPLDLSCEEANGTQQGCLRTVEIKAYPEFTQVPENTS-ERNFTVLVHLKAP 158 Query: 168 RYNQKIVSFIP-ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + S R + V+D+SGSM T Sbjct: 159 PAQHLLQSSSDLGDGNGVSTTRAPVDLITVLDVSGSMAG--------------------T 198 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMD 285 K+A LK A+ + ++ + +I +++ + T +Q + Sbjct: 199 KLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRQQSLLAVNS 252 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ ++ K S + + S II L+DG++ Sbjct: 253 LTANGGTNIAEGLR----------KGSKVIEERQSKNPVCS------IILLSDGQDTYTV 296 Query: 346 SNVNTIK-----IC------DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 S + C + + A + L +S + Sbjct: 297 SPTAGVHKGATEYCALLPSTTTNGSQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIET 356 Query: 395 ADSLIHVFQN-ISQLMV 410 ++ F I L+ Sbjct: 357 EATIQDAFAQCIGGLLS 373 >gi|114587348|ref|XP_001172665.1| PREDICTED: similar to PK-120 precursor isoform 1 [Pan troglodytes] Length = 849 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 52/160 (32%), Gaps = 35/160 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + AL+ LD + V Sbjct: 263 VVFVIDKSGSMSGR--------------------KIQQTREALIKILDDLSPRDQFNLIV 302 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + R + K R + L T+ AM A Q+L S + Sbjct: 303 FSTEASQ-WRPSLVPASAENVNKARSFAVGIQA---LGGTNINDAMLMAVQLLDSSNQ-- 356 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + S+ II LTDG+ ++N +I+ Sbjct: 357 ------EEQLPEGSVSL---IILLTDGDPTVGETNPRSIQ 387 >gi|223462569|gb|AAI50696.1| RIKEN cDNA E330026B02 gene [Mus musculus] Length = 1182 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 51/352 (14%), Positives = 101/352 (28%), Gaps = 64/352 (18%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRF---IKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 SN LG E+I A F + K+ + S Sbjct: 493 SNKPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAKKERGSKVPCHLVVLTNGMSRDSV 552 Query: 127 ISMTHMANNR--------LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + H + +N + Y + + L N R + Sbjct: 553 LGPAHKLREENIRVHAIGVKEANQMQLREIAGEEKRVYYVHDFDALRNIRN-----QVVQ 607 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + E I +VD SGS+ + +KM K + Sbjct: 608 EICAEEACRDMKADIMFLVDSSGSIG-----------------PENFSKM---KMFMKNL 647 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + V +G++ ++ ++ + T + + +D + Sbjct: 648 VSKSQI---GADRVQIGVVQFSHENKEEFQL--NTFMSQSDIANAIDRMTHIG------- 695 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 T F + + K +KF+I +TDGE + + + Sbjct: 696 -----ETTLTGSALTFVSQYFSPDKGARPNVRKFLILITDGEAQDIVRDPAL-----ALR 745 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + I ++ + S Q L+ PE + V N + Q+I +V Sbjct: 746 KEGVIIYSVGVFGSNVTQ--LEEISGKPEMVFYVEN----FDILQHIEDDLV 791 >gi|193216292|ref|YP_001997491.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193089769|gb|ACF15044.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 346 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 73/189 (38%), Gaps = 35/189 (18%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 R++ +R + + +D+S SM D + +++ K + FL+ Sbjct: 80 RLKEVKRKGIEVVIALDVSNSMLAD---------------DIQPSRLQKSKYTISNFLER 124 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDSTPAMKQ 300 + + +GL+ + + + ++ ++ D++ + T+ + A+++ Sbjct: 125 L-----GND--RVGLVVFAGQSFVQCPITSDKSALKLFMDIVSTDAIPTQGTNFSSAIRE 177 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L ++ + R K I+ +DGE++ + ++A Sbjct: 178 SIRALERIEEGAEAEEKNRVR--------NKVILIFSDGEDHEAGIDEVL----EEAASK 225 Query: 361 FIKIVTISI 369 I+I T+ + Sbjct: 226 NIRIYTVGV 234 >gi|327412874|emb|CAX67882.1| putative Von Willebrand factor, type A [Salmonella bongori] Length = 325 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 84/227 (37%), Gaps = 46/227 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ++P+ + L++D+SGSM T++ A++ ++ F+ + Sbjct: 92 QKPMRDVVLILDVSGSMA-------------KNDVQGGATRLQAVQESVRKFVAA----- 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GL+ + ++ S + ++ + + ++ + T A+ ++L Sbjct: 134 --RQSDRIGLVIFASQAWPFAPVSEDKQALQTRINQLSPGMVGQQTAIGDALGVGVKLLD 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + K I LTDG N+ S + A + +++ T Sbjct: 192 N----------------TTNTEASKLAILLTDG--NDTASQLAPALAAQLAAAHHVQVHT 233 Query: 367 ISI-NASPNGQR-----LLKTCV--SSPEYHYNVVNADSLIHVFQNI 405 I+ + + G L++ + + + +L V++ I Sbjct: 234 IAFGDINSTGDDKVDLPLMQNIAQITGGQSWTAANSGAALDSVWKEI 280 >gi|159037814|ref|YP_001537067.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157916649|gb|ABV98076.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 427 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 44/252 (17%), Positives = 88/252 (34%), Gaps = 36/252 (14%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L+ + PAL E P +ELV+D+SGSM +D + Sbjct: 11 LVGLLAMSVMTGPAPALADGEAPVEPP-KVELVLDVSGSMRA---TDIDG---------- 56 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 R++++ + A +D++ + + + Y ++ G + +Q V Sbjct: 57 -RSRISVAQQAFNEVVDALP--DETQLGIRVLGATYPGENKER-----GCQDTQQIV--- 105 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + P D A K A L + + + I+ +TDGE+ Sbjct: 106 ----PVGPVDRVQA-KAAVATLRPTGFTPVGLALRSAAQDLGTGSTARRIVLITDGEDTC 160 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEYHYNVVNADSLIHV 401 + + A+ + + T+ + A ++LL ++ + +AD L Sbjct: 161 APPDPCEVARELAAQGTKLVVDTLGL-APDEKVRRQLLCIAAATGGTYTAAQSADELT-- 217 Query: 402 FQNISQLMVHRK 413 I QL+ + Sbjct: 218 -GRIKQLVDRAR 228 >gi|297735865|emb|CBI18619.3| unnamed protein product [Vitis vinifera] Length = 656 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 61/173 (35%), Gaps = 43/173 (24%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + R + V+D+ G M A K+ +K A+ Sbjct: 270 TTTSSLLNPARRAPIDLVTVLDVGGGMTGA--------------------KLQMMKRAMR 309 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDST 295 L + S+ + ++ ++ ++ + T R+ R ++SLI T + Sbjct: 310 LVISSLSSTD------RLSIVAFSASSKRLMPLKRMTTTGRRSARRIIESLIAGQGTSAG 363 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 A+K+A ++L ++R+ + I+ L+DG+N S Sbjct: 364 EALKKASKVLEDRRERNPVAS----------------IMLLSDGQNERVSSKS 400 >gi|225468694|ref|XP_002269894.1| PREDICTED: hypothetical protein, partial [Vitis vinifera] Length = 585 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 61/173 (35%), Gaps = 43/173 (24%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + R + V+D+ G M A K+ +K A+ Sbjct: 171 TTTSSLLNPARRAPIDLVTVLDVGGGMTGA--------------------KLQMMKRAMR 210 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDST 295 L + S+ + ++ ++ ++ + T R+ R ++SLI T + Sbjct: 211 LVISSLSSTD------RLSIVAFSASSKRLMPLKRMTTTGRRSARRIIESLIAGQGTSAG 264 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 A+K+A ++L ++R+ + I+ L+DG+N S Sbjct: 265 EALKKASKVLEDRRERNPVAS----------------IMLLSDGQNERVSSKS 301 >gi|225438159|ref|XP_002262605.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 830 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 61/173 (35%), Gaps = 43/173 (24%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + R + V+D+ G M A K+ +K A+ Sbjct: 416 TTTSSLLNPARRAPIDLVTVLDVGGGMTGA--------------------KLQMMKRAMR 455 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDST 295 L + S+ + ++ ++ ++ + T R+ R ++SLI T + Sbjct: 456 LVISSLSSTD------RLSIVAFSASSKRLMPLKRMTTTGRRSARRIIESLIAGQGTSAG 509 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 A+K+A ++L ++R+ + I+ L+DG+N S Sbjct: 510 EALKKASKVLEDRRERNPVAS----------------IMLLSDGQNERVSSKS 546 >gi|162457601|ref|YP_001619968.1| hypothetical protein sce9315 [Sorangium cellulosum 'So ce 56'] gi|161168183|emb|CAN99488.1| conserved hypothetical protein with a vWF type A domain [Sorangium cellulosum 'So ce 56'] Length = 617 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 33/227 (14%), Positives = 71/227 (31%), Gaps = 50/227 (22%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 RP + LV+D+SGSM MA ++A + Sbjct: 201 STAQAPAARPHLSVHLVLDVSGSMQGE--------------------PMARARDAARALV 240 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVE---KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 D +D L+ +++ + + ++ + + T+ Sbjct: 241 DRLDANDDFS------LVTFSSGADVRIDDGPVGPRRAAIKATIDGIREG---GGTNIGQ 291 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICD 355 + Y R G+ ++ K ++ L+DG N S+ ++ Sbjct: 292 GLALGY------------AQAARPGIPEGAV---KVVLLLSDGRANAGITSSERLSRLAL 336 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLIHV 401 A + ++ T + A +G L+ S +Y + + D + Sbjct: 337 DAFQGGVQTSTFGLGADYDG-ALMSAIASDGAGGYYYLRDPDQIAPA 382 >gi|73949160|ref|XP_544264.2| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1 [Canis familiaris] Length = 893 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 39/182 (21%), Positives = 64/182 (35%), Gaps = 44/182 (24%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + H Sbjct: 253 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQDHFS--- 289 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S + VR TD A+++A ++L Sbjct: 290 ---IIGFSNRIKVWKDHLVSVTPDNVRDGKIYIHHMSPTGGTDINGALQRAIKLLNDYVA 346 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 + + I+FLTDG+ +T+KI + KE + I TI Sbjct: 347 HNDIEDRSVS-----------LIVFLTDGK--PTVGETHTLKILNNTKEAARGQVCIFTI 393 Query: 368 SI 369 I Sbjct: 394 GI 395 >gi|291388325|ref|XP_002710627.1| PREDICTED: matrilin 3-like [Oryctolagus cuniculus] Length = 1109 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 66/167 (39%), Gaps = 21/167 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTP 296 ID L+ + +GL+ Y+T+V S+ T K + M + K + + Sbjct: 831 LTAIIDSLAVSPKAARVGLLQYSTQVRAEFTLRSFSTAKDMKKAVAHM-KYMGKGSMTGQ 889 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K ++ + + G + S + I TDG + +V+ + +K Sbjct: 890 ALKHMFERSFTQLE----------GARPLSARVPRVAIVFTDG---RAQDDVS--EWANK 934 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 AK N I + + + + + L+ S P ++ + + ++ + Sbjct: 935 AKANGITMYAVGVGKAIEEE--LQEIASEPTDKHLFYAEDFSTMNEI 979 >gi|297203405|ref|ZP_06920802.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] gi|197711494|gb|EDY55528.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] Length = 421 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 39/230 (16%), Positives = 73/230 (31%), Gaps = 33/230 (14%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 I + A G ++LV+D+SGSM T+MAA K Sbjct: 22 IAFPVTAAAGEPTGGTAP-QVDLVLDVSGSM--------------RTADIDGGTRMAAAK 66 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A L E+V +G+ + + + + + Y +D Sbjct: 67 RAFNEVL------DATPEEVQLGIRTLGANYPGDNQKTGCKDTAQLYPVSTLDRTEA--- 117 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 K L+ + K I+ ++DGE+ + + Sbjct: 118 ------KTQVATLSPTGWTPIGPALLKSAGDFTDSASSKRIVLISDGEDTCAPLDPCEVA 171 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 AK + I T+ + + ++ L C++ + +V +AD L Sbjct: 172 REIGAKGIGLTIDTLGLVPNTKMRKQLS-CIAEATGGTYTSVEHADELTD 220 >gi|297462925|ref|XP_608567.5| PREDICTED: collagen type VI alpha 6-like [Bos taurus] Length = 2343 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 42/301 (13%), Positives = 93/301 (30%), Gaps = 53/301 (17%) Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 + + N + H+ + +N T + Y + + L + R Sbjct: 541 TNGASRDSVSGPANRLSEELIHVYAIGVREANQTQLREIAGEEKRVYYVHDFDALKDIRN 600 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + + E + I +VD SGS+ + KM Sbjct: 601 -----QVVQEICAEEACKEMKADIMFLVDSSGSIGL-----------------ENFIKM- 637 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 K + + + + V +G++ ++ ++ + ++ ++ +D + Sbjct: 638 --KTFMKNLVSKSQI---RADRVQIGVVQFSDVNKEEFQL--NRYTSQEEISDAIDRMAH 690 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T F + + K +KF+I +TDGE + + Sbjct: 691 IG------------ETTLMGSALTFVSQYFSPAKGARPNVRKFLILITDGEAQDIVKDPA 738 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 +E I I ++ + S Q L+ PE + V N + ++I + Sbjct: 739 V-----ALREEGIIIYSVGVFGSNVTQ--LEEISGRPEMVFYVEN----FDILKHIEDDL 787 Query: 410 V 410 V Sbjct: 788 V 788 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 62/186 (33%), Gaps = 26/186 (13%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + LK +SI L + + +GL+ Y+ + S G + V Sbjct: 234 TNSSWEDFDYLKE---FLEESISALDIKEHCMRVGLVAYSNETKVISTLSRGVN--KSEV 288 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 +D+ SL P +AY K R F Q + + + +T Sbjct: 289 LQDIQSL-------APQAGKAYTGAALRKIRKEV--FSAQHGSRKNQGVPQIAVLVT--- 336 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 ++ + T + + + + TI + + + Q L+ S P Y + Sbjct: 337 -HSPSQDNVTKAAVN-LRRQGVIVFTIGVEGASDTQ--LEKIASHPAEQYVSQ-----LR 387 Query: 401 VFQNIS 406 F +++ Sbjct: 388 SFSDLA 393 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 59/370 (15%), Positives = 128/370 (34%), Gaps = 49/370 (13%) Query: 56 ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 +N +++ S+ V+ L D +S++ A +A A R I+ KE S N Sbjct: 270 SNETKVISTLSRGVNKSEVLQD-IQSLAPQAGKAYTGAALRKIR---KEVFSAQHGSRKN 325 Query: 116 TEIQNIVNSSRISMTHMANNRLDSS-------NNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 + I + +TH + + I + + V + D +L+ I ++ Sbjct: 326 QGVPQIA----VLVTHSPSQDNVTKAAVNLRRQGVIVFTIGVEGASDTQLEKIASHPAEQ 381 Query: 169 YNQKIVSFIPALLRIEM-----GERPIFLIELVVDLSGSM--HCAMNSDPEDVNSAPICQ 221 Y ++ SF + + + ++ + + ++ C + + Sbjct: 382 YVSQLRSFSDLAAHNQTFLKKLRNQITHTVSVISERTETLKAGCVDTEEADIYLLIDGSG 441 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQY 279 + T +K L ++ + V +G + Y R + E S T K VR+ Sbjct: 442 STQATDFQEMKTFLSEVAGMFNIAP---QKVRVGAVQYADRWDLEFEISKYTNKHDVRKA 498 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + + +T A A +R + + +P ++ LT+G Sbjct: 499 IE---NIRQMGGNRNTGA---ALNFTLGLLQR-------AKQQRGGRVPC--HLVVLTNG 543 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + + S ++ E I + I + Q L+ + Y V + D+L Sbjct: 544 ASRDSVSGPA-----NRLSEELIHVYAIGVR--EANQTQLREIAGEEKRVYYVHDFDALK 596 Query: 400 HVFQNISQLM 409 + + Q + Sbjct: 597 DIRNQVVQEI 606 >gi|297471452|ref|XP_002685218.1| PREDICTED: collagen, type VI, alpha 1-like [Bos taurus] gi|296490817|gb|DAA32930.1| collagen, type VI, alpha 1-like [Bos taurus] Length = 2268 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 42/301 (13%), Positives = 93/301 (30%), Gaps = 53/301 (17%) Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 + + N + H+ + +N T + Y + + L + R Sbjct: 541 TNGASRDSVSGPANRLSEELIHVYAIGVREANQTQLREIAGEEKRVYYVHDFDALKDIRN 600 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + + E + I +VD SGS+ + KM Sbjct: 601 -----QVVQEICAEEACKEMKADIMFLVDSSGSIGL-----------------ENFIKM- 637 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 K + + + + V +G++ ++ ++ + ++ ++ +D + Sbjct: 638 --KTFMKNLVSKSQI---RADRVQIGVVQFSDVNKEEFQL--NRYTSQEEISDAIDRMAH 690 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T F + + K +KF+I +TDGE + + Sbjct: 691 IG------------ETTLMGSALTFVSQYFSPAKGARPNVRKFLILITDGEAQDIVKDPA 738 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 +E I I ++ + S Q L+ PE + V N + ++I + Sbjct: 739 V-----ALREEGIIIYSVGVFGSNVTQ--LEEISGRPEMVFYVEN----FDILKHIEDDL 787 Query: 410 V 410 V Sbjct: 788 V 788 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 62/186 (33%), Gaps = 26/186 (13%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + LK +SI L + + +GL+ Y+ + S G + V Sbjct: 234 TNSSWEDFDYLKE---FLEESISALDIKEHCMRVGLVAYSNETKVISTLSRGVN--KSEV 288 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 +D+ SL P +AY K R F Q + + + +T Sbjct: 289 LQDIQSL-------APQAGKAYTGAALRKIRKEV--FSAQHGSRKNQGVPQIAVLVT--- 336 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 ++ + T + + + + TI + + + Q L+ S P Y + Sbjct: 337 -HSPSQDNVTKAAVN-LRRQGVIVFTIGVEGASDTQ--LEKIASHPAEQYVSQ-----LR 387 Query: 401 VFQNIS 406 F +++ Sbjct: 388 SFSDLA 393 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 59/370 (15%), Positives = 128/370 (34%), Gaps = 49/370 (13%) Query: 56 ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 +N +++ S+ V+ L D +S++ A +A A R I+ KE S N Sbjct: 270 SNETKVISTLSRGVNKSEVLQD-IQSLAPQAGKAYTGAALRKIR---KEVFSAQHGSRKN 325 Query: 116 TEIQNIVNSSRISMTHMANNRLDSS-------NNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 + I + +TH + + I + + V + D +L+ I ++ Sbjct: 326 QGVPQIA----VLVTHSPSQDNVTKAAVNLRRQGVIVFTIGVEGASDTQLEKIASHPAEQ 381 Query: 169 YNQKIVSFIPALLRIEM-----GERPIFLIELVVDLSGSM--HCAMNSDPEDVNSAPICQ 221 Y ++ SF + + + ++ + + ++ C + + Sbjct: 382 YVSQLRSFSDLAAHNQTFLKKLRNQITHTVSVISERTETLKAGCVDTEEADIYLLIDGSG 441 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQY 279 + T +K L ++ + V +G + Y R + E S T K VR+ Sbjct: 442 STQATDFQEMKTFLSEVAGMFNIAP---QKVRVGAVQYADRWDLEFEISKYTNKHDVRKA 498 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + + +T A A +R + + +P ++ LT+G Sbjct: 499 IE---NIRQMGGNRNTGA---ALNFTLGLLQR-------AKQQRGGRVPC--HLVVLTNG 543 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + + S ++ E I + I + Q L+ + Y V + D+L Sbjct: 544 ASRDSVSGPA-----NRLSEELIHVYAIGVR--EANQTQLREIAGEEKRVYYVHDFDALK 596 Query: 400 HVFQNISQLM 409 + + Q + Sbjct: 597 DIRNQVVQEI 606 >gi|148261962|ref|YP_001236089.1| hypothetical protein Acry_2980 [Acidiphilium cryptum JF-5] gi|326405471|ref|YP_004285553.1| hypothetical protein ACMV_33240 [Acidiphilium multivorum AIU301] gi|146403643|gb|ABQ32170.1| hypothetical protein Acry_2980 [Acidiphilium cryptum JF-5] gi|325052333|dbj|BAJ82671.1| hypothetical protein ACMV_33240 [Acidiphilium multivorum AIU301] Length = 431 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 60/447 (13%), Positives = 128/447 (28%), Gaps = 61/447 (13%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 FK + N +II AL ++ + ++G I +K+ MES +AA LA + Sbjct: 4 LFIKFKALCGDRRGNIAIITALVSLTLIFILGMGIDYGLAIDRKSQMESYADAAALAAVT 63 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + ++ A+ F + + Y+A I + Sbjct: 64 PAM------------VAAGQSSAITTAQNVFNAQALTMTGVTYNANDVTVSIATSGDKRT 111 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 ++ + A ++ + F ++ + + + ++ + I Sbjct: 112 ATVQYQAQSQAMLPDVMGFGSIKIGGQATATTT-----IAPNIDFYLLLDDSPSMAIAAT 166 Query: 187 ERPIFLIELVVDLSGSMHCAMN---SDPEDVNSAPICQDKKRT---------KMAALKNA 234 + I + V + + CA +P + ++ L+ A Sbjct: 167 QSGINTM--VANTTAQGGCAFGCHEENPSADKLGNPYGEDNYALARSLGVTLRIDMLRQA 224 Query: 235 LLLFLDSIDLLSHVKEDV-YMGLIGYTTRVEKNIEPSWGTEKVRQY-----VTRDMDSLI 288 + + K M + + + + + + + + Sbjct: 225 TQDLMTTAQTTETQKGTTYRMAIYTFDIGLNTIGNLTSDLSQAQTEAGNIQLLEVYSNNW 284 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-----NNN 343 L D Y + N P Q+ + F+TDG N N Sbjct: 285 LTQNDYNDDEDTNYDTALNGINAIM-PNPGNGTGAAGDTP-QEVLFFVTDGVEDEDVNGN 342 Query: 344 FKSNVNTIKICDKAKENFIKI---VTISINASPNG-------------QRLLKTCVSSPE 387 + ++ +C K I+I T + N L+ C SP Sbjct: 343 RQQSLLNTDLCTAIKNRGIRIAVLYTEYLPLPTNSWYNTYIAPFQNSIAPTLQQCA-SPG 401 Query: 388 YHYNVVNADSLIHVFQNISQLMVHRKY 414 ++ V + + + Q V Y Sbjct: 402 LYFEVKSGGDISAAMSALFQTAVQSSY 428 >gi|254787962|ref|YP_003075391.1| PKD domain-containing protein [Teredinibacter turnerae T7901] gi|237687416|gb|ACR14680.1| PKD domain protein [Teredinibacter turnerae T7901] Length = 1083 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 41/251 (16%), Positives = 75/251 (29%), Gaps = 53/251 (21%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + L++D SGSM +DP+ + K ++ F Sbjct: 403 VDEAPRTYSELDSLLIIDSSGSMR---TTDPQSRRLEAANTYVNISAAED-KIGIIDFDG 458 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 S ++ +G G T R++ + + T+ ++ Sbjct: 459 SARVVKP---FTLLGQQGSTERIQ----LENAINGI----------DAVGGTEIAGSLGL 501 Query: 301 AYQILTSD-----------------------KKRSFFTNFFRQGVKI--PSLPFQKFIIF 335 A L + +N R + ++ +K I Sbjct: 502 ACSTLMASWQDDIIEQQHMVNDIKALVEEFPNNTISHSNARRAAIPGIAHAINSKKIAIL 561 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 LTDG+ + S N I EN + TI NG++L +S Y+ ++ Sbjct: 562 LTDGDTPSSYSQANQCFI-----ENGWMLYTIGFGG-ANGEKLAPLAEASGGYYIAADSS 615 Query: 396 -DSLIHVFQNI 405 L +Q I Sbjct: 616 LLDLNCAYQQI 626 >gi|224368584|ref|YP_002602747.1| hypothetical protein HRM2_14740 [Desulfobacterium autotrophicum HRM2] gi|223691300|gb|ACN14583.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 222 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 93/237 (39%), Gaps = 43/237 (18%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + ++ D SGSM K+ A+ AL +D+ Sbjct: 3 KFAARTARPIPVIILADTSGSMSVD-------------------GKIDAMNQALRDLIDT 43 Query: 242 IDLLSHVKEDVYMGLIGYT-TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 S + ++++ +I + ++++ + ++ D T A + Sbjct: 44 FSGESRLNAEIHLSVITFGGDGAKEHLPLT-----CAHTISGFSDLQAHGMTPMGGAFRI 98 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAKE 359 A + L DK+ KIPS ++ I+ ++DG N+++++ ++++ ++A++ Sbjct: 99 A-KELIEDKE------------KIPSRAYRPVIVLVSDGYPNDDWEAAFSSLRGSERAQK 145 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEY-HYNVVNADSLIHVFQNISQLMVHRKYS 415 + I S + +LK ++ PE + A +I F+ +S + R S Sbjct: 146 ATRMAMAIG---SDADENMLKDFINDPETPVFRANGARDIIRFFRAVSMSVTSRSRS 199 >gi|85708696|ref|ZP_01039762.1| hypothetical protein NAP1_05635 [Erythrobacter sp. NAP1] gi|85690230|gb|EAQ30233.1| hypothetical protein NAP1_05635 [Erythrobacter sp. NAP1] Length = 640 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 19/158 (12%), Positives = 43/158 (27%), Gaps = 16/158 (10%) Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + + WG + + I +++T A Sbjct: 495 SNTYHDFGMIWGARFIS-------PNGIFAASNATAPNGDAISRHIVFMTDGLLVPNQEI 547 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 + + I G + C A++ I + I+ G L + Sbjct: 548 YSMYGIEWWDRRITNDGSGGQARDRHATRFQVACRAARQENISVWVIAF-----GTTLTQ 602 Query: 381 T---CVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 C + P + + +L F+ I+Q + + + Sbjct: 603 NLIDCAT-PGRAFQANDTAALETRFEQIAQEIAALRLT 639 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 26/99 (26%), Positives = 47/99 (47%), Gaps = 9/99 (9%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + VVD++GSM+CA D S +D K+ L++A+L F D+++ + Sbjct: 135 DVLFVVDVTGSMNCA--PDNPGGGSCGNTED-PGAKIKGLRSAVLKFYDTVETSTSPSAQ 191 Query: 252 VYMGLIGYTTRVE-----KNIEPSWGTEKVRQYVTRDMD 285 V G++ Y + V EP+W + Y +R+ + Sbjct: 192 VRYGMVPYASNVNVGAALMATEPAW-MAQTHTYQSREAN 229 >gi|317125812|ref|YP_004099924.1| von Willebrand factor A [Intrasporangium calvum DSM 43043] gi|315589900|gb|ADU49197.1| von Willebrand factor type A [Intrasporangium calvum DSM 43043] Length = 577 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 47/237 (19%), Positives = 82/237 (34%), Gaps = 40/237 (16%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + + A + + F + V D+SGSM N + I Q+ + AL Sbjct: 356 AAQVTAAIDVWQAATTSFQLLSVFDVSGSM----NEKVGNTTRVRITQEAAGIALNALPR 411 Query: 234 A--LLLFLDSIDLLS--HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 + L L++ S D KE V +GL+ + + + + + V Sbjct: 412 STKLGLWVFSSDKGGGRDYKELVPLGLLSDDAQRARMAAAA---ASLSKEVD-------- 460 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + AY + K S+ ++ LTDG+N + ++ Sbjct: 461 GWTGLYDTIWAAYSKV----KASYDPQRVNA------------VVILTDGKNEDPGGGLS 504 Query: 350 TIKICDKAK-----ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 ++ K K + I I TI I + + L K SS +Y N + V Sbjct: 505 LEQLLAKIKDATDPKRPIAITTIGIGPGVDAESLRKISRSSYSDYYGAENPADMTTV 561 >gi|149922129|ref|ZP_01910569.1| hypothetical protein PPSIR1_23374 [Plesiocystis pacifica SIR-1] gi|149817066|gb|EDM76548.1| hypothetical protein PPSIR1_23374 [Plesiocystis pacifica SIR-1] Length = 546 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 61/173 (35%), Gaps = 45/173 (26%) Query: 177 IPALLRIEM------GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 +PA++R+ G RP + +V+D SGSM K+ Sbjct: 111 MPAIIRLSADDEAGQGPRPGLDLAIVLDRSGSMGGD--------------------KLRF 150 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW----GTEKVRQYVTRDMDS 286 K A L ++ +D + LI Y V G E +R+ + +D Sbjct: 151 AKQAGLDLVNRLDEQD------RVTLISYDDTVTPLSNLQRVDDDGIEVLRR---QLLDI 201 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T PA+ Q L + + T + ++ + +I L+DG Sbjct: 202 QVGGTTALGPALFMGLQRLAAPEPFGPQTRTEARHDRL------RHVILLSDG 248 >gi|257897710|ref|ZP_05677363.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] gi|257835622|gb|EEV60696.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] Length = 1104 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 30/223 (13%), Positives = 70/223 (31%), Gaps = 30/223 (13%) Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 N + + E N+ + + + N+ + YN + + + Sbjct: 310 ANWRNYDYAADNGSTAGEEDVNV----PAQTVQLWGDERNFENSYLDYNGAYIKKWVEPV 365 Query: 159 --QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 L+ + + ++ E E I V+D S SM + Sbjct: 366 LSNNPTSDLHPEDATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASM------------N 413 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--------TTRVEKN-- 266 + ++K AAL ++ + +LLS D+ +G++ + + Sbjct: 414 EGTLEGGGQSKNAALIESVNEISE--NLLSDPNMDIRIGMVNFYHNSTVINNQEQISSDI 471 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ ++ + T T +K Y+ L +D Sbjct: 472 FPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYADN 514 >gi|114562801|ref|YP_750314.1| vault protein inter-alpha-trypsin subunit [Shewanella frigidimarina NCIMB 400] gi|114334094|gb|ABI71476.1| Vault protein inter-alpha-trypsin domain protein [Shewanella frigidimarina NCIMB 400] Length = 722 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 29/220 (13%), Positives = 64/220 (29%), Gaps = 45/220 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 +V +P + + + LV+D SGSM Sbjct: 321 NNMYSLVMLMPPSVEVSEQHLIARELILVIDTSGSMSGQ--------------------S 360 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMD 285 + K AL L + + +I + + V S + + + Sbjct: 361 ITQAKQALQFALAGLRDIDSFN------IIEFNSDVTMLSATPLSANSRNIGKANRFIQS 414 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ A++ A L ++ + + +IF+TDG Sbjct: 415 LDADGGTEMRSALQTA---LVDSVQQDSDQTDAHSEML-------RQVIFMTDG----AV 460 Query: 346 SNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVS 384 N + + + ++ T+ I +A + ++ + Sbjct: 461 GNEHELYQLINDQLGDSRLFTVGIGSAPNS--DFMRRAAT 498 >gi|299139640|ref|ZP_07032813.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] gi|298598264|gb|EFI54429.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] Length = 488 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 37/243 (15%), Positives = 81/243 (33%), Gaps = 38/243 (15%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + G+ ++ +V D+SGSM+ M T+ +LK+A+ FL + Sbjct: 98 DAGQHTNVVLMIV-DISGSMNQPMAGGS--------------TRFQSLKSAIAQFLAGMQ 142 Query: 244 LLSHVKEDVYMGLIGY-TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 S + ++ + + V I + T + + P ++T Y Sbjct: 143 EGSD-----RVAIVPFESHNVVSTIRSAVFTTHRADALAQLNALPAPGPKNNT----ALY 193 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-NFKSNVNTIKICDKAKENF 361 Q + S Q Q +I +TDG+N + + ++ Sbjct: 194 QAVFSGVDSMKGELASLQHEGATLAELQPHLIVMTDGKNEVAPGDDPQLLNGDLGLQQAV 253 Query: 362 IKIVT-------ISI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ T I + + L+ + + +A+ L+ ++S+ V Sbjct: 254 AQVQTSNLDTIGIGFGDKNDIDAGALQKLT---KRFFYASDANQLLAAL-HVSRSAVSHS 309 Query: 414 YSV 416 + Sbjct: 310 IQI 312 >gi|149694147|ref|XP_001503972.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Equus caballus] Length = 495 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 83/223 (37%), Gaps = 40/223 (17%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKED 251 +NSD + N T + L + + F++ I D L + Sbjct: 251 GFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKL 310 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDK 309 +GL+ Y++ V + G ++ + + S + K T + A+K D Sbjct: 311 AQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----LIDN 363 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + K+ I+F TDG + ++ ++ KAK+ K+ + + Sbjct: 364 SFTVSSGARPGAQKVG-------IVF-TDGRSQDYINDAA-----KKAKDLGFKMFAVGV 410 Query: 370 -NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 NA + L+ S P E+++ + ++ + + + + + Sbjct: 411 GNAVEDE---LREIASEPVAEHYFYTADFKTINQIGKKLQKKI 450 >gi|149481218|ref|XP_001506719.1| PREDICTED: similar to matrilin 4, partial [Ornithorhynchus anatinus] Length = 312 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 76/223 (34%), Gaps = 48/223 (21%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+D S S+ + +K + + S+D+ Sbjct: 134 DLVLVIDGSKSVGAQ--------------------QFELVKRWVGELVGSLDVSPAG--- 170 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ +++RV K + + K T + A++ + S+ + Sbjct: 171 TRVGLVQFSSRVRTEFPLGRHGTKAEAEAAVRAVTPMDKGTMTGLALRHLVERGFSEAE- 229 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 G + S ++ + +TDG + +V+ +AK+ I + + + Sbjct: 230 ---------GARPGS---RRVGLLVTDG---RSQDDVS--PWAARAKDRGIVMFAVGVG- 271 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + L+ S P V S F +++L+ H K Sbjct: 272 -KAVEEELREIASDP-----VERHVSYAADFGALTRLLDHLKL 308 >gi|284029341|ref|YP_003379272.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283808634|gb|ADB30473.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 315 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 70/236 (29%), Gaps = 53/236 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + +D+S SM D + K A F+ ++ Sbjct: 83 RERATVMVAMDISNSMAA---------------TDVSPDRFTVAKEAATEFVRNL----- 122 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 E +GL+ + PS T + V + T A+ + Q + S Sbjct: 123 -PEQFNVGLVSFARTATVVAPPS--TNH-QAAVDAIEQLTLTDSTAIGEAVLTSLQAVRS 178 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 Q + P I+ L+DG N + + + A E + + TI Sbjct: 179 ---------LDAQAAEDPPPAR---IVLLSDGGNTSGRPIDEGAR---AATEAGVPVSTI 223 Query: 368 SINASPNGQRL-------------LKTCV-SSPEYHYNVVNADSLIHVFQNISQLM 409 + L L+ ++ Y + + L V+ ++ + Sbjct: 224 AYGTPEGTIDLEGRSIPVPADTESLRGLADATSGSFYAAESDEELRDVYSDLQSSI 279 >gi|47211020|emb|CAF94689.1| unnamed protein product [Tetraodon nigroviridis] Length = 2225 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 67/181 (37%), Gaps = 23/181 (12%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKP 291 + +FL+S+ + +GL ++ W T ++ V +L K Sbjct: 27 VRVFLESLVKAFDVDLDRTRIGLAQFS----GEPRIEWHLNTHTTKEAVMEAARNLPYKG 82 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 ++ + LT + SF P +P K + LTDG+ + +V Sbjct: 83 GNTLTGL-----ALTFILENSFSPE----SGSRPGIP--KIGVLLTDGK---SQDDVIPP 128 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + ++ +++ I + + G+ VS + YNV + + + +++ + Sbjct: 129 AQ--RLRDAGVEVFAIGVKNADEGELRAIASVSEDTHVYNVADFHLMADIVDVLTRTICE 186 Query: 412 R 412 R Sbjct: 187 R 187 >gi|148655977|ref|YP_001276182.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568087|gb|ABQ90232.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 420 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 34/255 (13%), Positives = 77/255 (30%), Gaps = 49/255 (19%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 V ++ + ++ P ++ R + V+D SGSM Sbjct: 5 VTIRASLARPYMAAAATPQVAYMLIEITPG--QVMTQVRAPVNVCFVIDRSGSM------ 56 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 K K+ ++ A + ++ +D V ++ + R E I Sbjct: 57 --------------KGEKIDRVRRATIRAIEMLDAQDVVS------VVIFDHRTEVLIPA 96 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + + + + R T PA++ + + + Sbjct: 97 T-PVTRPAELIDRINRVRDSGGTRIAPAIEAGLREI-----------------EKGPPQM 138 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 + +I LTDG+ N + A + + I + + N L++ S Sbjct: 139 VRRLILLTDGQTENESDCLRRAT---DAGQRNVPITALGVGKDWNEDLLIEMANRSGGTA 195 Query: 390 YNVVNADSLIHVFQN 404 + + ++ FQ+ Sbjct: 196 DYIDRPEKIVEYFQS 210 >gi|328954590|ref|YP_004371924.1| VWFA-related domain-containing protein [Desulfobacca acetoxidans DSM 11109] gi|328454914|gb|AEB10743.1| VWFA-related domain-containing protein [Desulfobacca acetoxidans DSM 11109] Length = 543 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 74/230 (32%), Gaps = 48/230 (20%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + L +D+SGSM K MA + A +FLD ++ Sbjct: 81 SSAKSPLSVVLAIDVSGSM--------------------KGEPMAEARKAAAIFLDELEK 120 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 HV LI + V + + +VR+ + ++ YQ Sbjct: 121 DDHV------ALITFGQGVYHLSDFAAKKHEVREQLQHL---------EAKEQWTWLYQA 165 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 +R+ R +I LTDG++ + + D+ K + I Sbjct: 166 TLEAMERAVQAPTTRAA-----------VILLTDGKDEGSPVSEE--AVLDRIKGAQVPI 212 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + + L K +S + A L +++Q + + ++ Sbjct: 213 YAMGFGSKAQVDYLQKVASASQGAFLSTPQAADLTNLYQTVLDYLKNQYI 262 >gi|126330546|ref|XP_001381755.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Monodelphis domestica] Length = 495 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 59/168 (35%), Gaps = 22/168 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I+ L +G+I Y + V+ K + T + A++ A Sbjct: 67 IESLDVGPNTTRVGVINYASAVKHEFPLKAHRSKASLLQAVRKIEPLSTGTMTGLAIQFA 126 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 S+ + S +K P + K I +TDG + V + +AK++ Sbjct: 127 INKAFSEVEGS--------RLKFPEIS--KVAIVVTDG---RPQDGVKDVS--ARAKQSG 171 Query: 362 IKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 I++ I + + L+ S P ++ V + + L FQ Sbjct: 172 IELFAIGVG--RVDKHTLRQIASEPLDDHVDYVESYSVIEKLSKKFQE 217 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 79/216 (36%), Gaps = 31/216 (14%) Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 S A S + V + + +K + +DS+D+ + +GL+ Sbjct: 261 CSACNGAAGGSATDLVFLIDGSKSVRPENFELVKRFINQIVDSLDVSD---KLAQVGLVQ 317 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 Y++ V + G K ++ + + S + K T + A+K + Sbjct: 318 YSSSVRQEFPL--GRYKTKKDIKAAVKKMSYMEKGTMTGAALKYLIDNTFTIS------- 368 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-NASPNG 375 QK I TDG + ++ ++ KAK+ K+ + + NA + Sbjct: 369 ------SGARPGAQKVGIVFTDGRSQDYINDAA-----KKAKDLGFKMFAVGVGNAVEDE 417 Query: 376 QRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 L+ S P E+++ + ++ + + + + + Sbjct: 418 ---LREIASEPVAEHYFYTADFKTINQIGKKLQKKI 450 >gi|296481522|gb|DAA23637.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus] Length = 940 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 42/222 (18%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + H Sbjct: 296 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQDHFN--- 332 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++G++ R++ K+ S +R TD A+++ Q+L Sbjct: 333 ---IVGFSNRIKVWKDHLVSVTPNSIRDGKVYIHHMSPSGGTDINGALQRGIQLLNDYVA 389 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 + + ++FLTDG+ +T+KI + +E + I T+ Sbjct: 390 HNDIEDRSVS-----------LVVFLTDGK--PTVGETHTLKILNNTREAARGRVCIFTV 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I A + + L L+ C + H + LI + I Sbjct: 437 GIGADVDFKLLEKLSLENCGLTRRVHEDHDARAQLIGFYDEI 478 >gi|220910752|ref|YP_002486062.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219867524|gb|ACL47861.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 411 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 75/240 (31%), Gaps = 47/240 (19%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 +V IP + RP + LV+D SGSM A Sbjct: 20 PTTLDVLVKIIPPVPETRPQ-RPPLNLGLVIDRSGSMQGA-------------------- 58 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 KM + A ++ + + + + RVE + + + + + R + Sbjct: 59 KMEVARQAACFAVEQLLPSD------RLSVTIFDDRVECPVPST--LVRDKATIIRTIQG 110 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + + + + + + + I L+DG N ++ Sbjct: 111 IHSRGS--------------TALHDGWVQGGIQVSQHLNPAHLNRVI-LLSDGLANVGET 155 Query: 347 NVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQN 404 N + I + + T+ I G+ LL+ S Y++ + L +FQ Sbjct: 156 NPDAIAQHVHGLAQRGVSTSTMGI-GEDYGEDLLEAMARSGAGSFYHIERTEQLAAIFQA 214 >gi|171846231|ref|NP_034899.2| cartilage matrix protein precursor [Mus musculus] gi|28503027|gb|AAH47140.1| Matrilin 1, cartilage matrix protein [Mus musculus] gi|66350790|emb|CAC79633.1| cartilage matrix protein [Mus musculus] gi|123295072|emb|CAM17812.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] gi|187466486|emb|CAQ51562.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] Length = 500 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 74/190 (38%), Gaps = 29/190 (15%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K + +D++D+ + + GL+ Y++ + + G ++ + Sbjct: 291 RPENFELVKKFINQIVDTLDVSDRLAQV---GLVQYSSSIRQEFPL--GRFHTKKDIKAA 345 Query: 284 MD--SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + S + K T + A+K D + + K+ I+F TDG + Sbjct: 346 VRNMSYMEKGTMTGAALKY-----LIDNSFTVSSGARPGAQKVG-------IVF-TDGRS 392 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 ++ ++ KAK+ K+ + + + L+ S P ++++ + ++ Sbjct: 393 QDYINDAA-----RKAKDLGFKMFAVGVG--NAVEEELREIASEPVADHYFYTADFKTIN 445 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 446 QIGKKLQKQI 455 >gi|325919326|ref|ZP_08181363.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas gardneri ATCC 19865] gi|325550197|gb|EGD21014.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas gardneri ATCC 19865] Length = 520 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 34/253 (13%), Positives = 86/253 (33%), Gaps = 54/253 (21%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++ A I + P + +VD+SGSM + K Sbjct: 128 NNDSLLLRVGVAGRAIATADLPPANLVFLVDVSGSM-------------------ESPDK 168 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + L+++L L + + + L+ Y + P+ G + + + +D++ Sbjct: 169 LPLLQSSLKLLVRQLRAKD------RITLVTYAGNTAVVLPPTPGDQ--QGRIIEAIDTV 220 Query: 288 ILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T ++ AY+ +QG + I+ TDG+ N + Sbjct: 221 QSGGSTAGASGIELAYKA-------------AQQGYLRGGINR---ILLATDGDFNVGVT 264 Query: 347 NVNTIK-ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---------D 396 + + +K + + + + + + T+ L++ + + Y +++ Sbjct: 265 DFDQLKGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLADAGDGAYAYIDSPLEARKVLTH 324 Query: 397 SLIHVFQNISQLM 409 L I++ + Sbjct: 325 ELGATLATIARDV 337 >gi|288921527|ref|ZP_06415802.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347095|gb|EFC81397.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 587 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 46/132 (34%), Gaps = 22/132 (16%) Query: 280 VTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 ++ D L L T A++ AY+ + P I+ +TD Sbjct: 468 ISAAADGLTLGSGTAIYSALEAAYRYVADSAAAPAD----------GVAPLTS-IVLMTD 516 Query: 339 GENN----NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVV 393 GENN + + + + D A+ ++ T+ ++T + ++ Sbjct: 517 GENNQGTTADAFHSSYLALPDAARS--VRTFTVVFG--DARVDEMRTIADWTGGAMFDAR 572 Query: 394 NADSLIHVFQNI 405 + SL F+ I Sbjct: 573 TS-SLSEAFREI 583 >gi|297606054|ref|NP_001057930.2| Os06g0578100 [Oryza sativa Japonica Group] gi|255677166|dbj|BAF19844.2| Os06g0578100 [Oryza sativa Japonica Group] Length = 622 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 76/212 (35%), Gaps = 38/212 (17%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D+SGSM+ + + PE + + T++ LK ++ + +D Sbjct: 70 IDVVAVLDVSGSMNDPVAASPE--------SNLQATRLDVLKASMKFIIRKLDDGD---- 117 Query: 251 DVYMGLIGYTT---RVEKNIEPSW---GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + ++ + + + G + + R + + P +++A +I Sbjct: 118 --RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQ-ARGGSGSALMPELQEAVKI 174 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L RQG + FI+ LTDG++ + K + Sbjct: 175 L-----------DERQGNSRNRVG---FILLLTDGDDTTGFRWSRDVIHGAVGK---YPV 217 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 T ++ A+ + + LL S + V + + Sbjct: 218 HTFALGAAHDPEALLHIAQESRGTYSFVDDGN 249 >gi|54290564|dbj|BAD61973.1| zinc finger-like [Oryza sativa Japonica Group] gi|54291279|dbj|BAD62048.1| zinc finger-like [Oryza sativa Japonica Group] Length = 598 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 76/212 (35%), Gaps = 38/212 (17%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D+SGSM+ + + PE + + T++ LK ++ + +D Sbjct: 46 IDVVAVLDVSGSMNDPVAASPE--------SNLQATRLDVLKASMKFIIRKLDDGD---- 93 Query: 251 DVYMGLIGYTT---RVEKNIEPSW---GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + ++ + + + G + + R + + P +++A +I Sbjct: 94 --RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQ-ARGGSGSALMPELQEAVKI 150 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L RQG + FI+ LTDG++ + K + Sbjct: 151 L-----------DERQGNSRNRVG---FILLLTDGDDTTGFRWSRDVIHGAVGK---YPV 193 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 T ++ A+ + + LL S + V + + Sbjct: 194 HTFALGAAHDPEALLHIAQESRGTYSFVDDGN 225 >gi|227832539|ref|YP_002834246.1| hypothetical protein cauri_0711 [Corynebacterium aurimucosum ATCC 700975] gi|227453555|gb|ACP32308.1| putative membrane protein [Corynebacterium aurimucosum ATCC 700975] Length = 693 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 52/298 (17%), Positives = 92/298 (30%), Gaps = 54/298 (18%) Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDV---MTSYDYRLQFIEHLLNQRYNQKIVSFI 177 I+ + + +RL + + + L + L ++ + Sbjct: 4 ILKKNHAGSSGDIPHRLSDGKRHAGVQCSLAHMENTSRFSLTPLLTALIAVLGTTALAVV 63 Query: 178 PALLRIEMGERPIFL----------IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 LL + E +V D SGSM +T+ Sbjct: 64 SGLLPVASAEEETNAPSSSSSTMAPTMVVFDSSGSM--------------ITNDAGGQTR 109 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + A K+A F+ +D +GL+ Y + E + VT Sbjct: 110 IDAAKDAARTFITEA------GDDAPLGLVTYGGNTGEAPEDEAAGCQDITVVTPPEAGN 163 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 K ++ + R ++G Q+ II ++DG + Sbjct: 164 SEKMIAHMDGLQPRGFTPIGESLRKAAAELPKEG--------QRSIILVSDG-----VAT 210 Query: 348 VNTIKICDKA---KENFI--KIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 +CD A KE I I T+ N P Q+ L+ C++ + N +ADSL Sbjct: 211 CTPPPVCDVAKELKEQGIDLVINTVGFNVEPEAQQELQ-CIADATGGTYANASDADSL 267 >gi|221039656|dbj|BAH11591.1| unnamed protein product [Homo sapiens] Length = 237 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 32/231 (13%), Positives = 76/231 (32%), Gaps = 58/231 (25%) Query: 195 LVVDLSGSMHCA--MNSDPEDVNSAPICQDKKRTKMAALK-NALLLFLDSIDLLSHVKED 251 LV+D SGSM +N + + + + + + ++ ++ + + E Sbjct: 22 LVLDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDE- 80 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 R + + + L T +K A+Q++ + Sbjct: 81 -------------------------RNTLMAGLPTYPLGGTSICSGIKYAFQVIGELHSQ 115 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 + ++ LTDGE+N S C D+ K++ + I++ Sbjct: 116 LDGSE----------------VLLLTDGEDNTASS-------CIDEVKQSGAIVHFIALG 152 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNA--DSLIHVFQNIS---QLMVHRKYSV 416 + + + + ++ + Y A + LI F ++ + + V Sbjct: 153 RAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQV 203 >gi|148655541|ref|YP_001275746.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567651|gb|ABQ89796.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 504 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 40/250 (16%), Positives = 83/250 (33%), Gaps = 65/250 (26%) Query: 177 IPALLRIEMGERPIFL--IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + R GE I+LV D SGSM + + TK+ A + A Sbjct: 112 TTTIFRPAEGEAAQVTTNIQLVFDASGSMAQRIGGE---------------TKIQAARRA 156 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVE----------KNIEPSWGTEKVRQYVTRDM 284 + +D++ D+ +G + + ++ + V + + R Sbjct: 157 MERIIDTL----PDNPDLNVGFRVFGHEGDSSEAQKARSCQSTALLVPMQGVNKALLRQQ 212 Query: 285 DSL--ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T + A+++A + + + II +TDGE Sbjct: 213 AQAWQPTGWTPISLALQRAGEDFQAGENVRN------------------VIIMVTDGEET 254 Query: 343 NFKSNVNTIKICDKAK-----ENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNA 395 C AK + ++I + +P+ + L+ C++ S + + N Sbjct: 255 CGGDP------CAVAKALAESQAEVRIDVVGFGTTPDVAKTLR-CIAENSGGVYTDAQNG 307 Query: 396 DSLIHVFQNI 405 D+L+ + + Sbjct: 308 DALVQTLEEL 317 >gi|320101795|ref|YP_004177386.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319749077|gb|ADV60837.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 764 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 63/211 (29%), Gaps = 54/211 (25%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 ++ P + P + L++D SGSM K+ Sbjct: 264 TLLLLASPTIETTPNKTPPAKTVVLILDRSGSMSGK--------------------KIEQ 303 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDMDSLI 288 + A+ +++++ LI Y VE E + + Sbjct: 304 ARAAMKFVVENLNQDDLFN------LILYDDTVEMFKPELLRCNAENRAEALRFIEGVRP 357 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD ++ A L +D+ R ++IFLTDG Sbjct: 358 GGSTDIDQGLR-AGLKLIADESRPN------------------YVIFLTDG--LPTSGET 396 Query: 349 NTIKICDKAK-----ENFIKIVTISINASPN 374 N +KI + A+ + + + + + + Sbjct: 397 NELKIAEAARAANPLKAKLFVFGVGYDVNAR 427 >gi|297285600|ref|XP_001085586.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Macaca mulatta] Length = 935 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + + R + L T+ AM A Q+L S Sbjct: 311 --NLIVFSTEATQWRPSLVPASAENVNEARSFAAGIQA---LGGTNINEAMLVAVQLLDS 365 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 366 SNQ--------EERLPDGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|281348290|gb|EFB23874.1| hypothetical protein PANDA_022043 [Ailuropoda melanoleuca] Length = 426 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 21/128 (16%), Positives = 45/128 (35%), Gaps = 28/128 (21%) Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + +T ++ T ++ +Q + + + + II LT Sbjct: 44 EKITANLPREASGGTSICSGLRAGFQAIIHSNQSTSGSE----------------IILLT 87 Query: 338 DGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNVVN 394 DGE++ S C ++ K++ I TI++ + L L ++ Y + Sbjct: 88 DGEDDQISS-------CFEEVKQSGAVIHTIAL-GPSAARELETLSN-MTGGYRFYANKD 138 Query: 395 ADSLIHVF 402 + L F Sbjct: 139 INGLTDAF 146 >gi|221042208|dbj|BAH12781.1| unnamed protein product [Homo sapiens] Length = 900 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 55/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + K R + L T+ AM A Q+L S Sbjct: 311 --NLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDS 365 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + S+ II LTDG+ ++N +I+ Sbjct: 366 SNQEG--------RLPEGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|134093164|gb|ABO53024.1| matrilin 4 isoform 1 precursor, 3 prime [Chlorocebus aethiops] Length = 243 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 70/177 (39%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTE-KVRQYVTRDMDSLILKPTDSTPAMK 299 +D L E +GL+ +++RV +GT +V+Q V + + T + A++ Sbjct: 34 VDFLDVSPEGTRVGLVQFSSRVRTEFPLGRYGTAVEVKQAVLAM--EYMERGTMTGLALR 91 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S+ QG + +L + + TDG + +++ +AKE Sbjct: 92 HMVEHSFSE----------AQGARPRALNVPRVGLVFTDG---RSQDDISV--WAARAKE 136 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I + + + + + L+ S P + D F ++ L+ + + S+ Sbjct: 137 EGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPD-----FGTMTHLLENLRGSI 186 >gi|313232459|emb|CBY24127.1| unnamed protein product [Oikopleura dioica] Length = 1632 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 58/188 (30%), Gaps = 47/188 (25%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 I++V+D SGS+ A N D +N + T + +K L F +S Sbjct: 1393 NTTISGRLDIQIVIDTSGSLTSAPNKDQVLMNFTNNLANMYDT-INQVKIGLTSFSESS- 1450 Query: 244 LLSHVKEDVYMGLIGYT--TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + M L Y + +W T+ T ++ A Sbjct: 1451 -------VLEMPLDFYNQLELQDGVSNMTW----------------QGSFTNITSGVETA 1487 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 S +I +TDG + + ++ D+AK + Sbjct: 1488 LND------------------MDTSDAVDDVMILITDGFQST--NTTLMFQMIDQAKADG 1527 Query: 362 IKIVTISI 369 ++++ + Sbjct: 1528 VRLIALGF 1535 >gi|297286920|ref|XP_001082067.2| PREDICTED: collagen alpha-4(VI) chain-like, partial [Macaca mulatta] Length = 1624 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 17/101 (16%), Positives = 37/101 (36%), Gaps = 15/101 (14%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMKQAY 302 L + V +GL Y + + ++ V + +L T++ A++ Sbjct: 265 LDISSDHVRVGLAQYNDNIYPAFQL--NQHPLKSTVLEQIQNLPYRTGGTNTGSALEF-- 320 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + ++ T K + +I +TDGE+N+ Sbjct: 321 ------IRTNYLTEESGSRAKD---RVPQIVILVTDGESND 352 Score = 36.4 bits (82), Expect = 9.9, Method: Composition-based stats. Identities = 40/303 (13%), Positives = 90/303 (29%), Gaps = 52/303 (17%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + SR + I+ D+ +H L Y QN+ S + Sbjct: 727 IQEKGSRSNHGVQQIAVVIMEGFSQDSVSRPASH----LRRAGITIYAVGTQNVSESKEL 782 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + S F + V+ S Q +++ + + + +S+ + + + Sbjct: 783 EKIASYPHWKYSVPLESFLQLSVVRSKLIN-QLCSEMVDSKVSFRGMSYPLQEGCVHVEK 841 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 I+ + +D SGS+ + +M A ++ Sbjct: 842 ADIYFL---IDGSGSI-----------------NPQDFLEMKAFIKEVIKMFQI------ 875 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V G+I Y+ +++ S V A+ Q Sbjct: 876 GPNRVQFGVIQYSDKIQSQFILSQYPSVAELKV----------------AIDNIQQGGGG 919 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 N + + +++I +TDG++++ + + +EN + I I Sbjct: 920 TATGEALNNMTQVFADTGRINVARYLIVITDGKSSDPVAEAA-----EGLRENGVIIYAI 974 Query: 368 SIN 370 + Sbjct: 975 GVR 977 >gi|297526263|ref|YP_003668287.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297255179|gb|ADI31388.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 416 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 49/245 (20%), Positives = 91/245 (37%), Gaps = 47/245 (19%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 ++ P +V+D S SM K+ K A L L Sbjct: 28 SIKGVYSAHPPIAFLIVIDTSYSMDGE--------------------KIFRAKQAALGLL 67 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAM 298 D + K+ V G+ G+ + K +EP R V R + SL L T+ + Sbjct: 68 DILR----DKDYV--GVYGFAGKFYKVLEPV--PATKRGEVERAIISLKLGSGTNIYDTL 119 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKA 357 K+ ++ ++ +L + IIF+TDGE K N I ++ K Sbjct: 120 KK-------------LVEETKKVLQNGALSLVR-IIFITDGEPTVGKKNPKKILEMAKKL 165 Query: 358 KENFIKIVTISINASPNGQRLLKTCVS---SPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 +E + I + N + L + ++ E+ + + + LI + +Q + + Sbjct: 166 REAGASALIIGVGTEYNEKLLSRMAMALNGEFEHISDPASLEKLISEYAKSTQEVSAKNV 225 Query: 415 SVILK 419 +V+L+ Sbjct: 226 AVLLR 230 >gi|254819550|ref|ZP_05224551.1| hypothetical protein MintA_06484 [Mycobacterium intracellulare ATCC 13950] Length = 335 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 72/234 (30%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D + +MAA + A F D + + Sbjct: 99 VMLVIDVSQSM---------------RATDVQPNRMAAAQEAAKQFADELTPG------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ + + + + + T + + A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNRDSTKAALDKLQFADR---TATGEGIFTALQAIATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNTIKICDKAKENFIKIVTISI 369 + P I+ +DG+ N + AK+ + I TIS Sbjct: 194 ---------IGGGDKPPPARIVLFSDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISF 244 Query: 370 NAS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P LK S YN + L V+ + Q + Sbjct: 245 GTPYGFVEINDQRQPVPVDDETLKKVAQLSGGNAYNAASLQELKAVYATLQQQI 298 >gi|148537043|dbj|BAF63430.1| Ca(2+)-activated chloride channel splicing variant [Rattus norvegicus] Length = 514 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 67/186 (36%), Gaps = 50/186 (26%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM + + +M A L+L I V+++ + Sbjct: 311 LVLDKSGSMD----------------TEDRLIRMN---QAAELYLTQI-----VEKESMV 346 Query: 255 GLIGYT-TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 GL+ + T +N + +T ++ + T ++ +Q +TS + + Sbjct: 347 GLVTFDSTAQIQNYLIKITNTGDYKKITGNLPQQAVGGTSICRGLEAGFQAITSSDQSTS 406 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 + I+ LTDGE++ S C + K + I TI++ Sbjct: 407 GSE----------------IVLLTDGEDDLISS-------CFEVVKHSGAVIHTIAL-GP 442 Query: 373 PNGQRL 378 + L Sbjct: 443 KAAREL 448 >gi|126667415|ref|ZP_01738387.1| hypothetical protein MELB17_14151 [Marinobacter sp. ELB17] gi|126628171|gb|EAZ98796.1| hypothetical protein MELB17_14151 [Marinobacter sp. ELB17] Length = 774 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 44/144 (30%), Gaps = 12/144 (8%) Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQ--AYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 G QYV + + A++Q A + G + Sbjct: 95 GVWTFGQYVNMLVPHREVSDAWRDMAIEQSDAINSVAMRTNLGAAIETASDGYFTGGVLS 154 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKA--------KENFIKIVTISINASPNGQRLLKT 381 I LTDG+ + ++ ++ K+ + ++++A L+ Sbjct: 155 NTHFIVLTDGKVDISRNPSANKAEANRILDTLVPPLKQQGARFHAVALSA-EADTEFLRK 213 Query: 382 CVS-SPEYHYNVVNADSLIHVFQN 404 S S + NA+ L F + Sbjct: 214 LASDSNGSFHVAENANDLSRAFLD 237 >gi|218961690|ref|YP_001741465.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] gi|167730347|emb|CAO81259.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] Length = 270 Score = 42.2 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 60/189 (31%), Gaps = 41/189 (21%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 ++ I + +D+SGSM + AP + L A+ + D Sbjct: 15 IKTRDLSNKGVDIVMAIDISGSMLAM--------DFAPKNR---------LSAAVSVAKD 57 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + + GL+ ++ + ++ + + + + T + + Sbjct: 58 FVKRRPND----RFGLVAFSEYALTQVPLTFDHLAMLNSLDKLKVNEEASATAIGMGLAK 113 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A L S K II +TDG +N + + T AKE Sbjct: 114 AVARL------------------KNSTAKSKVIILITDGVSNTGEIDPLTAA--GMAKEL 153 Query: 361 FIKIVTISI 369 IK+ I + Sbjct: 154 GIKVYPIGV 162 >gi|229005450|ref|ZP_04163163.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock1-4] gi|228755812|gb|EEM05144.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock1-4] Length = 474 Score = 42.2 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 48/316 (15%), Positives = 103/316 (32%), Gaps = 40/316 (12%) Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 +KE+ + + +++N NS + + L +++ LQ + Sbjct: 99 MKEAARDTESNIKSADLKNKSNSEQADLYVHMMYSLLKQ--------EIIPFDKMPLQIL 150 Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 E + + +++ S + +R + IE+++D SGSM ++ Sbjct: 151 E--IGRVEDEEKKSNGTKGEQKNKEDRGNYNIEILLDASGSMAGKIDG------------ 196 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + KM K A+ F+ + +V VY G G +K G + + Sbjct: 197 ---KMKMDIAKEAIQQFVSDLPEAVNVSLRVY-GHKGSNDEKDKTASC--GAIENIYTLQ 250 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-E 340 + + + D + T + + Q K + I ++DG E Sbjct: 251 KYDQTTFRQSLDGFQPVGW-----TPLAEAIKKSTETFQSAKEND---KNIIYIVSDGVE 302 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLI 399 K+ + + + I I ++ LK S + +A L Sbjct: 303 TCGGNPVEEAQKVSNSNIKPIMNI--IGFQVDHEAEKQLKEIAEVSKGKYVLANSAKELQ 360 Query: 400 HVFQNISQLMVHRKYS 415 F+ + + R+ Sbjct: 361 DQFKETGKDITSRRLK 376 >gi|148250139|gb|ABQ53165.1| MtsD [Myxococcus fulvus HW-1] Length = 659 Score = 42.2 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 33/162 (20%), Positives = 62/162 (38%), Gaps = 25/162 (15%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM-------AALKNALLLFLDSIDLL 245 + +V+D SGSM ++ P + CQ ++ + A AL + + Sbjct: 61 VVVVIDESGSM--CVSDPPGAQLDSGFCQRREILDIIPEGVTEPARVRALKRLVQQFREV 118 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWG-------TEKVRQYVTRDMDSLILKPTDSTPAM 298 + +V + + + T V P+ + Y+ + S + K TD A+ Sbjct: 119 NAQGGNVQVSVAPFETNVRNVWPPATTGDRFARPDNNIDSYI-EGLQSQLGKGTDYQGAL 177 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF-IIFLTDG 339 AY ++ D +N LP ++ ++FLTDG Sbjct: 178 SYAYSFISLDINAVAQSNPEL-------LPRTRYVVVFLTDG 212 >gi|75907530|ref|YP_321826.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701255|gb|ABA20931.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 418 Score = 42.2 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 74/233 (31%), Gaps = 52/233 (22%) Query: 177 IPALLRIEMGERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 I E +R + + L++D SGSM N P + +K A Sbjct: 27 ISISAGAEPQDRTVPLNLCLILDHSGSM----NGRP----------------LEIVKQAA 66 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTR--VEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + +D + + ++ + R V + E++++ + R T Sbjct: 67 IRLVDRLKTGD------RLSVVAFDHRAKVLVPNQVIDNPEQIKKQINRLAAD---GGTA 117 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 ++ + L KK + F LTDGEN + +N +K Sbjct: 118 IDEGLRLGIEELAKGKKETISQAF-----------------LLTDGENEHGDNN-RCLKF 159 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNI 405 A + + T+ Q +L+ + + A+ + F + Sbjct: 160 AQLAAGYNLTLNTLGF-GDNWNQDVLEKIADAGLGSLSYIQKAEQAVDEFGRL 211 >gi|226358120|ref|YP_002787859.1| hypothetical protein Deide_2p00900 [Deinococcus deserti VCD115] gi|226319763|gb|ACO47757.1| Conserved hypothetical protein [Deinococcus deserti VCD115] Length = 418 Score = 42.2 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 36/240 (15%), Positives = 72/240 (30%), Gaps = 48/240 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++ PA + ++ +RP + V+D SGSM Sbjct: 21 TTLTLLIRVHPAPVTTQVSQRPPLNLAFVIDRSGSMSGL--------------------P 60 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMD 285 + K A + + + ++ + RV+ + + E V Q + D Sbjct: 61 LQMAKQAAIAAVRQARPDD------RVSVVAFDDRVDVIVPSQLATSREAVIQAIGTIDD 114 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ + Q + +L +I L+DG+ N Sbjct: 115 ---RGSTNLHGGWLE-------------GATQVAQHLTPGALNR---VILLSDGQANVGV 155 Query: 346 SNVNTIK-ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 ++ I E I TI + + + + LL + +V + L F+ Sbjct: 156 TDRREIARQVRGLTERGISTTTIGLGSHYDEELLLAIANAGDGNFEHVEDPSRLPTFFEE 215 >gi|209546584|ref|YP_002278502.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537828|gb|ACI57762.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 698 Score = 42.2 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 42/291 (14%), Positives = 94/291 (32%), Gaps = 48/291 (16%) Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNN 142 + ++ + A IK+ + +S +SA + + S + R++ N Sbjct: 221 DPSRERFANAAANPIKSVATDPVSTFSADVDSASYSFVRRSLTGGAMPDPQSVRVEEMIN 280 Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQ--RYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 Y+ D + ++ ++ +++ I P + ++D+S Sbjct: 281 YFPYDWAGPEKADQPFKATVTVMPTPWNHDTELMHVAIKGYDIAPATAPHANLVFLIDVS 340 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 GSM + K+ LK+A L + + V Y G G Sbjct: 341 GSMD-------------------EPDKLPLLKSAFRLLVSKLKADDTVSIVTYAGNAGTV 381 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFR 319 + E + + +D L T ++ AY N + Sbjct: 382 LEPTRVAE--------KSKILSAIDRLEAGGSTGGAEGIEAAY-------------NLAK 420 Query: 320 QGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENFIKIVTISI 369 Q + ++ TDG+ N S+ + +I ++ +++ I + + Sbjct: 421 QAFVKDGVNR---VMLATDGDFNVGPSSDEDLKRIIEEKRKDGIFLTVLGF 468 >gi|288573236|ref|ZP_06391593.1| von Willebrand factor type A [Dethiosulfovibrio peptidovorans DSM 11002] gi|288568977|gb|EFC90534.1| von Willebrand factor type A [Dethiosulfovibrio peptidovorans DSM 11002] Length = 225 Score = 42.2 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 65/183 (35%), Gaps = 40/183 (21%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P + LV+D+SGSM A + L + LF S+ Sbjct: 19 PRVPVSLVLDVSGSMLGA--------------------PIEELNRGVELFFKSLKDDDVA 58 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + + +I ++ V + E +G + + + T A+ A + L Sbjct: 59 RYSAEVSVISFSNEVTQ--EVDFGPLEK----CDIPELKAIGKTRMGGAVSLALESLEKR 112 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICDKAKENFIKIVT 366 K+ + +Q +++ +TDG+ N+ + T + DK K + + Sbjct: 113 KEL---------YRTLGVDYYQPWMVIMTDGKPNDDWQLAAAKTSALVDKGK---LTVFP 160 Query: 367 ISI 369 I+I Sbjct: 161 IAI 163 >gi|221123691|ref|XP_002160228.1| PREDICTED: similar to CnPolydom [Hydra magnipapillata] Length = 954 Score = 42.2 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 71/227 (31%), Gaps = 37/227 (16%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E + +++D SGSM S Q T K + L + +S Sbjct: 49 EHGASDLVILLDSSGSM-----------YSTGSFQGVTMTGFDIGKTFINALLSKVH-IS 96 Query: 247 HVKEDVYMGLIGYTTRVEKNIEP--SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + +G G +++ N + K + + + T+ A++ + Sbjct: 97 FNATRIAIGTFGTNHKIDINFILRPDYSMHKCKFKKDFEKIRIYGGMTNLRGALQDSLN- 155 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNTIK--------- 352 + + + T+ R + +I L+DGE N N T Sbjct: 156 IFRELDSNPDTHKKRHKT-------NRVVILLSDGEGNVMDNPNGRGVTHNDGLARNPHD 208 Query: 353 ICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVVNADSL 398 I + +++ TI + +A +L+ + + L Sbjct: 209 IAHNLRLGLVEVYTIGVTSAPDR--AVLEGLATEKNLFLFSKDFTDL 253 >gi|156382097|ref|XP_001632391.1| predicted protein [Nematostella vectensis] gi|156219446|gb|EDO40328.1| predicted protein [Nematostella vectensis] Length = 286 Score = 42.2 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 73/213 (34%), Gaps = 42/213 (19%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I ++D S SM + K+ ++ +D+ + Sbjct: 96 IDIAFLLDASASMG-----------------RRTWGKIKNYVKSI------VDMGDISDQ 132 Query: 251 DVYMGLIGYTTR--VEKNIEPSWGTEKVRQYVTRDMDSL--ILKPTDSTPAMKQAYQILT 306 ++G+I ++T ++ + G + + RD+D L T A+ A + L Sbjct: 133 GTHVGIITFSTDPVIDIPFDKYKGVKMNAVNIKRDIDELRRKKGYTFIDKALTLADKSLF 192 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIV 365 + + QK + ++DG + TI ++ K +++ Sbjct: 193 TQEAGM-------------REDSQKVAVLMSDGIQTKDRGPFTPTIIAANRLKMKGVQVY 239 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 T+ I AS + L+ S ++ + + L Sbjct: 240 TVGIGASIDVLELM-WIASEDTGLFDGQHLELL 271 >gi|12655866|gb|AAK00631.1| complement factor B [Halocynthia roretzi] Length = 1084 Score = 42.2 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 42/265 (15%), Positives = 92/265 (34%), Gaps = 22/265 (8%) Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF-LIELVVDLSGSMHCA 206 + V +Y I L + + + I E IF + + D+S S+ Sbjct: 485 LCVDPAYHSSKSDIARTLYKSLSNLPDQSLTISRTISASEVNIFHYVIFIFDVSKSVTKK 544 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 + A D+ + LK +++ + S + + ++T V++ Sbjct: 545 YQDFSSGIEFAKRLIDRLKNFGGVLKYSIIAYASSNKTQLEITDR-------FSTNVKEV 597 Query: 267 IE-PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 I+ +V++ V+ ++ T + A+K ++ F ++ Sbjct: 598 IKRLDNLDSQVKEAVSELIEETRSG-TATAKALKSLRDMML----------FMEHDIRND 646 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL--LKTCV 383 + + TDG +N K+ V K K + I+ +IS P+ + L Sbjct: 647 QTNDKCHVFLFTDGMHNEGKNPVEVRKEMQKIFGSNIEFYSISAQEDPSPEAFEELIGLA 706 Query: 384 SSPEYHYNVVNADSLIHVFQNISQL 408 S PE + + + L ++ + Sbjct: 707 SEPENYIYIEDIHLLSSYLDKVTDV 731 >gi|118387578|ref|XP_001026893.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89308660|gb|EAS06648.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 1074 Score = 42.2 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 31/140 (22%), Positives = 51/140 (36%), Gaps = 32/140 (22%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 I + +RP + V+D SGSMH K+ LK Sbjct: 346 ISLNTKGNFDAKAYQRPPIDLICVMDNSGSMHGE--------------------KINMLK 385 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE---KVRQYVTRDMDSLIL 289 LL +D +D +GL+ + + V S T K++QY++ D Sbjct: 386 ETLLYLIDQLDEKD------RLGLVLFNSEVTFRPMKSMDTTNKLKLKQYIS---DIRAQ 436 Query: 290 KPTDSTPAMKQAYQILTSDK 309 TD M +A++ + + K Sbjct: 437 GGTDINLGMTEAFKFIKTRK 456 >gi|332305539|ref|YP_004433390.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332172868|gb|AEE22122.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 1359 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 62/194 (31%), Gaps = 32/194 (16%) Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV---TRDMDSLILKPTDSTPAMKQAYQ 303 + + +Y G++ Y ++ + + G V V TR L D+T Sbjct: 492 PIVDTLYEGVLYYGG-LDVDYGLTRGNNSVSNTVRRNTRVSHRLSYTGQDATLPSGCEED 550 Query: 304 ILTSDK---KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE--NNNFKSNVNTI--KIC-- 354 L+S ++ + + I+ L+DGE NN+ + T+ C Sbjct: 551 NLSSSNCITQQIVQGARYLSPITDRQCQVNNHIVLLSDGEANNNHSVDEIETLLSASCTG 610 Query: 355 ------------------DKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNA 395 + ++ I TI A+ L Y N+ Sbjct: 611 SGGEKCGLSLVRNIADTEESVIDSRIITHTIGFAANTQANSFLNQIALQGGGGFYQADNS 670 Query: 396 DSLIHVFQNISQLM 409 L+ FQ+I + + Sbjct: 671 QELLGAFQSILKTV 684 >gi|301777181|ref|XP_002924014.1| PREDICTED: cartilage matrix protein-like [Ailuropoda melanoleuca] Length = 495 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 83/223 (37%), Gaps = 40/223 (17%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKED 251 +NSD + N T + L + + F++ I D L + Sbjct: 251 GFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKL 310 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDK 309 +GL+ Y++ V + G ++ + + S + K T + A+K D Sbjct: 311 AQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----LIDN 363 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + K+ I+F TDG + ++ +N KAK+ K+ + + Sbjct: 364 SFTVSSGARPGAQKVG-------IVF-TDGRSQDYINNAA-----KKAKDLGFKMFAVGV 410 Query: 370 -NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 NA + L+ S P E+++ + ++ + + + + + Sbjct: 411 GNAVEDE---LREIASEPVAEHYFYTADFKTINQIGKRLQKKI 450 >gi|228997913|ref|ZP_04157515.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock3-17] gi|228761788|gb|EEM10732.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock3-17] Length = 474 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 48/316 (15%), Positives = 103/316 (32%), Gaps = 40/316 (12%) Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 +KE+ + + +++N NS + + L +++ LQ + Sbjct: 99 MKEAARDTESNIKSADLKNKSNSEQADLYVHMMYSLLKQ--------EIIPFDKMPLQIL 150 Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 E + + +++ S + +R + IE+++D SGSM ++ Sbjct: 151 E--IGRVEDEEKKSNGTKGEQKNKEDRGNYNIEILLDASGSMAGKIDG------------ 196 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + KM K A+ F+ + +V VY G G +K G + + Sbjct: 197 ---KMKMDIAKEAIQQFVSDLPEAVNVSLRVY-GHKGSNDEKDKTASC--GAIENIYTLQ 250 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-E 340 + + + D + T + + Q K + I ++DG E Sbjct: 251 KYDQTTFRQSLDGFQPVGW-----TPLAEAIKKSTETFQSAKEND---KNIIYIVSDGVE 302 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLI 399 K+ + + + I I ++ LK S + +A L Sbjct: 303 TCGGNPVEEAQKVSNSNIKPIMNI--IGFQVDHEAEKQLKEIAEVSKGKYVLANSAKELQ 360 Query: 400 HVFQNISQLMVHRKYS 415 F+ + + R+ Sbjct: 361 DQFKETGKDITSRRLK 376 >gi|148698183|gb|EDL30130.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] Length = 456 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 74/190 (38%), Gaps = 29/190 (15%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K + +D++D+ + + GL+ Y++ + + G ++ + Sbjct: 247 RPENFELVKKFINQIVDTLDVSDRLAQV---GLVQYSSSIRQEFPL--GRFHTKKDIKAA 301 Query: 284 MD--SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + S + K T + A+K D + + K+ I+F TDG + Sbjct: 302 VRNMSYMEKGTMTGAALKY-----LIDNSFTVSSGARPGAQKVG-------IVF-TDGRS 348 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 ++ ++ KAK+ K+ + + + L+ S P ++++ + ++ Sbjct: 349 QDYINDAA-----RKAKDLGFKMFAVGVG--NAVEEELREIASEPVADHYFYTADFKTIN 401 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 402 QIGKKLQKQI 411 >gi|83594486|ref|YP_428238.1| von Willebrand factor, type A [Rhodospirillum rubrum ATCC 11170] gi|83577400|gb|ABC23951.1| von Willebrand factor, type A [Rhodospirillum rubrum ATCC 11170] Length = 575 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 32/225 (14%), Positives = 82/225 (36%), Gaps = 47/225 (20%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 G P + VVD SGSM + P ++++A K ++ Sbjct: 385 SWSPGREPR--MIFVVDGSGSMSEGIAGAP--------------SRISAAKRSIR----- 423 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 D ++ + +D+ +G++ ++ + T+ + Y + + + TP Sbjct: 424 -DTVNALHKDIRVGMVSFSDCMA--------TQNSKYYSAAERPAFLAGVDAITP----- 469 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 ++ + R+G + + + ++ ++DGE+ + KA+++ Sbjct: 470 -------ERATSLAASIRRGGALATRRSETVMMVVSDGEDTCG-GDPCAAARAVKAEKSN 521 Query: 362 IKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQN 404 + I I ++ G + C++S + +A + + Sbjct: 522 VIIHVIDLSG--GGNSGVARCIASAGGGRVFTPGSAAQVTSSLRT 564 >gi|297285598|ref|XP_002802831.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Macaca mulatta] Length = 900 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + + R + L T+ AM A Q+L S Sbjct: 311 --NLIVFSTEATQWRPSLVPASAENVNEARSFAAGIQA---LGGTNINEAMLVAVQLLDS 365 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 366 SNQ--------EERLPDGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|90413889|ref|ZP_01221875.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium profundum 3TCK] gi|90325073|gb|EAS41583.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium profundum 3TCK] Length = 714 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 41/262 (15%), Positives = 85/262 (32%), Gaps = 57/262 (21%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIF---LIELVVDLSGSMHCAMNSDPEDVNS 216 F +H+ Q Y + ++ + V+D+SGSM+ Sbjct: 299 FTQHVEGQGYGLLLTMPPQVNHQVNSTTSSALFHQSVTFVLDISGSMYGE---------- 348 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE-PSWGTEK 275 + K AL L + ++ + E T Sbjct: 349 ----------SIEQAKQALRYGLQQLQPEDSFN------IVTFNHEAMLYSEQLLPVTSS 392 Query: 276 VRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 R +D L T+ A+K A+ I T ++ S + I+ Sbjct: 393 TITRALRFVDGLDADGGTEMAAALKAAFSIKT--------------HDQLNSTRWLNQIV 438 Query: 335 FLTDGE--NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 F+TDG N + ++ ++ D+ ++ T+ I ++PN + + + + + Sbjct: 439 FITDGSVGNESALFDLIEQQLVDR------RLFTVGIGSAPNSYFMTRAAMKGKGTYTYI 492 Query: 393 VNADSLIH----VFQNISQLMV 410 + + +F ISQ ++ Sbjct: 493 GDVKEVNTKMRLLFSKISQPVM 514 >gi|114046077|ref|YP_736627.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113887519|gb|ABI41570.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 335 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 51/264 (19%), Positives = 79/264 (29%), Gaps = 46/264 (17%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + + L + L ++ E + ++VDLSGSM D Sbjct: 65 RGMLILSWLLIVTALAKP-SILGEVQTREAFGRDVLMLVDLSGSM---------DEADFT 114 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 T++ A KN L F+ GLI + T + Sbjct: 115 TADGSTLTRLNAAKNVLKTFIAK-RSGD------RFGLILFGDAAFIQTPF---TADQQV 164 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +++ ++ A Q F Q P Q+ +I LTD Sbjct: 165 WLSLLEEAQTG----------MAGQSTHLGDAIGLGIKVFEQN---PQPSEQQVMIVLTD 211 Query: 339 G-ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 G + +F V+ KI A IKI TI++ P V Sbjct: 212 GNDTGSFVEPVDAAKI---AAARGIKIYTIAMGDPT-------HVGEQPMDMEVVQRVSQ 261 Query: 398 LI--HVFQNISQLMVHRKYSVILK 419 L F I Q + + Y +I K Sbjct: 262 LTQARAFIAIDQAELDKAYQLIDK 285 >gi|149916833|ref|ZP_01905335.1| hypothetical protein PPSIR1_05713 [Plesiocystis pacifica SIR-1] gi|149822550|gb|EDM81939.1| hypothetical protein PPSIR1_05713 [Plesiocystis pacifica SIR-1] Length = 447 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 65/193 (33%), Gaps = 35/193 (18%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH----- 247 + +V+D SGSM N+ D N + + + N F DSI+ ++ Sbjct: 146 VMMVLDKSGSM--FTNTWDHDNNGGTPQITRWNSLYDVVDNITTTFDDSINFGANLFPST 203 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + +++Y T+ + + ++ + + T +T + AY LTS Sbjct: 204 LAQNIYGPQACTTSNFPEVTVGENNSAQILATIPGPGVTASYGGTPATLGVTTAYNHLTS 263 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD-----------K 356 + +I +TDG N ++ N ++ D Sbjct: 264 L-----------------DPELPRAMILVTDGAANCDQNAANNFQLFDVYDDGLPVIVGT 306 Query: 357 AKENFIKIVTISI 369 A N + + I Sbjct: 307 AAANGVPTYVVGI 319 >gi|118086119|ref|XP_426008.2| PREDICTED: similar to alpha 3 type VI collagen [Gallus gallus] Length = 2533 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 55/150 (36%), Gaps = 20/150 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + ++V +GLI +++ + + + V R + M+Q Sbjct: 862 KVDVGLDNVRIGLIQFSSEIREEFQLD--RYSTIADVQRAIQE-----------MQQIKL 908 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + K +F ++F K ++++I +TDGE + +V + ++ I Sbjct: 909 GTLTGKALTFAASYFD-RPKGGRPELKQYLIVITDGE---AQDSVKSPAR--AIRDKGIT 962 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVV 393 I I N +L++ + + + Sbjct: 963 IYAID-MLQANNSQLVEITGAQDKVFFESE 991 >gi|99031847|pdb|2B2X|A Chain A, Vla1 Rdeltah I-Domain Complexed With A Quadruple Mutant Of The Aqc2 Fab gi|99031850|pdb|2B2X|B Chain B, Vla1 Rdeltah I-Domain Complexed With A Quadruple Mutant Of The Aqc2 Fab Length = 223 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 68/185 (36%), Gaps = 25/185 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + D + + + +G++ Y V + V + ++ + Sbjct: 42 IAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNL--NKYSSTEEVLVAANKIVQRGGRQ 99 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 T A I T+ K+ R+G +K ++ +TDGE N ++ Sbjct: 100 T---MTALGIDTARKEAFTEARGARRG-------VKKVMVIVTDGE---SHDNYRLKQVI 146 Query: 355 DKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQN 404 ++ I+ +I+I N L +K+ S P ++ +NV + +L+ + + Sbjct: 147 QDCEDENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKA 206 Query: 405 ISQLM 409 + + + Sbjct: 207 LGERI 211 >gi|66805723|ref|XP_636583.1| hypothetical protein DDB_G0288697 [Dictyostelium discoideum AX4] gi|60464969|gb|EAL63080.1| hypothetical protein DDB_G0288697 [Dictyostelium discoideum AX4] Length = 585 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 42/233 (18%), Positives = 89/233 (38%), Gaps = 21/233 (9%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 I + + E + +V+DLSGSM A + + + + +++ + Sbjct: 144 STEPIKKQVELYESEIKQLDV-IVLDLSGSMKSA------AFKGSRVPGELEMSRIELAQ 196 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 F D L V +GL+ + R+E + + + + + + T Sbjct: 197 TLFQTFTDKAISLEVP---VAVGLVTFGERIELTFDLTRNFDSFSTELGEVVANQCK--T 251 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF-LTDGENNNFKSNVNTI 351 A++ A + L K+ G+K+ S P + +F LTDGE+N SN + Sbjct: 252 RLFEAIQLAAETLVKFKESCDAA--ATGGLKLSSNPMLR--VFCLTDGEDN---SNFDPY 304 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + K++ I + +I I + L ++ + ++ + +F+ Sbjct: 305 PVYQYMKKHNIILDSIPIGLDGRER-LSSFSQATGGSCFIADSSLEGVELFER 356 >gi|326532158|dbj|BAK01455.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 674 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 45/327 (13%), Positives = 91/327 (27%), Gaps = 84/327 (25%) Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD-YRLQFIEHLLN 166 TE + VNS ++ F ++ +++D + + + Sbjct: 185 NDDEPLQQTEATDSVNSRSKKTAEISTYP-------EFSSIPQSSTHDGFSVLIHLKAPS 237 Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 +Q + R + V+D+SGSM T Sbjct: 238 ASPDQATCRLVNESSVRSPSGRAPVDLVTVIDISGSMAG--------------------T 277 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTR 282 K+A LK A+ + + + +I +++ + +G ++ Q + Sbjct: 278 KLALLKRAMGFVIQHLGPSD------RLSVIAFSSSARRLFHLQRMSHYGRQQALQAINS 331 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T+ A+K+A T + + N II L+DG+ Sbjct: 332 LGAG---GGTNIADALKKA----TKVIEDRSYKNSVCS------------IILLSDGQ-- 370 Query: 343 NFKSNVNTIKICDKAK------------------ENFIKIVTISINASPNGQRLLKTCVS 384 +T IC + + I A + L + Sbjct: 371 ------DTYNICSNVRGGSKDYSSLVPPSILSDTRRMLPIHAFGFGADHDSDSLHSIAEA 424 Query: 385 SPEYHYNVVNADSLIHVFQN-ISQLMV 410 S + + + F I L+ Sbjct: 425 SGGTFSFIEDEGVMQDAFAQCIGGLLS 451 >gi|326513050|dbj|BAK03432.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326519604|dbj|BAK00175.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326532408|dbj|BAK05133.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 700 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 45/327 (13%), Positives = 91/327 (27%), Gaps = 84/327 (25%) Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD-YRLQFIEHLLN 166 TE + VNS ++ F ++ +++D + + + Sbjct: 185 NDDEPLQQTEATDSVNSRSKKTAEISTYP-------EFSSIPQSSTHDGFSVLIHLKAPS 237 Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 +Q + R + V+D+SGSM T Sbjct: 238 ASPDQATCRLVNESSVRSPSGRAPVDLVTVIDISGSMAG--------------------T 277 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTR 282 K+A LK A+ + + + +I +++ + +G ++ Q + Sbjct: 278 KLALLKRAMGFVIQHLGPSD------RLSVIAFSSSARRLFHLQRMSHYGRQQALQAINS 331 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T+ A+K+A T + + N II L+DG+ Sbjct: 332 LGAG---GGTNIADALKKA----TKVIEDRSYKNSVCS------------IILLSDGQ-- 370 Query: 343 NFKSNVNTIKICDKAK------------------ENFIKIVTISINASPNGQRLLKTCVS 384 +T IC + + I A + L + Sbjct: 371 ------DTYNICSNVRGGSKDYSSLVPPSILSDTRRMLPIHAFGFGADHDSDSLHSIAEA 424 Query: 385 SPEYHYNVVNADSLIHVFQN-ISQLMV 410 S + + + F I L+ Sbjct: 425 SGGTFSFIEDEGVMQDAFAQCIGGLLS 451 >gi|291399431|ref|XP_002716111.1| PREDICTED: matrilin 1, cartilage matrix protein [Oryctolagus cuniculus] Length = 497 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 81/222 (36%), Gaps = 47/222 (21%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKED 251 +NSD + N T + L + + F++ I D L Sbjct: 253 GFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDRL 312 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDK 309 +GL+ Y++ V + G ++ + + S + K T + A+K D Sbjct: 313 AQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----LIDN 365 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + K+ I+F TDG + ++ ++ KAK+ K+ + + Sbjct: 366 SFTVSSGARPGAQKVG-------IVF-TDGRSQDYINDAA-----KKAKDLGFKMFAVGV 412 Query: 370 -NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 NA + L+ S P E+++ + F+ I+Q+ Sbjct: 413 GNAVEDE---LREIASEPVAEHYFYTAD-------FKTITQI 444 >gi|219804750|ref|NP_001137338.1| cartilage matrix protein [Bos taurus] gi|296490178|gb|DAA32291.1| matrilin 1, cartilage matrix protein [Bos taurus] Length = 497 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 76/191 (39%), Gaps = 31/191 (16%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K + +D++D+ + +GL+ Y++ V + G ++ + Sbjct: 288 RPENFELVKKFINQIVDTLDVSD---KLAQVGLVQYSSSVRQEFPL--GRFHTKKDIKAA 342 Query: 284 MD--SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + S + K T + A+K D + + K+ I+F TDG + Sbjct: 343 VRNMSYMEKGTMTGAALKY-----LIDNSFTVSSGARPGAQKVG-------IVF-TDGRS 389 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 ++ ++ KAK+ K+ + + NA + L+ S P E+++ + ++ Sbjct: 390 QDYINDAA-----KKAKDLGFKMFAVGVGNAVEDE---LREIASEPVAEHYFYTADFKTI 441 Query: 399 IHVFQNISQLM 409 + + + + + Sbjct: 442 NQIGKKLQKRI 452 >gi|326478189|gb|EGE02199.1| U-box domain-containing protein [Trichophyton equinum CBS 127.97] Length = 741 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 28/150 (18%), Positives = 55/150 (36%), Gaps = 11/150 (7%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + +VS P L I LV+D+SGSM NS +D Sbjct: 44 PIPNKDSMVVSIQPPLKPENDVPHVPCDIVLVIDISGSM----NSAAPIPTGERGGEDTG 99 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + + K+A +++++ + ++ + T V E ++ + V + Sbjct: 100 LSILDLTKHAAKTIIETLNEKD------RLAVVTFCTEVNVAFELDSMNKENKSTVLSAI 153 Query: 285 DSLILK-PTDSTPAMKQAYQILTSDKKRSF 313 D L K T+ +K+ +L ++ R Sbjct: 154 DKLYGKSSTNLWHGIKKGLNVLATNPVRGN 183 >gi|326474578|gb|EGD98587.1| hypothetical protein TESG_05957 [Trichophyton tonsurans CBS 112818] Length = 741 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 28/150 (18%), Positives = 55/150 (36%), Gaps = 11/150 (7%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + +VS P L I LV+D+SGSM NS +D Sbjct: 44 PIPNKDSMVVSIQPPLKPENDVPHVPCDIVLVIDISGSM----NSAAPIPTGERGGEDTG 99 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + + K+A +++++ + ++ + T V E ++ + V + Sbjct: 100 LSILDLTKHAAKTIIETLNEKD------RLAVVTFCTEVNVAFELDSMNKENKSTVLSAI 153 Query: 285 DSLILK-PTDSTPAMKQAYQILTSDKKRSF 313 D L K T+ +K+ +L ++ R Sbjct: 154 DKLYGKSSTNLWHGIKKGLNVLATNPVRGN 183 >gi|309812068|ref|ZP_07705828.1| Tat pathway signal sequence domain protein [Dermacoccus sp. Ellin185] gi|308433947|gb|EFP57819.1| Tat pathway signal sequence domain protein [Dermacoccus sp. Ellin185] Length = 597 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 32/229 (13%), Positives = 82/229 (35%), Gaps = 55/229 (24%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + +++D+SGSM ++ ++++ +++ I L + + Sbjct: 395 MLVLIDVSGSMQTKIDGG--------------QSRIELMESTA------IAALDVLPKTT 434 Query: 253 YMGLIGYTTRVEKN----IEPSWGTEKV--RQYVTRDMD--------SLILKPTDSTPAM 298 +G +++ ++KN + + G + + Y + + T + Sbjct: 435 RLGAWAFSSNLQKNHVDYLPLTNGEQPILDDTYRNGLIAKAHTLPGLAAKNGDTALYDTI 494 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 AY+ +T ++ + ++ LTDG N++ + ++ + K Sbjct: 495 AAAYKSVTDTYDPNYVNS----------------VVVLTDGTNDDPNGGLALDQLLARLK 538 Query: 359 EN-----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 +KIVTIS+ + L + ++ Y + + VF Sbjct: 539 SQYSADKPVKIVTISLGTGTDPDALKRIAKATDGLSYQTKTPEQISGVF 587 >gi|297285602|ref|XP_002802832.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 3 [Macaca mulatta] Length = 888 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 263 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 298 Query: 253 YMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++T + + + R + L T+ AM A Q+L S Sbjct: 299 --NLIVFSTEATQWRPSLVPASAENVNEARSFAAGIQA---LGGTNINEAMLVAVQLLDS 353 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + S+ II LTDG+ ++N +I+ Sbjct: 354 SNQ--------EERLPDGSVSL---IILLTDGDPTVGETNPRSIQ 387 >gi|148226222|ref|NP_001089834.1| hypothetical protein LOC734900 [Xenopus laevis] gi|80477144|gb|AAI08519.1| MGC130922 protein [Xenopus laevis] Length = 840 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 72/179 (40%), Gaps = 30/179 (16%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPAMK 299 +D L ++ +GLI Y+T V + + V++ V++ + + + + A+K Sbjct: 598 LDSLEISQKAARVGLIQYSTHVRTEFTMAQYSSAKDVKKAVSQI--KYMGRGSMTGLALK 655 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 ++ S+ QG + + + I TDG ++ + +KAK+ Sbjct: 656 LMHEKSFSE----------AQGARARPMRVPRVAIVFTDG-----RAQDEVSEYAEKAKQ 700 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + I I I I + + + L+ S+P ++ + F + +M K S+ Sbjct: 701 SGITIYAIGIGKAIDEE--LQEIASAPQEKHVIYAED-------FSAMGYIMEKLKSSM 750 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 66/174 (37%), Gaps = 26/174 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDST 295 + + L ++ +GL+ Y + V+ T K + + R + ++ T + Sbjct: 75 LITMLKFLDIGPDNTRVGLLQYGSTVKNEFSLK--TYKRKPDIERAVKRMMHLATGTMTG 132 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A I S+ + + N + + + +TDG + + I Sbjct: 133 LAIQYAMNIAFSEAEGARPLNQY----------VPRIAMIVTDGRPQDPVAE-----IAA 177 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADS---LIHVFQN 404 KA+ + I I I + LKT S P E+ + V N L VFQN Sbjct: 178 KARNSGILIFAIGVG--RVDMSTLKTIGSQPHSEHVFLVANFSQIETLTSVFQN 229 >gi|156257452|gb|ABU63134.1| microneme 1 precursor [Eimeria tenella] Length = 675 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 52/167 (31%), Gaps = 34/167 (20%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGS+ + + D ++ + E Sbjct: 13 LDVMLVVDESGSIGT-----------------------SNFRKVRQFIEDFVNSMPISPE 49 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 DV +GLI + TR + W + S + ST Y L KK Sbjct: 50 DVRVGLITFATR----SKVRWNLSDPKATNPSLAISAARSLSYSTGVTYTHY-GLQDAKK 104 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + TN + K ++ +TDG +N ++ A Sbjct: 105 LLYDTNAGARN------NVPKLVLVMTDGASNLPSQTRSSAAALRDA 145 >gi|302336645|ref|YP_003801851.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301633830|gb|ADK79257.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 474 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 36/242 (14%), Positives = 80/242 (33%), Gaps = 44/242 (18%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 L E I L++D SGSM+ ++ DP + + AL+ + Sbjct: 82 SLAPNPHESQGLSILLLMDNSGSMYDTLSGDPTGDPALMRTTYAR----NALRTFVGSSF 137 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + D +S + T V + + ++L P Sbjct: 138 HAGDSVS---------FATFNTNV---------VLHADEAGDPVVMDMLLSGIR-RPGTD 178 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF---KSNVNTIK---- 352 ++Y L + + + ++ +I L+DGE+ ++ N + I Sbjct: 179 ESYTEL--------YHALADMALPVGERSGRRAVIVLSDGEDYSYATHSGNPHPIYGNQQ 230 Query: 353 -----ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + ++ N + + I Q L + + + Y+ + + L ++ +I Q Sbjct: 231 LSPDEVVEEYIRNGVTLYAIHFGL-EKDQYLGEMALKTGGAVYDAKDQEELTGIYHDIRQ 289 Query: 408 LM 409 + Sbjct: 290 KI 291 >gi|159027742|emb|CAO89612.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 416 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 62/204 (30%), Gaps = 44/204 (21%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 NQ +Q+ + A ++ + LV+D SGSM Sbjct: 17 NQSSSQRQLMLSIAATSEQINTNLPINLCLVLDHSGSMQGK------------------- 57 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K A L ++S+ + + +I + R + + + + ++ Sbjct: 58 -PLETVKKAALSLIESLGVND------RLSVIAFDHRAKVILPSQSRQDDLTLIRSKIQQ 110 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T +K Q S + I LTDGEN + Sbjct: 111 LRAGGGTAIDEGIKLG-----------------IQESSSGSKGYVSHIFLLTDGENEHG- 152 Query: 346 SNVNTIKICDKAKENFIKIVTISI 369 N +K+ A E I + T Sbjct: 153 DNQRCLKLAAVAAEYGITLNTFGF 176 >gi|293571190|ref|ZP_06682227.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291608742|gb|EFF38027.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1219 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 46/281 (16%), Positives = 96/281 (34%), Gaps = 59/281 (20%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 E + I LVVD+SGSM + ++ D + A +N + FL +I Sbjct: 331 EQQDIKPVDIVLVVDMSGSMESSQSNGWND-------------RAGAARNGVKNFLQTIK 377 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSW-----GTEKVRQYVTRDMDSLI---LKPTDST 295 + + V +GL+G+++ + G ++ D+L T + Sbjct: 378 DA-GIGDYVNVGLVGFSSPGYVTGPNGYLTVPIGKASDTSHINAINDALKPKFTGGTYTQ 436 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKI------------PSLPFQKFIIFLTDGENNN 343 ++Q Q+L + GV + KF L + N + Sbjct: 437 IGIEQGQQMLAGSSNENKMMIVLTDGVPTFSKKVTAAQTIDGTTYATKFGNTLDEPRNTS 496 Query: 344 -----------------FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK------ 380 + T+ AK+ + I T+ I S +G L + Sbjct: 497 KLNSSYEVGSWGNRTNINSTWPATLGAAKIAKDAGLTIHTLGIQLSKDGNFLTEQQVRDR 556 Query: 381 -TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 + +++P + + + + ++ +V + ++ I+ G Sbjct: 557 ASLIATPGKYKDAETTNDVSDYLNEQAKNVV-KSFNTIVNG 596 >gi|303251581|ref|ZP_07337755.1| hypothetical protein APP6_0784 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307252106|ref|ZP_07534005.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|302649579|gb|EFL79761.1| hypothetical protein APP6_0784 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306860406|gb|EFM92420.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 6 str. Femo] Length = 538 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 40/269 (14%), Positives = 79/269 (29%), Gaps = 26/269 (9%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 ++ I E ++++ L + L LI + + + + A+L+ ++ Sbjct: 6 LSQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAE 65 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + + SN + D + + ++ + F + +N I Sbjct: 66 NNNGRKDNDYKLSGSSNKENDSF--DISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPI 123 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 T N +T + S IEH V IP + + Sbjct: 124 CKTV---NNTSGKGHTSSSEVTCTVSGT-----IEHKSWFPLKVGTVEVIPQQVDVASKS 175 Query: 188 RPI--------FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + +V DLSGSM +D +K+ L+ L Sbjct: 176 KAFKKNTFNIPIDLMVVADLSGSM--------KDGIKGEKLNGGTNSKIYILREVLKELA 227 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 D E +G+ + E E Sbjct: 228 DKSLFTQEANEYNRIGITAFAMGAEHPKE 256 >gi|169829413|ref|YP_001699571.1| BatA [Lysinibacillus sphaericus C3-41] gi|168993901|gb|ACA41441.1| BatA [Lysinibacillus sphaericus C3-41] Length = 973 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 46/297 (15%), Positives = 95/297 (31%), Gaps = 62/297 (20%) Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 +S I+ T N S + Y ++ + + ++ RY + + Sbjct: 622 ASFITNTSDPLNNTGSPGTAVAYFDSIIYGKSKVEATLVNPIDPRYATILKGLKDKTVTK 681 Query: 184 EMGERPIFL---------IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 ++ P F + VVD S SM ++ Sbjct: 682 DIFTNPNFSKNSCSLATEMAYVVDYSSSMKAVDPTNYRGKK------------------- 722 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 ++ F++ + +++ + K GT S TD Sbjct: 723 MIEFINQLKAKNNIVIETNT----------KATILGEGTTDAVLKKDLYKASKDKGATDI 772 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + A ++D K + K I+ ++DG+ + K IK Sbjct: 773 FAGIDIALTKFSNDTKTA------------------KAIVVVSDGKTSKSK----MIKAI 810 Query: 355 DKAKENFIKIVTISI--NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + AK+ +KI T+S+ + N L++ + +Y+ ++ L VFQ + + Sbjct: 811 NDAKKQGVKIYTVSMGKKSQINDATLMQLSTETGGAYYHALDNLQLHQVFQKLIDAI 867 >gi|116283392|gb|AAH14765.1| Itga1 protein [Mus musculus] Length = 685 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 65/184 (35%), Gaps = 25/184 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D + + + +G++ Y V K + + + Sbjct: 191 AFLNDLLKRMDIGPKQTQVGIVQYGANVTHEFNL----NKYSSTEEVLVAANKIGRRGGL 246 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 M A I T+ K+ R+G +K ++ +TDGE N ++ Sbjct: 247 QTM-TALGIDTARKEAFTEARGARRG-------VKKVMVIVTDGE---SHDNYRLKQVIQ 295 Query: 356 KAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQNI 405 ++ I+ +I+I N L +K+ S P ++ +NV + +L+ + + + Sbjct: 296 DCEDENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKAL 355 Query: 406 SQLM 409 + + Sbjct: 356 GERI 359 >gi|313240178|emb|CBY32528.1| unnamed protein product [Oikopleura dioica] Length = 524 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 75/190 (39%), Gaps = 17/190 (8%) Query: 229 AALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE------KVRQYVT 281 +K L L +D++ D +S + ++ Y++ V++++ G+ +++ Sbjct: 237 EHVKRTLGLMIDNLCDGISPDTN--RVAMLRYSSDVKEDLNFIEGSNEPKVMRNIQRLKY 294 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + T + AM +A + + + + + G+K+ + ++ +TDGE+ Sbjct: 295 KPITDDRHGSTYTAHAMDKALKTIFT-SEAGWRNGTTEDGIKVRTE-----VVIITDGES 348 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 N+ K E IK+ + + + + E + +++ L Sbjct: 349 NDPDETFTIQGQKVKYDEYGIKVYALGV-GDIKKDEIRQLTSMDDESIFYLMSWKDL-AA 406 Query: 402 FQNISQLMVH 411 F I + ++ Sbjct: 407 FNRIIETLIE 416 >gi|313230659|emb|CBY18875.1| unnamed protein product [Oikopleura dioica] Length = 524 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 75/190 (39%), Gaps = 17/190 (8%) Query: 229 AALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE------KVRQYVT 281 +K L L +D++ D +S + ++ Y++ V++++ G+ +++ Sbjct: 237 EHVKRTLGLMIDNLCDGISPDTN--RVAMLRYSSDVKEDLNFIEGSNEPTVMRNIQRLKY 294 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + T + AM +A + + + + + G+K+ + ++ +TDGE+ Sbjct: 295 KPITDDRHGSTYTAHAMDKALKTIFT-SEAGWRNGTTEDGIKVRTE-----VVIITDGES 348 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 N+ K E IK+ + + + + E + +++ L Sbjct: 349 NDPDETFTIQGQKVKYDEYGIKVYALGV-GDIKKDEIRQLTSMDDESIFYLMSWKDL-AA 406 Query: 402 FQNISQLMVH 411 F I + ++ Sbjct: 407 FNRIIETLIE 416 >gi|332977607|gb|EGK14375.1| von Willebrand factor type A domain protein [Psychrobacter sp. 1501(2011)] Length = 556 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 43/260 (16%), Positives = 86/260 (33%), Gaps = 53/260 (20%) Query: 161 IEHLLNQRYNQKI-VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 + N+ + V + +P + +VD+SGSM+ Sbjct: 168 MVKSPWHATNRIVKVGIKAEDVLAAKQNQPAANLVFLVDVSGSMNSD------------- 214 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+ K +L + + + Y G T V T+K+ Sbjct: 215 ------DKLQLAKASLKMLTKQLRAQDTITLITYAG----NTEVVLPATSGNQTQKI--- 261 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + T+ A+K AYQ NF +QG+ I+ LTDG Sbjct: 262 LNAIDNLSANGSTNGEAAIKLAYQQ--------AEENFKKQGINR--------ILMLTDG 305 Query: 340 ENNNFKSNVNT-IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA--- 395 + N SNV + I ++ I + T+ +++ + +Y+ +++ Sbjct: 306 DFNVGVSNVKDMLDIIRNNRDKGISLSTLGFGQGNYNDHMMEQVADNGNGNYSYIDSLSE 365 Query: 396 ------DSLIHVFQNISQLM 409 D + F +++ + Sbjct: 366 AKKVLIDEMSSTFNTVAKDV 385 >gi|187939945|gb|ACD39081.1| hypothetical protein PACL_0293 [Pseudomonas aeruginosa] Length = 223 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 33/239 (13%), Positives = 78/239 (32%), Gaps = 37/239 (15%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + L + ++ I ++ D SGSM K+ AL L Sbjct: 1 MSELKKFQVQTARPLPIIVLADTSGSMSVD-------------------GKIEALNKGLK 41 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + S S ++ ++ + +I + + + P +++ + + T Sbjct: 42 DMISSFAGESRLRAEIQVSVITFGGSLAELNLPLTPAHQLQSF----TPLVAEGMTPLGG 97 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ A + IPS ++ I+ ++DG N+ ++ + Sbjct: 98 ALSLA-------------SEMIEDKDSIPSRAYKPVIVLVSDGYPNDDWQGP-FARLVNG 143 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + ++I A + L ++ NA + F+ ++ + R S Sbjct: 144 ERSSKATRFAMAIGADADEVMLSDFANDPEAPLFHAENARDIHRFFRAVTMSVSARSQS 202 >gi|149701450|ref|XP_001492635.1| PREDICTED: anthrax toxin receptor 2 [Equus caballus] Length = 488 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 33/242 (13%), Positives = 68/242 (28%), Gaps = 52/242 (21%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + R F + V+D SGS+ ++ L Sbjct: 32 SAQEQPSCRGAFDLYFVLDKSGSVA------------------NNWIEIYDFVKQLTERF 73 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 V + + I ++++ + + K+ Q + + T +K Sbjct: 74 --------VSPQMRLSFIVFSSQATIILPLTGDRGKISQGLEDLKHVRPVGETYIHEGLK 125 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKA 357 A Q K L II LTDG + K+ + Sbjct: 126 LA----------------NDQIQKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKL---S 166 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + ++ + + Q L+ S E + V + I ++ + + I Sbjct: 167 RSLGARVYCVGVL--DFEQAQLERIADSKEQVFPVTGGFQ---ALKGIINSILAQSCTEI 221 Query: 418 LK 419 L+ Sbjct: 222 LE 223 >gi|28899191|ref|NP_798796.1| hypothetical protein VP2417 [Vibrio parahaemolyticus RIMD 2210633] gi|28807415|dbj|BAC60680.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] Length = 431 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 18/105 (17%), Positives = 42/105 (40%), Gaps = 6/105 (5%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++K ++ ++ ++ L + F I + K +++A + A LAGA +V++ + Sbjct: 20 RTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGA--VVADKTED 77 Query: 76 GDRFE----SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 D+ E + + + F + + S F N Sbjct: 78 VDQAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNA 122 >gi|257052678|ref|YP_003130511.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691441|gb|ACV11778.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 1100 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 66/217 (30%), Gaps = 49/217 (22%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D SGSM A ++ K + F+ + E Sbjct: 513 IDLAFVIDESGSMGGA--------------------RIQDAKASAKRFVGGL------YE 546 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 D L+ + + V + + ++ +++A LTS+ + Sbjct: 547 DDRAALVSFAGGATLGQSLTTDHGAVNASIDQLNAGGGT---NTGAGLQKAVDELTSNGE 603 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 II L DG + + I A E+ I I TI + Sbjct: 604 GDTQE-----------------IILLADG---GTGLGPDPVTIAQTADEHRITINTIGMG 643 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + Q L ++ Y V ++ L VF + Q Sbjct: 644 TGIDAQELTSIADATGGEFYQVSDSSELPEVFDRVEQ 680 >gi|77567855|gb|AAI07522.1| Matn3b protein [Danio rerio] Length = 299 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 63/178 (35%), Gaps = 34/178 (19%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDST 295 + +D L + + L+ Y + V + +V+Q +R + T + Sbjct: 92 LSEMVDSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDP--LSTGTMTG 149 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI----IFLTDGENNNFKSNVNTI 351 A+K A + + ++ + P +K I I +TDG + V + Sbjct: 150 MAIKTAMEQVFTENA--------------GARPLKKGIGKVAIIVTDG---RPQDKVEEV 192 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 A+ + I+I + + R LK S P ++ + V + L F+ Sbjct: 193 S--AAARASGIEIYAVGV--DRAEVRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRE 246 >gi|7766811|pdb|1CK4|A Chain A, Crystal Structure Of Rat A1b1 Integrin I-Domain. gi|7766812|pdb|1CK4|B Chain B, Crystal Structure Of Rat A1b1 Integrin I-Domain Length = 198 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 66/185 (35%), Gaps = 25/185 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + D + + + +G++ Y V K + + + Sbjct: 24 IAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNL----NKYSSTEEVLVAANKIGRQGG 79 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 M A I T+ K+ R+G +K ++ +TDGE N ++ Sbjct: 80 LQTM-TALGIDTARKEAFTEARGARRG-------VKKVMVIVTDGE---SHDNYRLKQVI 128 Query: 355 DKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQN 404 ++ I+ +I+I N L +K+ S P ++ +NV + +L+ + + Sbjct: 129 QDCEDENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKA 188 Query: 405 ISQLM 409 + + + Sbjct: 189 LGERI 193 >gi|15840942|ref|NP_335979.1| hypothetical protein MT1528 [Mycobacterium tuberculosis CDC1551] gi|13881148|gb|AAK45793.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551] Length = 335 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 71/234 (30%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D + ++M A + A F D + + Sbjct: 99 VMLVIDVSQSM---------------RATDVEPSRMVAAQEAAKQFADELTPG------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ E + + + + T + A+ A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR---TATGEAIFTALQAIATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICD-KAKENFIKIVTISI 369 + I+ +DG+ N AK+ + I TIS Sbjct: 194 ---------IGGGDTXPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISF 244 Query: 370 NAS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P +K S YN L V+ ++ Q + Sbjct: 245 GTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELRAVYSSLQQQI 298 >gi|326426681|gb|EGD72251.1| hypothetical protein PTSG_11571 [Salpingoeca sp. ATCC 50818] Length = 1748 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 31/259 (11%), Positives = 74/259 (28%), Gaps = 61/259 (23%) Query: 157 RLQFIEHLLNQRYNQKIVS----FIPALLRIEMG-ERPIFLIELVVDLSGSMHCAMNSDP 211 + +L ++S +L+ R F + LV+D SGS+ Sbjct: 7 PAVGVLAVLWPVLLTLLISCCSGPSSSLVAATPDCTRTDFDLVLVLDESGSVG------- 59 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 M ++ L + L + + + ++T Sbjct: 60 ----------------MEDWQHTLTFASHFVRALDTETNSIQVAAVTFSTDPVLQFSL-- 101 Query: 272 GTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 T +++T + SL TD+ A++ + + + Sbjct: 102 NTYSTEEHITNALTSLPYAGKSTDTGAALELVLSGVLDNPAGGYRGGRA----------- 150 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NASPNGQRLLKTCVSS 385 ++ +TDG + +++ + + + + N L + Sbjct: 151 --VVVVMTDGHT----QDPERLQLAAPLLKARADVFAVGVGDNINVPE---LFLIASAPT 201 Query: 386 PEYH-----YNVVNADSLI 399 + + VVNA+ + Sbjct: 202 ESHVRWSSDFTVVNANEFV 220 >gi|293361343|ref|XP_236593.5| PREDICTED: collagen type VI alpha 6 [Rattus norvegicus] Length = 2264 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 75/235 (31%), Gaps = 48/235 (20%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + + E I +VD SGS+ + +KM K + Sbjct: 605 VVQEICAEEACRDMKADIMFLVDSSGSIGL-----------------ENFSKM---KVFM 644 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + V +G++ ++ + + T + + +D + + Sbjct: 645 KNLVSKSQI---GADRVQIGVVQFSHENREEFQL--NTFMSQNDIANAIDQMAHIGETTL 699 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + ++ N +KF+I +TDGE + + Sbjct: 700 TGSALTFVSQYFSPEKGARPN------------VRKFLILITDGEAQDIVKDPAV----- 742 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 +++ + I ++ + S Q L+ PE + V N + Q+I +V Sbjct: 743 ALRKDGVIIYSVGVFGSNVTQ--LEEISGKPEMVFYVEN----FDILQHIEDDLV 791 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 43/343 (12%), Positives = 109/343 (31%), Gaps = 37/343 (10%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + L ++ I +A + + + I ++ S+ Sbjct: 85 NRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRDKKQFPPILVVLASAESED 144 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE---HLLNQRYNQKIVSFIP---ALLRI 183 ++ + ++ V + + L+ + N R + + F P +++ Sbjct: 145 DVEEASKALREDGVKIISVGVQKASEEDLKAMATSQFHFNLRTARDLSMFAPNMTQIIKD 204 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 R +++V+ C S + V + + + + LK L +S+ Sbjct: 205 VTQYREAMTDDIIVE-----ACQGPSVADVVFLLDMAINGSQENLDHLKAFLG---ESVS 256 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 L + + +GL+ Y+ S G + V + + D +P + QAY Sbjct: 257 ALDIKENCMRVGLVAYSNETRVISSLSMGVN--KTEVLQRIQ-------DLSPHVGQAYT 307 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + +++ F Q + + + +T + + T + + + Sbjct: 308 G--AALRKTRKEVFSAQRGSRKNQGVPQIAVLVT----HRASDDNVTKAAVN-LRREGVT 360 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + T+ + + Q L+ S Y S + F ++ Sbjct: 361 VFTMGVEGANPEQ--LEKIAS-----YPAEQFTSKLSNFSELA 396 >gi|293349448|ref|XP_002727144.1| PREDICTED: collagen type VI alpha 6-like [Rattus norvegicus] Length = 2264 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 75/235 (31%), Gaps = 48/235 (20%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + + E I +VD SGS+ + +KM K + Sbjct: 605 VVQEICAEEACRDMKADIMFLVDSSGSIGL-----------------ENFSKM---KVFM 644 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + V +G++ ++ + + T + + +D + + Sbjct: 645 KNLVSKSQI---GADRVQIGVVQFSHENREEFQL--NTFMSQNDIANAIDQMAHIGETTL 699 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + ++ N +KF+I +TDGE + + Sbjct: 700 TGSALTFVSQYFSPEKGARPN------------VRKFLILITDGEAQDIVKDPAV----- 742 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 +++ + I ++ + S Q L+ PE + V N + Q+I +V Sbjct: 743 ALRKDGVIIYSVGVFGSNVTQ--LEEISGKPEMVFYVEN----FDILQHIEDDLV 791 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 43/343 (12%), Positives = 109/343 (31%), Gaps = 37/343 (10%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + L ++ I +A + + + I ++ S+ Sbjct: 85 NRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRDKKQFPPILVVLASAESED 144 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE---HLLNQRYNQKIVSFIP---ALLRI 183 ++ + ++ V + + L+ + N R + + F P +++ Sbjct: 145 DVEEASKALREDGVKIISVGVQKASEEDLKAMATSQFHFNLRTARDLSMFAPNMTQIIKD 204 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 R +++V+ C S + V + + + + LK L +S+ Sbjct: 205 VTQYREAMTDDIIVE-----ACQGPSVADVVFLLDMAINGSQENLDHLKAFLG---ESVS 256 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 L + + +GL+ Y+ S G + V + + D +P + QAY Sbjct: 257 ALDIKENCMRVGLVAYSNETRVISSLSMGVN--KTEVLQRIQ-------DLSPHVGQAYT 307 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + +++ F Q + + + +T + + T + + + Sbjct: 308 G--AALRKTRKEVFSAQRGSRKNQGVPQIAVLVT----HRASDDNVTKAAVN-LRREGVT 360 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + T+ + + Q L+ S Y S + F ++ Sbjct: 361 VFTMGVEGANPEQ--LEKIAS-----YPAEQFTSKLSNFSELA 396 >gi|281353337|gb|EFB28921.1| hypothetical protein PANDA_013248 [Ailuropoda melanoleuca] Length = 471 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 83/223 (37%), Gaps = 40/223 (17%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKED 251 +NSD + N T + L + + F++ I D L + Sbjct: 236 GFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKL 295 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILTSDK 309 +GL+ Y++ V + G ++ + + S + K T + A+K D Sbjct: 296 AQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----LIDN 348 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + K+ I+F TDG + ++ +N KAK+ K+ + + Sbjct: 349 SFTVSSGARPGAQKVG-------IVF-TDGRSQDYINNAA-----KKAKDLGFKMFAVGV 395 Query: 370 -NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 NA + L+ S P E+++ + ++ + + + + + Sbjct: 396 GNAVEDE---LREIASEPVAEHYFYTADFKTINQIGKRLQKKI 435 >gi|45384200|ref|NP_990400.1| integrin alpha-1 [Gallus gallus] gi|2582830|dbj|BAA23160.1| alpha1 integrin [Gallus gallus] Length = 1171 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 74/188 (39%), Gaps = 28/188 (14%) Query: 234 ALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKP 291 ++ FL+S+ + + +G++ Y V T + V + Sbjct: 178 SVTAFLNSLLRNMDIGPQQTQVGIVQYGQTVVHEFYL--NTYSTTEEVMDAALRIRQRGG 235 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A I T+ ++ + R+G QK ++ +TDGE N Sbjct: 236 TQTM----TALGIDTAREEAFTEAHGARRG-------VQKVMVIVTDGE---SHDNYRLQ 281 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHV 401 ++ DK ++ I+ I+I S + L +K+ S P ++ +NV + +L+ + Sbjct: 282 EVIDKCEDENIQRFAIAILGSYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELALVTI 341 Query: 402 FQNISQLM 409 + + + + Sbjct: 342 VEALGERI 349 >gi|126733489|ref|ZP_01749236.1| von Willebrand factor, type A [Roseobacter sp. CCS2] gi|126716355|gb|EBA13219.1| von Willebrand factor, type A [Roseobacter sp. CCS2] Length = 699 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 36/170 (21%), Positives = 56/170 (32%), Gaps = 44/170 (25%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 E+ RP + ++D SGSM TK+ LK + L LD + Sbjct: 338 EVAARPPLNLVFLIDTSGSMD-------------------DPTKLPLLKQSFRLMLDQLR 378 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAY 302 V Y G G E R + + + SL T+ ++QAY Sbjct: 379 PEDQVAIVEYAGSAGQVLVPTSASE--------RTTILQAIQSLGAGGSTNGQGGLEQAY 430 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 R+ ++ +I TDG+ N SN + +K Sbjct: 431 S----------VAEAMREDGEVNR------VILATDGDFNVGLSNPDALK 464 >gi|156743215|ref|YP_001433344.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156234543|gb|ABU59326.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 419 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 36/255 (14%), Positives = 75/255 (29%), Gaps = 49/255 (19%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 V ++ + ++ P +I R + V+D SGSM Sbjct: 5 VAIRASLARPYLTAATMPQVAYLLIEVTPG--QIMTQVRAPVNVCFVIDRSGSM------ 56 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 K K+ ++ A + ++ +D V ++ + R E I Sbjct: 57 --------------KGEKIDRVRRATIRAIEMLDAQDVVS------VVIFDHRTEVLIPA 96 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + K + R T PA++ + + Sbjct: 97 T-PVAKPAELADRVNRVRDSGGTRIAPAIEAGLREI-----------------DKGPSHM 138 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 + +I LTDG+ + + + A + I + + N L++ S Sbjct: 139 VRRLILLTDGQTESESDCLRRA---EDAGRRNVPITALGVGKDWNEDLLIEMANRSGGTA 195 Query: 390 YNVVNADSLIHVFQN 404 + + ++ FQN Sbjct: 196 DYIDRPEKIVDYFQN 210 >gi|86356688|ref|YP_468580.1| hypothetical protein RHE_CH01044 [Rhizobium etli CFN 42] gi|86280790|gb|ABC89853.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 445 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 55/452 (12%), Positives = 131/452 (28%), Gaps = 71/452 (15%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL------AGAS 66 + I NF I+ AL V+ L G + + + +A +AA + + A+ Sbjct: 4 RFIFDRSGNFGIMTALLVVPLLGAAGTAVDFASALSLRTELYAAADAAAVGSITPTSEAA 63 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + +S G S K +K+ + +S + + ++ Sbjct: 64 AQANTMSGDGSLTLGKSEAQKIFFSQMSKKQGDAPVTVDIS------VQKKGDTLSSTVS 117 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + T + I Y +++ + + M Sbjct: 118 FNATMPTTFMQVMGFDEIAVTGAATAQYQTP-SYMDFFMLLDNTPSMGVAATTDDITAMK 176 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT----KMAALKNALLLFLDSI 242 + + D + + C + S+ + + ++ + +A+ + Sbjct: 177 KATANGHDGGKDKNCAFACHIVSEKGVEDKNSYYNVARNNGVTIRIDVVASAVKALMAKA 236 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA----- 297 + + + + ++ + + + D + A Sbjct: 237 KDTQSMPSQFRVAAYT-SGKTAQDAK--------AAKLFKVSDLNYDLGAVAAAANMIKL 287 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVK--------IPSLPFQKFIIFLTDGENNNFKSNVN 349 M YQ SD++ SF + + QK + F+ DG +++K Sbjct: 288 MSIPYQNYYSDQQTSFDEALKGIEGEIKGNIGTGTSNADRQKIVFFVADGVGDSYKPTGC 347 Query: 350 T---------------IKICDKAKENFIKI---VTISINASPNG-------------QRL 378 T C K K+ IK+ T + NG Sbjct: 348 TSPKGANGGRCIEPIDTTYCKKLKDRGIKVAVLYTTYLPLPDNGFYKDWVKPFETRIAAK 407 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 ++ C + P +++ V + + + + + +V Sbjct: 408 MEECAT-PGFYFAVSPTEGIEEAMEALFRKIV 438 >gi|3183041|sp|Q90615|ITA1_CHICK RecName: Full=Integrin alpha-1; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1 gi|497990|gb|AAA59067.1| alpha 1 integrin [Gallus gallus] Length = 285 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 74/188 (39%), Gaps = 28/188 (14%) Query: 234 ALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKP 291 ++ FL+S+ + + +G++ Y V T + V + Sbjct: 79 SVTAFLNSLLRNMDIGPQQTQVGIVQYGQTVVHEFYL--NTYSTTEEVMDAALRIRQRGG 136 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A I T+ ++ + R+G QK ++ +TDGE N Sbjct: 137 TQTM----TALGIDTAREEAFTEAHGARRG-------VQKVMVIVTDGE---SHDNYRLQ 182 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHV 401 ++ DK ++ I+ I+I S + L +K+ S P ++ +NV + +L+ + Sbjct: 183 EVIDKCEDENIQRFAIAILGSYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELALVTI 242 Query: 402 FQNISQLM 409 + + + + Sbjct: 243 VEALGERI 250 >gi|74193348|dbj|BAE20643.1| unnamed protein product [Mus musculus] Length = 682 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 65/184 (35%), Gaps = 25/184 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D + + + +G++ Y V K + + + Sbjct: 191 AFLNDLLKRMDIGPKQTQVGIVQYGANVTHEFNL----NKYSSTEEVLVAANKIGRRGGL 246 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 M A I T+ K+ R+G +K ++ +TDGE N ++ Sbjct: 247 QTM-TALGIDTARKEAFTEARGARRG-------VKKVMVIVTDGE---SHDNYRLKQVIQ 295 Query: 356 KAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQNI 405 ++ I+ +I+I N L +K+ S P ++ +NV + +L+ + + + Sbjct: 296 DCEDENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKAL 355 Query: 406 SQLM 409 + + Sbjct: 356 GERI 359 >gi|313219850|emb|CBY30766.1| unnamed protein product [Oikopleura dioica] Length = 1473 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 57/188 (30%), Gaps = 47/188 (25%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 I++V+D SGS+ A N D +N + T + +K L F +S Sbjct: 1234 NTTISGRLDIQIVIDTSGSLTSAPNKDQVLMNFTNNLANMYDT-INQVKIGLTSFSESS- 1291 Query: 244 LLSHVKEDVYMGLIGYT--TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + M L Y + +W T+ T ++ A Sbjct: 1292 -------VLEMPLDFYNQLELQDGVSNMTW----------------QGSFTNITSGVETA 1328 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 S +I +TDG + + ++ D+AK Sbjct: 1329 LND------------------MDTSDAVDDVMILITDGFQST--NTTLMFQMIDQAKAEG 1368 Query: 362 IKIVTISI 369 ++++ + Sbjct: 1369 VRLIALGF 1376 >gi|187934443|ref|YP_001887479.1| von Willebrand factor type A domain protein [Clostridium botulinum B str. Eklund 17B] gi|187722596|gb|ACD23817.1| von Willebrand factor type A domain protein [Clostridium botulinum B str. Eklund 17B] Length = 1596 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 52/139 (37%), Gaps = 30/139 (21%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLS-----HVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + TK+ LK A F+DS+ +++ +G++ Y N T+ Sbjct: 188 YESYTTKIHELKKAAKNFIDSLTSTKTDGQTPNVKNLKIGIVSYNNSGYINEGLVQVTDS 247 Query: 276 VRQ---YVTRDMDSLIL----KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 R+ + D++ T++ +++A +L + + + Sbjct: 248 DRKNNGNINELKDTIENLRADGGTNTGDGLRKAAYLLNEENEAN---------------- 291 Query: 329 FQKFIIFLTDGENNNFKSN 347 K +IF+ DGE + S+ Sbjct: 292 --KTVIFMGDGEPTYYSSD 308 >gi|153836806|ref|ZP_01989473.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|149749952|gb|EDM60697.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|328474272|gb|EGF45077.1| hypothetical protein VP10329_16235 [Vibrio parahaemolyticus 10329] Length = 418 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 18/105 (17%), Positives = 42/105 (40%), Gaps = 6/105 (5%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++K ++ ++ ++ L + F I + K +++A + A LAGA +V++ + Sbjct: 7 RTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGA--VVADKTED 64 Query: 76 GDRFE----SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 D+ E + + + F + + S F N Sbjct: 65 VDQAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNA 109 >gi|115374996|ref|ZP_01462267.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|310820519|ref|YP_003952877.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|115368023|gb|EAU66987.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|309393591|gb|ADO71050.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 476 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 28/240 (11%), Positives = 71/240 (29%), Gaps = 45/240 (18%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + V+ + R + L++D SGSM Sbjct: 71 PGSSDVFVTVDITGQEVPGARRSPVNLALIIDRSGSMSGY-------------------- 110 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ K A + + +D + ++ Y + V+ + + Sbjct: 111 KLEQAKQAARHLVTLLK------DDDRLAIVHYGSDVKSLPGLQATPANRERMIQYIEGI 164 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ + + A Q + + N +I ++DG+ + Sbjct: 165 WDEGGTNISAGL-LAGQAQVETARSDYRVNR---------------LILISDGQPTEGST 208 Query: 347 N-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQN 404 + + ++ + I + +I + + + L++ + + +A L +FQ Sbjct: 209 DEGSLKQVVKDIRTRGITVSSIGV-GTDFNEDLMQAFAEYGAGSYGFLEDAGKLATLFQK 267 >gi|146302265|ref|YP_001196856.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146156683|gb|ABQ07537.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] Length = 2588 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 42/202 (20%), Positives = 76/202 (37%), Gaps = 36/202 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L +D+SGSM ++ D +T M K+A L FL+ Sbjct: 69 DVVLAIDISGSMGNTISGDF-------------KTSMDYAKDAALAFLNQAKANPQN--- 112 Query: 252 VYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + ++ Y+T I ++ V Q + T+ + ++ L ++ + Sbjct: 113 -RIAIVAYSTTASLKIGLTYLNATGVTQITNQINALQATNSTNIYAGIVRSETELETNGR 171 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 T II LTDG N ++ NT C+ +K + + VT +IN Sbjct: 172 FDCSTARA--------------IILLTDGVTNVTGTSGNTN--CNVSKTS--QCVTDAIN 213 Query: 371 ASPNGQRLLKTCVSSPEYHYNV 392 A+ N + K+ V ++V Sbjct: 214 AATNAKTTTKSSVVYNNQVFSV 235 >gi|54302287|ref|YP_132280.1| hypothetical protein PBPRB0607 [Photobacterium profundum SS9] gi|46915709|emb|CAG22480.1| hypothetical protein PBPRB0607 [Photobacterium profundum SS9] Length = 436 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 42/372 (11%), Positives = 106/372 (28%), Gaps = 47/372 (12%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++K +I L+++ + + V + K +++ ++A L+ A + S Sbjct: 12 RAQKGVVAIFATLAMVVLIGAGALALDVGNLILSKGKLQNLVDSAALSAAKAIDSGSDHA 71 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + ++D F + I + S + ++ Sbjct: 72 AAILAGNAAINNNLILD-------------------GFGSMTIDDTDIHYEFSESLPFDS 112 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 ++++ + V ++ + N + + + I Sbjct: 113 STSTTSSPY---VRVRIEDVDVADYLVAIFNIDMSARSSAVAGPSSSIATTCN------- 162 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VV LS N+ + K + + + ++ G Sbjct: 163 VVPLSICEGDESNTTLSGYYEGSLHVLKASSSKDSAIGSGNFMPMALKDADGNAV---PG 219 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 Y + + + V + ++ PT Y+ + + + Sbjct: 220 ANSYGEALAGSFDAC--LTVVEDEIITSEPGNMVGPTRGIDTRFGVYEGTFKNDENIYPA 277 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 + K + Q + + DG N + T+ + D I + + N Sbjct: 278 DKDTYYDKTKLVEVQT--VEVDDGNGNIIEEYQTTLSLDD--------IYS--FDDYSNN 325 Query: 376 QRLLKTCVSSPE 387 Q ++C+S PE Sbjct: 326 QDF-ESCLSEPE 336 >gi|260781661|ref|XP_002585922.1| hypothetical protein BRAFLDRAFT_90332 [Branchiostoma floridae] gi|229270990|gb|EEN41933.1| hypothetical protein BRAFLDRAFT_90332 [Branchiostoma floridae] Length = 4065 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 46/171 (26%), Gaps = 23/171 (13%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAYQ 303 +G++ Y+ + T V + ++ T + A+ Q Sbjct: 1620 DVSTNLTRVGVVQYSDQTNSEFVL--NTFSTEAEVLAAIAAISYQNGGTSTGAALDYVRQ 1677 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + +I LTDG +++ S A+ I Sbjct: 1678 NVFISASGD-------------RPDAANILIVLTDGVSSDDVSFPAMA-----ARNAGIT 1719 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 I ++ I L+ P SL + + +L+ Y Sbjct: 1720 IYSVGI-GDGVDYNTLQQIAGDPNKVLQATGFSSLDDIGGQLEELVCDATY 1769 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 66/215 (30%), Gaps = 46/215 (21%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + ++D SGS+ K TK+ DL Sbjct: 1839 GGWDLVFLLDGSGSVGSN-----------NFLNVKNFTKLIT------------DLFPVG 1875 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMKQAYQILT 306 +GL+ ++ ++K + + + +D S + T + A+ Q+ Sbjct: 1876 DNATKVGLVQFSDTIQKEFDLR--DYDTKAEILSAIDNISYLGGGTYTGNAIDYVRQVSF 1933 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + ++ +I LTDGE + T A++ I I Sbjct: 1934 NTINGNRGSH-------------PDMLIVLTDGE----SFDPVTFAS-QSARDQGITIFA 1975 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 I + + L+ P+ V + L V Sbjct: 1976 IGV-GTGVDYATLEEIAGDPQKVQQVTDFADLTSV 2009 >gi|162451432|ref|YP_001613799.1| hypothetical protein sce3160 [Sorangium cellulosum 'So ce 56'] gi|161162014|emb|CAN93319.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 404 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 30/155 (19%), Positives = 56/155 (36%), Gaps = 32/155 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVD SGSM+ + DP LK A+L +D + ++ Sbjct: 109 TVMIVVDRSGSMYGSGFWDP-------------------LKTAVLSVVDRL------QDR 143 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V G + +T + G + + + + + + P L + Sbjct: 144 VRFGFLTFTGTANQQCPLLAGADGIALNHHAAIAAAYDEASTVPPG------KLETPTAM 197 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 +F + + P P K+I+F+TDGE + Sbjct: 198 TFNETVVPELLAFPE-PGPKYILFVTDGEPDRCDD 231 >gi|229490509|ref|ZP_04384348.1| putative von Willebrand factor, type A [Rhodococcus erythropolis SK121] gi|229322588|gb|EEN88370.1| putative von Willebrand factor, type A [Rhodococcus erythropolis SK121] Length = 684 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 36/245 (14%), Positives = 79/245 (32%), Gaps = 60/245 (24%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + R L+ V+DLSGSM+ + K+ K + Sbjct: 29 TPTTPPTRAVSAAPAALLM--VMDLSGSMN--------------DNDANGKNKLTGAKQS 72 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGY---TTRVEKNIEPSWGTEKVRQYVTRDMDSL---- 287 L + +GL Y + + + VR+ M + Sbjct: 73 LSRIVG-----DTASSSTPLGLWTYPTAGSNCDPGSFLAGADGGVRKDTDTLMAQVSGLK 127 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + PA++ + L ++ + ++ ++DGE+N ++ Sbjct: 128 ADGGTPTGPALRASVDSLKANGITTAT------------------VVLISDGESNCGQAP 169 Query: 348 VNTIKICDKAKEN-----FIKIVTISINASPNGQRLLKTCVSS--PEYHYNVVNADSLIH 400 CD AK+ + + + S G+ L+ C++S + ++ + D + Sbjct: 170 ------CDTAKQIVAEGFDVTVEALGFQLSGQGRTELE-CIASTTGGRYSDIADVDEMQK 222 Query: 401 VFQNI 405 + + Sbjct: 223 RLKEL 227 >gi|148676058|gb|EDL08005.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Mus musculus] Length = 918 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 44/222 (19%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D+S SM A K+ + AL+ L+ + Sbjct: 262 VVFVLDISASMVGA--------------------KLQQTREALVTILNDLRPQD------ 295 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ + +R TD A++ A ++L + Sbjct: 296 RFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQAAIKLLNNYVA 355 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---FIKIVTI 367 ++ + IIFLTDG+ NT+KI KE I I T+ Sbjct: 356 QNDIEDRSVS-----------LIIFLTDGKPTF--GETNTLKILSNTKEATRGQICIFTV 402 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 403 GIGDDVDFKLLEKLSLENCGLTRRVHEEDKAGAQLIGFYDEI 444 >gi|72162079|ref|YP_289736.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71915811|gb|AAZ55713.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 315 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 73/236 (30%), Gaps = 53/236 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R I + +D+S SM D ++ + K + F++S+ Sbjct: 83 RERATILVAIDVSPSMAA---------------TDVAPDRLTSAKESAQSFIESL---PP 124 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 GL+ +++ P T+ + + I T + + Q + S Sbjct: 125 RFNV---GLVAFSSVATVVASP---TQDHQAVADSIANLTISSGTAIGEGVFASLQAIRS 178 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + K P I+ L+DGEN + + +A+ + + TI Sbjct: 179 FDE------------KATDDPPPAAIVLLSDGENTSGRPVAAAAD---EARAAGVPVSTI 223 Query: 368 SIN------------ASPN-GQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLM 409 + N + LK ++ Y + L V+ +I + Sbjct: 224 AFGTGVSIIEIEGHYVPANIDKETLKELAMTTGGRFYEAESTGELKDVYADIGSSL 279 >gi|261253067|ref|ZP_05945640.1| hypothetical protein VIA_003092 [Vibrio orientalis CIP 102891] gi|260936458|gb|EEX92447.1| hypothetical protein VIA_003092 [Vibrio orientalis CIP 102891] Length = 424 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 22/126 (17%), Positives = 51/126 (40%), Gaps = 14/126 (11%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + ++K ++ +S+++F+ + I V + K +++A + LAGA+ + + + Sbjct: 19 LRNQKGLTLVVMTMSMVAFITIAALSIDVSHFVVNKTRLQNAVDTIALAGAT-VANRTNE 77 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 GD +I K+ + ES T+ N +N I + N Sbjct: 78 KGDTDTAIIESYKKVI-------------ESPGNDEIELTATDDGNGLNLLSIEYSDSPN 124 Query: 135 NRLDSS 140 + ++ Sbjct: 125 SGFSTT 130 >gi|257062762|ref|YP_003142434.1| Mg-chelatase subunit ChlD [Slackia heliotrinireducens DSM 20476] gi|256790415|gb|ACV21085.1| Mg-chelatase subunit ChlD [Slackia heliotrinireducens DSM 20476] Length = 2281 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 75/238 (31%), Gaps = 40/238 (16%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV--KE 250 + +++D SGSM +T+MAA +NA+ S+ + + Sbjct: 76 VIVILDNSGSMDTRTGGYG------------SQTRMAAAQNAVNNLARSLYAYNTTEFPD 123 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 V M L+G++T P+ V R T+ A++ A I +D Sbjct: 124 LVQMALVGFSTTGSVVQGPTNSYNTFSGAVNRLDAD---GGTNWEDALQDAAGINFNDDD 180 Query: 311 RSF-----FTNFFRQGVKIPSLPFQKFIIFL-------TDGENNNFKSNVNTIKIC---- 354 ++ N + + P + +D + + TI C Sbjct: 181 PTYVIFVSDGNPTFRNTRGNYNPMDNYYYNTWGVYGNGSDSQTVAGIAAATTIARCYEHA 240 Query: 355 -DKAKENFIKI-----VTI-SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 D A+ + TI + + L + +++ N L + I Sbjct: 241 VDDAESLATSVGADHFYTIGAYGNVDRMRSLTTDAGAPAGNYFSAANTTDLQNALAAI 298 >gi|218462279|ref|ZP_03502370.1| hypothetical protein RetlK5_23628 [Rhizobium etli Kim 5] Length = 347 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 43/323 (13%), Positives = 92/323 (28%), Gaps = 57/323 (17%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I+ NF I+ AL ++ L G + + + +A +AA + + Sbjct: 9 ISDRSGNFGIMTALLMVPLLGTAGMAVDFAHALSLRTQLYAAADAAAVGS---IAEKSGA 65 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + SN DA+ + + L+ ++ + Sbjct: 66 VAAAMAMNSNGTVSLGKTDARNIFMSQMSGELA----------------EVQVDLGIDVT 109 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + N+ + + V T+ +L + I E Sbjct: 110 KTANKLNSQVSFTATVPTT-------FMQILGRDSIT-----ISGTATAEYQTAAFMDFY 157 Query: 195 LVVDLSGSM----------------HCAMNSDPEDVNSAPICQDKK---RTKMAALKNAL 235 +++D + SM CA D K ++ ++ A Sbjct: 158 ILLDNTPSMGVGATPDDVSKLEAKAGCAFACHQMDKTINNYTIAKSLGVAMRIDVVRQAT 217 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D+ + MG+ + T+ E + + +T+ + TD+ Sbjct: 218 QALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTT--ISGLTSDLTKVKNY-----TDAV 270 Query: 296 PAMKQAYQILTSDKKRSFFTNFF 318 M YQ +D+ +F + Sbjct: 271 DLMTIPYQNYNNDQITNFDSAMT 293 >gi|326500988|dbj|BAJ98725.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326527981|dbj|BAJ89042.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 707 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 52/169 (30%), Gaps = 43/169 (25%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 A + R + V+D+SGSM K+A LK A+ Sbjct: 196 TVAAGSDKPQPRAPLDLVTVLDVSGSMSGH--------------------KLALLKQAMR 235 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDSLILKPTDST 295 +D++ + ++ +++ + + V + T+ Sbjct: 236 FVIDNLGPDD------RLSVVSFSSEARRLTRLARMSDAGKALSVNAVESLVARGGTNIA 289 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 ++ A ++L RQ S ++ L+DG++ Sbjct: 290 EGLRTAAKVL-----------DERQHRNAVSS-----VVLLSDGQDTYT 322 >gi|326489689|dbj|BAK01825.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 691 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 52/169 (30%), Gaps = 43/169 (25%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 A + R + V+D+SGSM K+A LK A+ Sbjct: 180 TVAAGSDKPQPRAPLDLVTVLDVSGSMSGH--------------------KLALLKQAMR 219 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDSLILKPTDST 295 +D++ + ++ +++ + + V + T+ Sbjct: 220 FVIDNLGPDD------RLSVVSFSSEARRLTRLARMSDAGKALSVNAVESLVARGGTNIA 273 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 ++ A ++L RQ S ++ L+DG++ Sbjct: 274 EGLRTAAKVL-----------DERQHRNAVSS-----VVLLSDGQDTYT 306 >gi|291000628|ref|XP_002682881.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] gi|284096509|gb|EFC50137.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] Length = 207 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 64/175 (36%), Gaps = 43/175 (24%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 NQ N ++ I A + +E R I V+D SGSM Sbjct: 21 NQTQNMFGMASIKAPIYVEKENRSSLDIIAVLDKSGSMS--------------------- 59 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 K+ +K +LL +D + +G++ + V ++ + ++ ++ Sbjct: 60 DKIELVKKSLLFMIDQMQARD------RLGIVEFDANVSTTLKLTSMDNGGKKQAMNCVN 113 Query: 286 SLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 ++ L T+ + A+ +A+ IL + + ++ I+ TDG Sbjct: 114 NIKLGTTTNISGAIIEAFDILAN---------------RGGNISPTTSILLFTDG 153 >gi|74183702|dbj|BAE24467.1| unnamed protein product [Mus musculus] Length = 952 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 44/222 (19%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D+S SM A K+ + AL+ L+ + Sbjct: 296 VVFVLDISASMVGA--------------------KLQQTREALVTILNDLRPQD------ 329 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ + +R TD A++ A ++L + Sbjct: 330 RFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQAAIKLLNNYVA 389 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---FIKIVTI 367 ++ + IIFLTDG+ NT+KI KE I I T+ Sbjct: 390 QNDIEDRSVS-----------LIIFLTDGKPTF--GETNTLKILSNTKEATRGQICIFTV 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGDDVDFKLLEKLSLENCGLTRRVHEEDKAGAQLIGFYDEI 478 >gi|228991799|ref|ZP_04151737.1| D-amino acid dehydrogenase, large subunit [Bacillus pseudomycoides DSM 12442] gi|228767939|gb|EEM16564.1| D-amino acid dehydrogenase, large subunit [Bacillus pseudomycoides DSM 12442] Length = 453 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 39/241 (16%), Positives = 73/241 (30%), Gaps = 32/241 (13%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + +R + IE+++D SGSM ++ + KM K A+ Sbjct: 144 TKGEQKNKEDRGNYNIEILLDASGSMAGKIDG---------------KMKMDIAKEAIQQ 188 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F+ + +V VY G G +K + V S Sbjct: 189 FVSDLPEAVNVSLRVY-GHKGSNDEKDKTAS-----CGAIENVYTLQKYNQTTLRQSLDG 242 Query: 298 MK-QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICD 355 + + L KRS + + + + ++DG E K+ + Sbjct: 243 FQPVGWTPLAEAIKRS------TETFQSAKANDKNIMYVVSDGVETCGGNPVEEAQKVSN 296 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + + I I ++ LK S + NA L F+ + + R+ Sbjct: 297 SNIKPIMNI--IGFQVDHEAEKQLKEIAEVSKGKYVLANNAKELQDQFKETGKDITSRRL 354 Query: 415 S 415 Sbjct: 355 K 355 >gi|218666515|ref|YP_002427074.1| hypothetical protein AFE_2697 [Acidithiobacillus ferrooxidans ATCC 23270] gi|218518728|gb|ACK79314.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans ATCC 23270] Length = 590 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 29/58 (50%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 E+ + +II A+ + +L + F I + Y + +++ + A +AGA + + S Sbjct: 14 RGERGDIAIIAAIVMPIMILALAFGIDIGHMAYVQRNLQKIADMAAIAGAEDVPNAQS 71 >gi|254414936|ref|ZP_05028700.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196178425|gb|EDX73425.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 576 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 64/204 (31%), Gaps = 45/204 (22%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 K+V I P + ++D+SGSM K Sbjct: 190 NSKHKLVHIGLQGKSISTENLPPSNLVFLLDVSGSMS-------------------DANK 230 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + LK A L +D + V VY G G + P+ G + + + +D L Sbjct: 231 LPLLKEAFRLLVDQLRDEDKVSIVVYAGAAG------TVLPPTPGNQ--KDTILAAIDKL 282 Query: 288 ILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFK 345 T +K AY L D N +I TDG+ N Sbjct: 283 EAGGSTAGGQGIKLAY-KLAQDNFIESGNNR---------------VILATDGDFNVGIS 326 Query: 346 SNVNTIKICDKAKENFIKIVTISI 369 S+ + + ++ +E I + + Sbjct: 327 SDEQLVSLIEEKREQDIFLTVLGF 350 >gi|27369644|ref|NP_766059.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Mus musculus] gi|81873944|sp|Q8BJD1|ITIH5_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor gi|26352482|dbj|BAC39871.1| unnamed protein product [Mus musculus] gi|37589944|gb|AAH43314.2| Inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|38328214|gb|AAH62196.1| Inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|74145221|dbj|BAE22250.1| unnamed protein product [Mus musculus] gi|122889674|emb|CAM13913.1| inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|123858038|emb|CAM26660.1| inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|148676059|gb|EDL08006.1| inter-alpha (globulin) inhibitor H5, isoform CRA_b [Mus musculus] Length = 952 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 44/222 (19%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D+S SM A K+ + AL+ L+ + Sbjct: 296 VVFVLDISASMVGA--------------------KLQQTREALVTILNDLRPQD------ 329 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ + +R TD A++ A ++L + Sbjct: 330 RFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQAAIKLLNNYVA 389 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---FIKIVTI 367 ++ + IIFLTDG+ NT+KI KE I I T+ Sbjct: 390 QNDIEDRSVS-----------LIIFLTDGKPTF--GETNTLKILSNTKEATRGQICIFTV 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGDDVDFKLLEKLSLENCGLTRRVHEEDKAGAQLIGFYDEI 478 >gi|47218989|emb|CAG02027.1| unnamed protein product [Tetraodon nigroviridis] Length = 849 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 65/185 (35%), Gaps = 50/185 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D+SGSM TKM + A+L L+ +D H Sbjct: 271 VVFVIDMSGSMSG--------------------TKMQQTREAMLKILEDLDPEDH----- 305 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-------KPTDSTPAMKQAYQIL 305 G+I + R++ W T + ++++ TD + +A +L Sbjct: 306 -FGIILFDHRIQ-----FWNTSLSKATKENIDEAMVYVKAIQSYGGTDINAPVLKAVDML 359 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF-IKI 364 D+K II LTDG+ N+ +S + I+ KA + + Sbjct: 360 KEDRKAKRLPEKSI-----------DMIILLTDGDPNSGESRIPVIQENVKAAIGGQMSL 408 Query: 365 VTISI 369 ++ Sbjct: 409 FSLGF 413 >gi|148726250|emb|CAN88322.1| matrilin 1 [Danio rerio] gi|148726498|emb|CAN88268.1| matrilin 1 [Danio rerio] Length = 277 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 78/215 (36%), Gaps = 28/215 (13%) Query: 197 VDLSGSMHCAM-NSDPEDVNSAPICQDK-KRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + + +M + N+ P DV + ++ +K + ID LS + + Sbjct: 21 LRQAAAMAAGLCNTKPTDVVFIVDSSRSVRPSEFEQVK---VFLAKVIDGLSVGPDATRV 77 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 G++ Y +RV+ + K + T + A++ A + S+ + Sbjct: 78 GVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVAFSEAEG--- 134 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 G K P + K I +TDG + N+ I +A+E I+I I + Sbjct: 135 ------GRKSPDIS--KVAIIVTDG---RPQDNIRDIA--ARAREAGIEIFAIGVG--RV 179 Query: 375 GQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 L+ S P ++ V + + L FQ Sbjct: 180 DMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 >gi|47208180|emb|CAF89812.1| unnamed protein product [Tetraodon nigroviridis] Length = 1636 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 36/314 (11%), Positives = 101/314 (32%), Gaps = 52/314 (16%) Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 +KE L + ++++ + R + + + + + + Sbjct: 560 VKEYLVVITDGNSTDKVKDPADKLRAQGVVVYAIGVKDAVEKELLEISGEPQRTFYVNNF 619 Query: 162 EHLLNQR--YNQKIVSFIPALLRIEMG--ERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 + L + I S + L + + + ++D SGS+ Sbjct: 620 DALKPIKDDIITDICSTDGSDLSLLSTVCKDVPGDLIFLIDSSGSI-------------- 665 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + KM +L + + K+ V++G++ Y+T + + Sbjct: 666 ---YPEDYQKMKDFMKSL------VQKSNIGKDQVHVGVLQYSTEQKLVFPLI--QYYTK 714 Query: 278 QYVTRDMDSL--ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 +++ +D + I T + A+ + + ++ ++ Sbjct: 715 DQLSKAIDDMQQIGGGTHTGEAI--------------AVVSKYFDAQNGGRPDLKQRLVV 760 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 +TDGE + +V + + + + +I + A+ Q L + Y + Sbjct: 761 VTDGE---SQDDVKLPA--EALRAKGVIVYSIGVVAANTSQLL--EISGDADRMYAERDF 813 Query: 396 DSLIHVFQNISQLM 409 D+L + + ++ + Sbjct: 814 DALKDLEKQMALEI 827 >gi|322419943|ref|YP_004199166.1| hypothetical protein GM18_2437 [Geobacter sp. M18] gi|320126330|gb|ADW13890.1| Protein of unknown function DUF2134, membrane [Geobacter sp. M18] Length = 351 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 52/166 (31%), Gaps = 11/166 (6%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M R R +EK + AL ++ + + + + K +++A +A Sbjct: 1 MDAFRRIR------KLNEKGFAIVYIALMLVVLVAFVSLAVDMGYMFVAKGQLQNAADAG 54 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 LAG +K+ ++ + + +A + K + V + Sbjct: 55 ALAGVAKLSDTVT--ARQSAKLFAERNKAAGESVKVALNETNSAD---GDIVVGYWDKVT 109 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 S+ + +AN + T + F +LN Sbjct: 110 RTMSATVPTGKVANAVKVVARRTTETGTGISAENKQVDLFFGQVLN 155 >gi|114557515|ref|XP_524757.2| PREDICTED: calcium-activated chloride channel regulator 4 isoform 2 [Pan troglodytes] Length = 919 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 74/238 (31%), Gaps = 64/238 (26%) Query: 195 LVVDLSGSMHCA--MNSDPEDVNSAPICQDKKRTKMAAL----KNALLLFLDSIDLLSHV 248 LV+D SGSM +N + + + + + + ++ L I Sbjct: 309 LVLDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATVVNKLIQIKSSD-E 367 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + + GL Y T +K A+Q++ Sbjct: 368 RNTLMAGLPTY----------------------------PRGGTSICSGIKYAFQVIGEL 399 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTI 367 + + ++ LTDGE+N S C D+ K++ + I Sbjct: 400 HSQLDGSE----------------VLLLTDGEDNTASS-------CIDEVKQSGAIVHFI 436 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNA--DSLIHVFQNIS---QLMVHRKYSVILKG 420 ++ + + + ++ + Y A + LI F ++ + + + KG Sbjct: 437 ALGRDADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKG 494 >gi|47218988|emb|CAG02026.1| unnamed protein product [Tetraodon nigroviridis] Length = 1039 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 65/185 (35%), Gaps = 50/185 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D+SGSM TKM + A+L L+ +D H Sbjct: 435 VVFVIDMSGSMSG--------------------TKMQQTREAMLKILEDLDPEDH----- 469 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-------KPTDSTPAMKQAYQIL 305 G+I + R++ W T + ++++ TD + +A +L Sbjct: 470 -FGIILFDHRIQ-----FWNTSLSKATKENIDEAMVYVKAIQSYGGTDINAPVLKAVDML 523 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF-IKI 364 D+K II LTDG+ N+ +S + I+ KA + + Sbjct: 524 KEDRKAKRLPEKSI-----------DMIILLTDGDPNSGESRIPVIQENVKAAIGGQMSL 572 Query: 365 VTISI 369 ++ Sbjct: 573 FSLGF 577 >gi|315502365|ref|YP_004081252.1| von willebrand factor type a [Micromonospora sp. L5] gi|315408984|gb|ADU07101.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 319 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 73/237 (30%), Gaps = 51/237 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + VD+S SM D ++ A K A F + Sbjct: 83 RERATVMVAVDVSTSMLAG---------------DVDPDRLTAAKEAGRRF------VDG 121 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQAYQILT 306 + ++ +GL+ + + PS E + + R + ++ T A+ + + Sbjct: 122 LPDEFNVGLVAFAGSAAVLVPPSTDREALHDGIGRLAEGITGVQGTAIGEAISTSLGAVK 181 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + P II L+DG N + + D+A + + T Sbjct: 182 SLDATAA------------KDPPPARIIILSDGANTSGMDPMEAA---DQAVAAKVPVHT 226 Query: 367 ISI--------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 IS +GQ L + + L V+++I + Sbjct: 227 ISFGTPGGSVDRGGRAIQVPVDGQTLRAVAEQTGGGFHEASTTAELKDVYEDIGTSV 283 >gi|302865820|ref|YP_003834457.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|302568679|gb|ADL44881.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] Length = 319 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 73/237 (30%), Gaps = 51/237 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + + VD+S SM D ++ A K A F + Sbjct: 83 RERATVMVAVDVSTSMLAG---------------DVDPDRLTAAKEAGRRF------VDG 121 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQAYQILT 306 + ++ +GL+ + + PS E + + R + ++ T A+ + + Sbjct: 122 LPDEFNVGLVAFAGSAAVLVPPSTDREALHDGIGRLAEGITGVQGTAIGEAISTSLGAVK 181 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + P II L+DG N + + D+A + + T Sbjct: 182 SLDATAA------------KDPPPARIIILSDGANTSGMDPMEAA---DQAVAAKVPVHT 226 Query: 367 ISI--------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 IS +GQ L + + L V+++I + Sbjct: 227 ISFGTPGGSVDRGGRAIQVPVDGQTLRAVAEQTGGGFHEASTTAELKDVYEDIGTSV 283 >gi|159473306|ref|XP_001694780.1| flagellar associated protein [Chlamydomonas reinhardtii] gi|158276592|gb|EDP02364.1| flagellar associated protein [Chlamydomonas reinhardtii] Length = 4349 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 73/236 (30%), Gaps = 34/236 (14%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + I E+ +R + V+D SGSM ++ ++ Sbjct: 958 AVISIKASAEVKQRAHVALTCVLDRSGSMGGE--------------------RIELVRET 997 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTD 293 +D + Y+G++ Y+ V +++ T + R+ + SL L T Sbjct: 998 CHFLIDQLTADD------YLGIVSYSNTVREDVPLLRMTPEARRLAHTMISSLTLHGGTA 1051 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-- 351 ++ + + G S +F TDG+ VN I Sbjct: 1052 LYAGLEAGVKQQMAAASELKALAAAAGGGSDSSRIVHSCFLF-TDGQATTGPCTVNEIMG 1110 Query: 352 --KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQN 404 + I + T + LL+ + +Y + AD + F + Sbjct: 1111 QMTSLQSPADQNITVHTFGF-GDDHSVELLQGVAEAQSGVYYYISCADDIPSGFGD 1165 >gi|218710404|ref|YP_002418025.1| hypothetical protein VS_2441 [Vibrio splendidus LGP32] gi|218323423|emb|CAV19600.1| hypothetical protein VS_2441 [Vibrio splendidus LGP32] Length = 422 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 17/114 (14%), Positives = 40/114 (35%), Gaps = 2/114 (1%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + K ++ ++ +++ FL + + V K +++A ++A LA A+ Sbjct: 1 MLYRVSKSPRKQQGLVVVLVTAALLVFLAVSALAVDVNHMLVNKTRLQNAVDSAALAAAT 60 Query: 67 KMVSNLSRLGDRFESIS--NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 + ++ + E + N + + F I S F T Sbjct: 61 ILDNSKDKDAVDAEIGTALNAMAASTGNHEIDFTTASINIDYSNDPQDFTGTAT 114 >gi|149920875|ref|ZP_01909337.1| hypothetical protein PPSIR1_38721 [Plesiocystis pacifica SIR-1] gi|149818274|gb|EDM77727.1| hypothetical protein PPSIR1_38721 [Plesiocystis pacifica SIR-1] Length = 367 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 36/240 (15%), Positives = 75/240 (31%), Gaps = 63/240 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID-LLSHVKE 250 + L+VD SGSM P + + D ++ ++ +++ Sbjct: 89 TVVLLVDQSGSMWDDFGGQP------------RWVALE------NTLFDPVNGVVKPLED 130 Query: 251 DVYMGLIGYTTR---------VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 V GL Y++ + PS+G ++ L T + ++K Sbjct: 131 QVRFGLALYSSMNGSFGGECPLITEFAPSFGNH--ASLAATFASAMPLDDTPTGDSIKAV 188 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-----NFKSNVN-TIKICD 355 + L + + K I+ TDGE + + + +++ Sbjct: 189 AETLAAFPEDG-----------------PKIIVLATDGEPDTCAVPDPQEGQPLSLEATQ 231 Query: 356 KAKENFIKIVTISI----------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 A ++ I+ IS+ + G L Y +N L+ F+ + Sbjct: 232 AAFDDGIRTFVISVGNQVTDAHLQELANAGVGLPTQGAVENAPFYKTLNPAELVSAFEAV 291 >gi|148258701|ref|YP_001243286.1| hypothetical protein BBta_7530 [Bradyrhizobium sp. BTAi1] gi|146410874|gb|ABQ39380.1| hypothetical protein BBta_7530 [Bradyrhizobium sp. BTAi1] Length = 511 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 69/201 (34%), Gaps = 31/201 (15%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 F + E+AN SIIFA++++ L IG + ++ ++SA +AA +A Sbjct: 18 FRGFLQQALRLGHDERANISIIFAMALLPILSAIGCAVDYTQATRLRSKLQSAADAASVA 77 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 S+ + S+ + +++A + + SL + Sbjct: 78 SISQ---QSLGYNAALQMTSDGTVQVAVEEATKLFNGNAANSLGYTNLSL---------- 124 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 N ++ + + + D F+ ++ R + + Sbjct: 125 ----------NAQVMKTGVKLAATVAFSA--DVPTTFMT-VVGYRKLT-----VTGTSKS 166 Query: 184 EMGERPIFLIELVVDLSGSMH 204 P L++D+SGSM Sbjct: 167 TSSLPPYLDFYLMLDVSGSMG 187 Score = 40.6 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 35/244 (14%), Positives = 66/244 (27%), Gaps = 39/244 (15%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM--AALKNALLLFLDSIDLLSHVKEDV 252 L+ + G + ++ S V S P ++ A+ A+ L + + Sbjct: 254 LLTNNGGKLPSSLLSSLTAVTSCPTDGSDACIQLRADAVGAAVQQLLVTANATQKTPNQF 313 Query: 253 YMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 +GL + + + G+ + +L + A + + Sbjct: 314 RIGLYPFVRYLYAYSPLTASINGSPTTPGTINHAAANLASQLDTGANASLGSGGTHFENA 373 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN------NFKSNVNTIKI-----CDKAK 358 + G S Q ++ +TDG N S N+ C K Sbjct: 374 FPTMNGIITSVGDGSASNKTQPYVFLITDGAQNPQVYWNGSWSGSNSATTMDTSKCTTLK 433 Query: 359 ENFIKIVTISINASPNGQR---------------------LLKTCVSSPEYHYNVVNADS 397 I IV++ Q L+ C SP Y Y + Sbjct: 434 SRGI-IVSVLYIPYQPIQNPTSFANSEDFYANANIPKIPPSLQACA-SPGYFYTANSPAD 491 Query: 398 LIHV 401 + Sbjct: 492 ITAA 495 >gi|116003875|ref|NP_001070294.1| anthrax toxin receptor 2 [Bos taurus] gi|115305014|gb|AAI23758.1| Anthrax toxin receptor 2 [Bos taurus] gi|296486409|gb|DAA28522.1| anthrax toxin receptor 2 [Bos taurus] Length = 488 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 56/216 (25%), Gaps = 49/216 (22%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + F + V+D SGS+ V + Sbjct: 32 SAQEQPSCHGAFDLYFVLDKSGSVANNWIEIYNFVQQLTERFVSPQ-------------- 77 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + + I ++++ + + K+ + + + T +K Sbjct: 78 ------------MRLSFIVFSSQATIILPLTGDRGKISEGLDNLKHVSPVGETYIHEGLK 125 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKA 357 A Q K L II LTDG + KI + Sbjct: 126 LA----------------NEQIEKARDLKTSSIIIALTDGKLDGLVPSYAEKEAKI---S 166 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 + ++ + + Q L+ S E + V Sbjct: 167 RSLGARVYCVGVL--DFEQAQLERIADSKEQVFPVK 200 >gi|33592721|ref|NP_880365.1| hypothetical protein BP1639 [Bordetella pertussis Tohama I] gi|33572367|emb|CAE41926.1| putative exported protein [Bordetella pertussis Tohama I] gi|332382136|gb|AEE66983.1| hypothetical protein BPTD_1619 [Bordetella pertussis CS] Length = 336 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 42/231 (18%), Positives = 78/231 (33%), Gaps = 43/231 (18%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P+ I LVVD+S SM D Q + ++ A++ + F+D Sbjct: 89 EPMRDILLVVDISQSM---------DSEDFRDAQGRPASRWQAVQAVVGDFIDK------ 133 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + D +GLI + + +R + R + T A+ ++L Sbjct: 134 -RPDDRLGLIVFGAGAYPQAPLTRDHAALRLLLQRTAVGMAGPNTALGDAIGLGIRMLDH 192 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 +R K +I LTDG N+ S V + + A ++ + + TI Sbjct: 193 AGER------------------DKILILLTDG--NDTASAVPPARAAELAAQHRVVVHTI 232 Query: 368 SINASPNGQR------LLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 I L+ + + + SL V+ + ++ H Sbjct: 233 GIGDPAASGEDRVDFDALRDIARIAGGRFFRARDQASLQEVYATLDRITPH 283 >gi|162449101|ref|YP_001611468.1| hypothetical protein sce0831 [Sorangium cellulosum 'So ce 56'] gi|161159683|emb|CAN90988.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 377 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 58/184 (31%), Gaps = 32/184 (17%) Query: 191 FLIELVVDLSGSM-HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + +++D SGSM +++ A + + L S+ + Sbjct: 92 LDMLILLDRSGSMLGSKWVGVTNALSAFVT---------DAASAGMNVGLTYFPRSSNGQ 142 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 D Y E T ++ + + T M+ A Q + ++ Sbjct: 143 SDCNH--TSYDELAVAIGELPVNTPELTASI---QSTSPGGGT----PMRPALQGVLTNA 193 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV--NTIKICDKA-KENFIKIVT 366 N + ++ TDG+ + N +T ++ +A + N ++ Sbjct: 194 TAYKDANPSH----------KVIVVLATDGDPSGCSGNTVASTAEMAQRALRYNGVQTYV 243 Query: 367 ISIN 370 +++ Sbjct: 244 VAVQ 247 >gi|147776143|emb|CAN69721.1| hypothetical protein VITISV_014218 [Vitis vinifera] Length = 686 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 63/175 (36%), Gaps = 43/175 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +R + V+D+SGSM +K++ LK A+ + ++ Sbjct: 199 DRAPIDLVAVLDVSGSMAG--------------------SKLSLLKRAVCFLIQNLGPSD 238 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQIL 305 + ++ +++ + ++ R+ ++SL T+ +K+ ++L Sbjct: 239 ------RLSIVSFSSTARRIFPLRRMSDNGREAAGLAINSLXSSGGTNIVEGLKKGVRVL 292 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 +++ + II L+DG++ NVN + A N Sbjct: 293 EERSEQNPVAS----------------IILLSDGKDTYNCDNVNRRQTSHCASSN 331 >gi|45361321|ref|NP_989238.1| matrilin 2 [Xenopus (Silurana) tropicalis] gi|39645939|gb|AAH63920.1| matrilin 2 [Xenopus (Silurana) tropicalis] Length = 839 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 34/172 (19%), Positives = 60/172 (34%), Gaps = 22/172 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + L + +GL+ Y + V+ +K+ + T + A Sbjct: 75 LITMLKFLDIGPDTTRVGLLQYGSTVKNEFSLKTYKKKMDIERAVKRMMHLATGTMTGLA 134 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 ++ A I S+ + Q V + + +TDG + + I KA Sbjct: 135 IQYAMNIAFSESE---GARPLNQHVP-------RIAMIVTDGRPQDPVAE-----IAAKA 179 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADS---LIHVFQN 404 + + I I + + LKT S P E+ + V N L VFQN Sbjct: 180 RNSGILIFAVGVG--RVDMSTLKTIGSEPHTEHVFLVANFSQIETLTTVFQN 229 >gi|59857769|gb|AAX08719.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Bos taurus] Length = 916 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 53/165 (32%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 276 VIFVIDKSGSMMGR--------------------KIKQTREA---LIKILDDLSPHDQF- 311 Query: 253 YMGLIGYTTRVEKNIEP-----SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI +++ + + + Y T T+ AM A Q+L Sbjct: 312 --DLISFSSEATTWKPLLVPASTENVNEAKSYATGIQAQ---GGTNINDAMLMAVQLLEK 366 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + S+ II LTDG+ ++N + I+ Sbjct: 367 ANQEEL--------LPEGSITL---IILLTDGDPTVGETNPSNIQ 400 >gi|262183593|ref|ZP_06043014.1| hypothetical protein CaurA7_06346 [Corynebacterium aurimucosum ATCC 700975] Length = 604 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 43/221 (19%), Positives = 73/221 (33%), Gaps = 59/221 (26%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 +V D SGSM +T++ A K+A F+ + + Y Sbjct: 1 MVVFDSSGSM--------------ITNDAGGQTRIDAAKDAARTFITEAGDDAPLGLVTY 46 Query: 254 MG---------LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 G G + +EK+ ++ T ++++A Sbjct: 47 GGNTGEAPEDEAAGCQDITVVTPPEAGNSEKMIAHMDGLQ---PRGFTPIGESLRKAAAE 103 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA---KENF 361 L + +RS II ++DG + +CD A KE Sbjct: 104 LPKEGQRS--------------------IILVSDG-----VATCTPPPVCDVAKELKEQG 138 Query: 362 I--KIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 I I T+ N P Q+ L+ C++ + N +ADSL Sbjct: 139 IDLVINTVGFNVEPEAQQELQ-CIADATGGTYANASDADSL 178 >gi|24375056|ref|NP_719099.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24349804|gb|AAN56543.1|AE015791_7 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 621 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 60/187 (32%), Gaps = 43/187 (22%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 E + ++D+SGSM K+ L+ AL + +D Sbjct: 226 SKAELGASNLVFLLDVSGSMASD-------------------DKLPLLQTALKMLTQQLD 266 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 V VY G G ++ K+ Y + T+ ++ AYQ Sbjct: 267 EQDKVSIVVYAGAAG--VVLDGAAG---NDIKILTYALEQL--TAGGSTNGAEGIQLAYQ 319 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-NTIKICDKAKENFI 362 ++ + +I TDG+ N +N+ + + + K++ I Sbjct: 320 -------------LAQKHFVKGGINR---VILATDGDFNVGTTNLDELVDLVEVQKKHGI 363 Query: 363 KIVTISI 369 + T+ Sbjct: 364 GLTTLGF 370 >gi|42521785|ref|NP_967165.1| hypothetical protein Bd0148 [Bdellovibrio bacteriovorus HD100] gi|39574315|emb|CAE77819.1| hypothetical protein predicted by Glimmer/Critica [Bdellovibrio bacteriovorus HD100] Length = 739 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 32/80 (40%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R F + + +++ ++ AL L +I V + K +++++ + A GA K Sbjct: 5 RSTFIQNLNNKRGQIALFVALIFQILFLFFAMVINVGLLVHHKINLQNSVDLAAYYGAMK 64 Query: 68 MVSNLSRLGDRFESISNHAK 87 ++ + I K Sbjct: 65 QAEGMNVIAHTNYQIRQSWK 84 >gi|226504618|ref|NP_001148048.1| protein binding protein [Zea mays] gi|195615516|gb|ACG29588.1| protein binding protein [Zea mays] Length = 696 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 55/166 (33%), Gaps = 27/166 (16%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 ++ + F + + D+ + Q I + A R + V Sbjct: 203 VEIKTYSEFSAIQQSSQDDFAVLIHLKAPYANPEQVIGRPVNATSVGYPTARAPVDLVTV 262 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 +D+SGSM TK+A LK A+ + + + + Sbjct: 263 LDVSGSMAG--------------------TKLALLKRAMGFVIQHLGPSD------RLSV 296 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQA 301 I +++ + + RQ + ++SL T+ A+K+A Sbjct: 297 IAFSSTARRLFHLQRMSHSGRQQALQTVNSLVASGGTNIADALKKA 342 >gi|254519993|ref|ZP_05132049.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] gi|226913742|gb|EEH98943.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] Length = 960 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 46/303 (15%), Positives = 102/303 (33%), Gaps = 57/303 (18%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 +++ + + E I +++ + A + + E L S V + ++ N Sbjct: 296 LINGVKGDSEALEGILSNSGANIKKIAPSASPSTLNELLEYKSIVLNDVHRDDLSNGFMD 355 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 ++ + F D Y+ +E +L +++ E Sbjct: 356 NIEAYVKD--YGGGLITFGGEDSYALGGYKDTSLEKVLPVYMDKR-----------GKNE 402 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P I L++D SGSM +K+ K A + L+++ + Sbjct: 403 VPAISINLIIDKSGSMSAEGGGV---------------SKLTLAKEAAMKALENLREVDE 447 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +I + ++ + G ++ + + + I T PA++Q Y Sbjct: 448 IS------VIAFDDTYDEVVPLQKVGDKEAIKELISGIQ--IRGGTSIYPALEQGYN--- 496 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 +++ S K I LTDG++ N T+ +N I + T Sbjct: 497 ---------------MQMQSSAKIKHTILLTDGQDGYGLDNYATLLQ--NFIDNNITLST 539 Query: 367 ISI 369 +++ Sbjct: 540 VAV 542 >gi|198284406|ref|YP_002220727.1| membrane protein [Acidithiobacillus ferrooxidans ATCC 53993] gi|198248927|gb|ACH84520.1| membrane protein [Acidithiobacillus ferrooxidans ATCC 53993] Length = 596 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 29/58 (50%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 E+ + +II A+ + +L + F I + Y + +++ + A +AGA + + S Sbjct: 20 RGERGDIAIIAAIVMPIMILALAFGIDIGHMAYVQRNLQKIADMAAIAGAEDVPNAQS 77 >gi|56797853|emb|CAG26904.1| matrilin-1 [Danio rerio] Length = 277 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 78/215 (36%), Gaps = 28/215 (13%) Query: 197 VDLSGSMHCAM-NSDPEDVNSAPICQDK-KRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + + +M + N+ P DV + ++ +K + ID LS + + Sbjct: 21 LRQAAAMAAGLCNTKPTDVVFIVDSSRSVRPSEFEQVK---VFLAKVIDGLSVGPDATRV 77 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 G++ Y +RV+ + K + T + A++ A + S+ + Sbjct: 78 GVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVAFSEAEG--- 134 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 G K P + K I +TDG + N+ I +A+E I+I I + Sbjct: 135 ------GRKSPDIS--KVAIIVTDG---RPQDNIRDIA--ARAREAGIEIFAIGVG--RV 179 Query: 375 GQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 L+ S P ++ V + + L FQ Sbjct: 180 DMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 >gi|332254886|ref|XP_003276564.1| PREDICTED: integrin alpha-1 [Nomascus leucogenys] Length = 1179 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 30/190 (15%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +++ FL D ++ + + +G++ Y V ++ + + Sbjct: 187 DSVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTKEVLVAAKKIVQRGG 245 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + + A + ++ + +K ++ +TDGE N Sbjct: 246 RQTMTALGIDTARKEAFTEA-------------RGARRGVKKVMVIVTDGE---SHDNHR 289 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLI 399 K+ ++ I+ +I+I S N L +K+ S P ++ +NV + +L+ Sbjct: 290 LKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALV 349 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 350 TIVKTLGERI 359 >gi|317403331|gb|EFV83845.1| hemolysin-type calcium-binding region [Achromobacter xylosoxidans C54] Length = 1141 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 55/186 (29%), Gaps = 37/186 (19%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + +E G+ + I +VVD SGSM A + K T+M +AL Sbjct: 620 GTVTTVEPGKN--YNIAIVVDTSGSMSEASGT-------------KGLTRMQLTIDALKN 664 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 +++ + +G + V + T+ A Sbjct: 665 LANTLKGHDGIVNVALIGFESTASTKYTINGL--NASNVGDLIKAIEKLSASGGTNYEGA 722 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE--------NNNFKSNVN 349 +A + S F+ FLTDG+ N ++ S + Sbjct: 723 FDEAVKWFNKQPTSSNGQ------------AFENVTYFLTDGDPTFSNRGSNGDWWSGGS 770 Query: 350 TIKICD 355 T D Sbjct: 771 TTNYYD 776 >gi|255537858|ref|XP_002509994.1| protein binding protein, putative [Ricinus communis] gi|223550695|gb|EEF52181.1| protein binding protein, putative [Ricinus communis] Length = 767 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 63/164 (38%), Gaps = 43/164 (26%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 R + V+D+SGSM A K+ LK A+ L + S+ Sbjct: 353 AHRAPIDLVTVLDVSGSMTGA--------------------KLQMLKRAMRLVISSLGSA 392 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQI 304 + ++ +++ ++ + T ++ R +D L+ T A+++A ++ Sbjct: 393 D------RLSIVAFSSVPKRLLPLRRMTAHGQRAARRIIDRLVCGQGTSVGDALRKATKV 446 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 L ++R+ + I+ L+DG++ +++ Sbjct: 447 LEDRRERNPVAS----------------IMLLSDGQDERVQTSS 474 >gi|119575262|gb|EAW54867.1| hCG2002731, isoform CRA_d [Homo sapiens] Length = 768 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 69/190 (36%), Gaps = 30/190 (15%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +++ FL D ++ + + +G++ Y V + + + Sbjct: 187 DSVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAAKKIVQRGG 245 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + + A + ++ + +K ++ +TDGE N Sbjct: 246 RQTMTALGIDTARKEAFTEA-------------RGARRGVKKVMVIVTDGE---SHDNHR 289 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLI 399 K+ ++ I+ +I+I S N L +K+ S P ++ +NV + +L+ Sbjct: 290 LKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALV 349 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 350 TIVKTLGERI 359 >gi|153791389|ref|NP_001028400.2| integrin alpha-1 precursor [Mus musculus] gi|189442109|gb|AAI67237.1| Integrin alpha 1 [synthetic construct] Length = 1179 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 65/184 (35%), Gaps = 25/184 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D + + + +G++ Y V K + + + Sbjct: 191 AFLNDLLKRMDIGPKQTQVGIVQYGANVTHEFNL----NKYSSTEEVLVAANKIGRRGGL 246 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 M A I T+ K+ R+G +K ++ +TDGE N ++ Sbjct: 247 QTM-TALGIDTARKEAFTEARGARRG-------VKKVMVIVTDGE---SHDNYRLKQVIQ 295 Query: 356 KAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQNI 405 ++ I+ +I+I N L +K+ S P ++ +NV + +L+ + + + Sbjct: 296 DCEDENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKAL 355 Query: 406 SQLM 409 + + Sbjct: 356 GERI 359 >gi|119575261|gb|EAW54866.1| hCG2002731, isoform CRA_c [Homo sapiens] Length = 766 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 69/190 (36%), Gaps = 30/190 (15%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +++ FL D ++ + + +G++ Y V + + + Sbjct: 187 DSVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAAKKIVQRGG 245 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + + A + ++ + +K ++ +TDGE N Sbjct: 246 RQTMTALGIDTARKEAFTEA-------------RGARRGVKKVMVIVTDGE---SHDNHR 289 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLI 399 K+ ++ I+ +I+I S N L +K+ S P ++ +NV + +L+ Sbjct: 290 LKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALV 349 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 350 TIVKTLGERI 359 >gi|304347707|gb|ADM25314.1| MIC2-like protein 1 [Neospora caninum] gi|325118031|emb|CBZ53582.1| hypothetical protein NCLIV_033690 [Neospora caninum Liverpool] Length = 756 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 71/180 (39%), Gaps = 21/180 (11%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTR---VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 LD IDL+ E+V++ ++ + V +P +++ + + + T + Sbjct: 91 VLDFIDLVPISSEEVHLSVVTFADSPQDVFTFKQPQATNKQLAKEAFKYLRYRRGGSTAT 150 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + +A + LT + V K ++ +TDGE++ TI+ Sbjct: 151 DKGLIRARRYLT-------------RPVYGTRANVPKVLVLMTDGESDRHYD---TIQAA 194 Query: 355 DKAKENFIKIVTISI-NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 D+A+ I + + + A+P R + C P + + N + L+ +I + + Sbjct: 195 DQARAEGISVFVVGVGMANPVECRGVCGCGRYGPCPQFIMSNWNELVQTVDSIMGEVCKK 254 >gi|225465131|ref|XP_002271188.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 768 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 63/175 (36%), Gaps = 43/175 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +R + V+D+SGSM +K++ LK A+ + ++ Sbjct: 281 DRAPIDLVAVLDVSGSMAG--------------------SKLSLLKRAVCFLIQNLGPSD 320 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQIL 305 + ++ +++ + ++ R+ ++SL T+ +K+ ++L Sbjct: 321 ------RLSIVSFSSTARRIFPLRRMSDNGREAAGLAINSLTSSGGTNIVEGLKKGVRVL 374 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 +++ + II L+DG++ NVN + A N Sbjct: 375 EERSEQNPVAS----------------IILLSDGKDTYNCDNVNRRQTSHCASSN 413 >gi|307245403|ref|ZP_07527491.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306853744|gb|EFM85961.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 538 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 40/269 (14%), Positives = 80/269 (29%), Gaps = 26/269 (9%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 ++ I E ++++ L + L LI + + + + A+L+ ++ Sbjct: 6 LSQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAE 65 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + + SN + D + + ++ + F + +N I Sbjct: 66 NNNGRKDNDYKLSGSSNKENDSF--DISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPI 123 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 T N +T + S IEH V IP + + Sbjct: 124 CKTV---NNTSGKGHTSSSEVTCTVSGT-----IEHKSWFPLKVGTVEVIPQQVDVASKS 175 Query: 188 RPI--------FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + +V DLSGSM +D + +K+ L+ L Sbjct: 176 KAFKKNTFNIPIDLMVVADLSGSM--------KDGIKGEKLKGGTNSKIYILREVLKELA 227 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 D E +G+ + E E Sbjct: 228 DKSLFTQEANEYNRIGITAFAMGAEHPKE 256 >gi|299529294|ref|ZP_07042734.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni S44] gi|298722738|gb|EFI63655.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni S44] Length = 1449 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 42/315 (13%), Positives = 88/315 (27%), Gaps = 33/315 (10%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 +G + ++ + N +D + N + + S + + Sbjct: 617 SGTFTITASAGLKSLTLDGPDNADATLTLDRLADLVNNPVTLTTSKGTLTLTGYDATTGK 676 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 S + + D +N + + V + + +L + + Sbjct: 677 VSYTYQTSGQQAHTGDDTNVQDHFQITVEDKFGGKATGDLGVLITDTAPSLKPIAESSAL 736 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 G I L +D SGSM+ +++A LK+++ LD Sbjct: 737 SSHGTN----IMLTLDTSGSMNYGSGVYNGWTQL---------SRLAVLKSSVNNLLDKY 783 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 DV + ++ + T + + + V+ T+ A+ A Sbjct: 784 ----GEAGDVRVMIVEFNTSASQKGGGWMSLAEAKALVSGLG---YGGGTNYQTALDTAM 836 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE--NNNFKSNVNTIKICDKAKEN 360 + K+ Q F TDGE +N ++ +N Sbjct: 837 NAWNNSG-----------TGKLEGGNVQNISYFFTDGEPDSNRSVNSAQQATWEKFLADN 885 Query: 361 FIKIVTISINASPNG 375 I I + G Sbjct: 886 HINSYGIGLGTGATG 900 >gi|218509981|ref|ZP_03507859.1| hypothetical protein RetlB5_22275 [Rhizobium etli Brasil 5] Length = 448 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 51/363 (14%), Positives = 105/363 (28%), Gaps = 81/363 (22%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + N +I+ ALS++ L+ +G + + + M+S +AA++A Sbjct: 8 RCLHTLRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAV 67 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 ++ +N + +S+ + NS Sbjct: 68 KQI-NNTEDTDALKQKVSDWFH-------------------------------AQVENSY 95 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 + + +T +N+ S F+ + + ++ Sbjct: 96 ALG---------EIEIDTTNHNITATASGTVPTTFM------KIANIDTVPVSVASAVKG 140 Query: 186 GERPIFLIELVVDLSGSM----------------HCAM-----NSDPEDVNSAPICQDKK 224 + +VVD S SM C ++ + D Sbjct: 141 PATSYLNVYIVVDTSPSMLLAATTAGQSTMYSGIKCQFACHTGDTHTIGKKTYANNYDYS 200 Query: 225 RTK-----MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K +A+ LD ID E + +GL G + + + P+ T+ R Sbjct: 201 TEKGIKLRADVAGDAVREVLDMIDESDSNHERIKVGLYGLGDTLTEVLAPTLSTDIART- 259 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T +T + + + K+ P K ++ LTDG Sbjct: 260 ---RLADSSYGLTSATSKAATYFDVSLATLKQKVGAGGDGTTSGTPL----KLVLLLTDG 312 Query: 340 ENN 342 + Sbjct: 313 VQS 315 >gi|126660809|ref|ZP_01731904.1| hypothetical protein CY0110_12397 [Cyanothece sp. CCY0110] gi|126617906|gb|EAZ88680.1| hypothetical protein CY0110_12397 [Cyanothece sp. CCY0110] Length = 416 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 65/206 (31%), Gaps = 49/206 (23%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 NQ Q+ V+ + + + L++D SGSM K Sbjct: 17 NQSNTQRQVAISLSAVSESSDRTLPLNLGLILDHSGSM-----------------TGKP- 58 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT--RVEKNIEPSWGTEKVRQYVTRD 283 + +K A + ++S+ + ++ + +V +P + V Q + R Sbjct: 59 --IKTVKEAAIRLVESLGSGD------RLSVVAFDHKAKVIVPNQPIDDIKTVNQQIQRL 110 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T MK + + K I LTDGEN + Sbjct: 111 EPA---GGTCIDEGMKLGIKEVALGKDDRVSQ-----------------IFLLTDGENEH 150 Query: 344 FKSNVNTIKICDKAKENFIKIVTISI 369 N +K+ A E I + T+ Sbjct: 151 G-DNERCLKLAQVAAEYNITLNTLGF 175 >gi|29828547|ref|NP_823181.1| hypothetical protein SAV_2005 [Streptomyces avermitilis MA-4680] gi|29605651|dbj|BAC69716.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 420 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 70/227 (30%), Gaps = 34/227 (14%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 A+ + P ++LV+D+SGSM ++MAA K A Sbjct: 25 PAGAVADETTADAPK--VDLVLDVSGSMRARDIDGG--------------SRMAAAKQAF 68 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L E+V +G+ + + + + Y +D Sbjct: 69 NEVL------DATPEEVRLGIRTLGANYPGDDRKTGCKDTAQLYPVSTLDRTEA------ 116 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 K A L+ + + I+ +TDGE+ + + Sbjct: 117 ---KTAVATLSPTGWTPIGPALLKAADDLDGGTGSHRIVLITDGEDTCAPLDPCEVAREI 173 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 AK + I T+ + + +C++ + +V + + L Sbjct: 174 AAKGVGLTIDTLGL-VPNSKLSKQLSCIAEATGGTYTSVEHKEDLTD 219 >gi|123781093|sp|Q3V3R4|ITA1_MOUSE RecName: Full=Integrin alpha-1; AltName: Full=CD49 antigen-like family member A; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1; AltName: CD_antigen=CD49a; Flags: Precursor gi|74186862|dbj|BAE20498.1| unnamed protein product [Mus musculus] Length = 1179 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 65/184 (35%), Gaps = 25/184 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D + + + +G++ Y V K + + + Sbjct: 191 AFLNDLLKRMDIGPKQTQVGIVQYGANVTHEFNL----NKYSSTEEVLVAANKIGRRGGL 246 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 M A I T+ K+ R+G +K ++ +TDGE N ++ Sbjct: 247 QTM-TALGIDTARKEAFTEARGARRG-------VKKVMVIVTDGE---SHDNYRLKQVIQ 295 Query: 356 KAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQNI 405 ++ I+ +I+I N L +K+ S P ++ +NV + +L+ + + + Sbjct: 296 DCEDENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKAL 355 Query: 406 SQLM 409 + + Sbjct: 356 GERI 359 >gi|262195149|ref|YP_003266358.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262078496|gb|ACY14465.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 412 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 42/251 (16%), Positives = 77/251 (30%), Gaps = 50/251 (19%) Query: 183 IEMGERPIF-LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 IE+ PI ++ ++D SGSM + A++NA D Sbjct: 111 IEVETDPIIPTVQFLIDFSGSMDQNFGGI---------------KRSQAVRNA---LFDE 152 Query: 242 ID-LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 D +++ ++ V G YT+ P V ++L D + Sbjct: 153 DDGVVALLQSQVRFGASLYTSFDGNEAPPC----PRLTQVAPAFNNLTALRADIGGPLND 208 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN-FKSNVNT--------- 350 D + + + I+ TDGE ++ + NT Sbjct: 209 PPNA--GDTPTGESIDAIAENFPDNGPNDKPLIVLATDGEPDSCTDPDPNTDPGRAATRR 266 Query: 351 --IKICDKAKENFIKIVTISINASPNGQRLLKTCVS-----------SPEYHYNVVNADS 397 + +A E I++ +S+ + G L+ + P Y N Sbjct: 267 LSEEATQRAFEAGIELYVLSV-GNDVGADHLQRVANAGVGKALDESNDPATVYIGNNQQE 325 Query: 398 LIHVFQNISQL 408 L+ F I + Sbjct: 326 LVDAFSEIIRS 336 >gi|326675264|ref|XP_002665076.2| PREDICTED: collagen alpha-1(XXVIII) chain-like [Danio rerio] Length = 1046 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 32/143 (22%), Positives = 59/143 (41%), Gaps = 24/143 (16%) Query: 252 VYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDK 309 + LI Y++ V N W + + ++ + D+ + T ST A+ A Q+ Sbjct: 87 TRLALIYYSSSVHINQHFNDW--QDLDVFLDQLEDASYIGQGTYSTYAISNATQL----- 139 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 F Q V++ + +TDG ++ N + + + +AK + IKI I + Sbjct: 140 ---FIRETSGQSVRVS--------LLMTDG--SDHPRNPDIMTVVAEAKSHNIKIFAIGL 186 Query: 370 N--ASPNGQRLLKTCVSSPEYHY 390 + A + L+ SSP Y Sbjct: 187 SMRAMDSNSAKLRAVASSPAQQY 209 >gi|332358821|gb|EGJ36643.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1056] Length = 434 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 59/339 (17%), Positives = 106/339 (31%), Gaps = 54/339 (15%) Query: 49 KKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 + SM+ + I S V + S+ G L D K+ I +S Sbjct: 52 MRTSMQ-YVDRTIGKATSVFVLDESKYGKDVRKTEGWNYIGLSPDGKKVINYIWNKSTKS 110 Query: 109 -YSAVFYNTEIQNIVNSSRISMTHMAN-NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 +V + ++ NRL + N T Y + + LN Sbjct: 111 WDESVLGTNSLYDMQLDLEFKADESYQDNRLINYNLTGQYK---NSKNKLSIDTAISALN 167 Query: 167 QRYNQKIVSFIPALLRIEMGERPI-----FLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + V+ + + PI + V D SGSM + + + Sbjct: 168 TKQVISKVAKGKKGVALAYRNDPIEGQVNTAVTFVFDTSGSMGYGLWNQKLEPTD----- 222 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVK-EDVY-MGLIGYTTRVEKNIEPSWGTEKVRQY 279 RT+M LK L +D + + +V V G Y E +E T+ ++ Sbjct: 223 --SRTRMNILKTKANLLVDDLKEIGNVSVNLVRFSGDASY--IQEDFVELDKDTDTIKTK 278 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + S I T+ ++ L + + K+++ LTDG Sbjct: 279 IKALPTSWI---TNPGDGLRYGLVSLQRNPAQL------------------KYVVLLTDG 317 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 N + A + I ++ N NG+++ Sbjct: 318 IPNAY-----------TASPDGIGKYDLTANFPTNGKQI 345 >gi|109087573|ref|XP_001097885.1| PREDICTED: collagen alpha-1(XXII) chain-like [Macaca mulatta] Length = 232 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 66/186 (35%), Gaps = 29/186 (15%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMK 299 +D + +G++ Y+ R E G R+ V L T++ A++ Sbjct: 65 VDTFEVGPDRTRVGVVRYSDRPTTAFEL--GLFGSREEVKAAARRLAYHGGNTNTGDALR 122 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + + R G + +++ I LTDG + + + Sbjct: 123 Y----------ITALSFSPRAGGRPGDRAYKQVAILLTDGRSQDLVLDAAAAA-----HR 167 Query: 360 NFIKIVTISINASPNGQRL---LKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRKY 414 I+I + + G+ L L+ S P + ++V + +++ + + + + + Sbjct: 168 AGIRIFAVGV-----GKALKEELEEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLCESEC 222 Query: 415 SVILKG 420 S +G Sbjct: 223 SRAPRG 228 >gi|197118196|ref|YP_002138623.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197087556|gb|ACH38827.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 331 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 40/252 (15%), Positives = 75/252 (29%), Gaps = 60/252 (23%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 A+ R + + L +DLS SM + + ++AA K L F Sbjct: 75 AVARESQVQSRGMDLVLALDLSTSMLAEEQGR----------EGRGENRLAAAKRVLSEF 124 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + ++ +GL+ + R + + ++ V R + + T A+ Sbjct: 125 IGA-------RKQDRIGLVAFAGRPYPAAPLTSDHQWLQGIVERLDTNSVEDGTALGDAI 177 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 L + +I +TDG NN + AK Sbjct: 178 LAGVNRLRQRPAEG------------------RALILITDGRNNAGAEPQLAAQ---AAK 216 Query: 359 ENFIKIVTIS----------INASPNGQ-----------RLLKTCVS-SPEYHYNVVNAD 396 I++ I + + G LK + ++ +A Sbjct: 217 ALGIRVHAIGIGSRGSAVIPVPSPLGGTIYRRLDADLDAATLKGVAEITGGRYFEAGDAT 276 Query: 397 SLIHVFQNISQL 408 L VF I +L Sbjct: 277 VLSRVFAEIDRL 288 >gi|307108695|gb|EFN56934.1| hypothetical protein CHLNCDRAFT_143466 [Chlorella variabilis] Length = 785 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 40/111 (36%), Gaps = 12/111 (10%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ----DKKRTKMAALKNALLLFLDSID 243 RP + L++D+SGSM CA + D + + +RTK+ K + L+ + Sbjct: 300 RPRLNLCLLLDVSGSMDCAFDRHYYDAATGRQATLQGEEARRTKLDVAKEVIKGVLERLS 359 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDSLILKPTD 293 V + ++ WG + +DSL D Sbjct: 360 PDDCV------AISLFSDAAATPKRMGRWGDADAAG-IQAGIDSLRTAGLD 403 >gi|293360567|ref|XP_216941.5| PREDICTED: matrilin 2 [Rattus norvegicus] Length = 900 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 63/182 (34%), Gaps = 28/182 (15%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 ID L+ + +GL+ Y+T+V G + + + + Sbjct: 636 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLK-GFSSAKDMKKAVAHMKYMGKGSMTGL 694 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K Y+ + + G + S + I TDG + +V+ + K Sbjct: 695 ALKHMYERSFTQVE----------GARPLSTRVPRAAIVFTDG---RAQDDVS--EWARK 739 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRKY 414 AK N I + + I + + L+ S P ++ + + F + ++ K Sbjct: 740 AKANGITMYAVGIGKAIEEE--LQEIASEPIDKHLFYAED-------FSTMGEISEKLKL 790 Query: 415 SV 416 + Sbjct: 791 GI 792 >gi|293348660|ref|XP_001058523.2| PREDICTED: matrilin 2 [Rattus norvegicus] Length = 922 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 63/182 (34%), Gaps = 28/182 (15%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 ID L+ + +GL+ Y+T+V G + + + + Sbjct: 664 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLK-GFSSAKDMKKAVAHMKYMGKGSMTGL 722 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K Y+ + + G + S + I TDG + +V+ + K Sbjct: 723 ALKHMYERSFTQVE----------GARPLSTRVPRAAIVFTDG---RAQDDVS--EWARK 767 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRKY 414 AK N I + + I + + L+ S P ++ + + F + ++ K Sbjct: 768 AKANGITMYAVGIGKAIEEE--LQEIASEPIDKHLFYAED-------FSTMGEISEKLKL 818 Query: 415 SV 416 + Sbjct: 819 GI 820 >gi|149066552|gb|EDM16425.1| matrilin 2 (predicted) [Rattus norvegicus] Length = 898 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 63/182 (34%), Gaps = 28/182 (15%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 ID L+ + +GL+ Y+T+V G + + + + Sbjct: 636 VTGIIDSLAVSPKAARVGLLQYSTQVRTEFTLK-GFSSAKDMKKAVAHMKYMGKGSMTGL 694 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K Y+ + + G + S + I TDG + +V+ + K Sbjct: 695 ALKHMYERSFTQVE----------GARPLSTRVPRAAIVFTDG---RAQDDVS--EWARK 739 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRKY 414 AK N I + + I + + L+ S P ++ + + F + ++ K Sbjct: 740 AKANGITMYAVGIGKAIEEE--LQEIASEPIDKHLFYAED-------FSTMGEISEKLKL 790 Query: 415 SV 416 + Sbjct: 791 GI 792 >gi|301620566|ref|XP_002939640.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Xenopus (Silurana) tropicalis] Length = 1179 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 49/287 (17%), Positives = 97/287 (33%), Gaps = 53/287 (18%) Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 + S+ G S + +T+++ +R++ T + ++++ D + YD L+ + Sbjct: 211 MTNSVRGESKMPPSTQVEKGTYCARVTYTPTPTEQ--AAHSRPGVTADFVLQYDVSLKDL 268 Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + F P L + + V+D+SGSM Sbjct: 269 AGDVQIYNGYFVHYFAPRGLPPIQKD-----VIFVIDVSGSMFG---------------- 307 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV------EKNIEPSWGTEK 275 TK+ K+A+ + L+ + +I ++ V + + + Sbjct: 308 ----TKIKQTKSAMHVILNDLHRDDSFN------IITFSDVVHVWRPGQSIPATAQNKKS 357 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + YV + TD A+ A I + S KIP IIF Sbjct: 358 AKDYVNKIEAD---GWTDINAALMAAASIF---NQTSHKPEKETSTKKIP------LIIF 405 Query: 336 LTDGE-NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 LTDGE + + + KA I + ++ L++ Sbjct: 406 LTDGEATSGVLATSRILSNAQKAMGGTISLFCLAF-GEDADYNLMRR 451 >gi|300786826|ref|YP_003767117.1| hypothetical protein AMED_4949 [Amycolatopsis mediterranei U32] gi|299796340|gb|ADJ46715.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 326 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 38/241 (15%), Positives = 75/241 (31%), Gaps = 55/241 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + LV+D+S SM D T++ A + A F + Sbjct: 86 RNRATVMLVIDVSLSMEA---------------TDVLPTRLQAAQEAATSF------ARN 124 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + +GLI + T + + + + T + + A Q + S Sbjct: 125 MTPGINLGLISF---AGTATVLVNPTTDRNGVIKAIENLKLAQSTATGEGIFAALQSVES 181 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENFI 362 G P + I+ ++DG+ + T AK+ + Sbjct: 182 FS--------SLVGGADGPPPAR--IVLMSDGKQTVPEDLYAARGGYTAAQ--AAKQAGV 229 Query: 363 KIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I +IS S + + L + S Y +A+ L V+ ++ + Sbjct: 230 PISSISFGTTHGSVTIDDKPQPVSVDDESLREIARLSGGDFYKAASAEELKKVYADLGEQ 289 Query: 409 M 409 + Sbjct: 290 I 290 >gi|299535615|ref|ZP_07048936.1| BatA [Lysinibacillus fusiformis ZC1] gi|298728815|gb|EFI69369.1| BatA [Lysinibacillus fusiformis ZC1] Length = 972 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 27/147 (18%), Positives = 55/147 (37%), Gaps = 24/147 (16%) Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 K GT + S TD + A ++D K S Sbjct: 742 KATVLGEGTTENVLKKDLYKASKEKGATDIFAGIDIALTKFSNDSKTS------------ 789 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKTC 382 K I+ ++DG+ + K K+ ++AK+ +K+ T+S+ + N L++ Sbjct: 790 ------KAIVVVSDGKTSKSK----MTKVINEAKKQGVKVYTVSMGKKSQVNDATLMQLS 839 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLM 409 + +++ ++ L VFQ + + Sbjct: 840 SETSGAYFHAIDNMQLHQVFQKLIDTI 866 >gi|311978223|ref|YP_003987343.1| putative ariadne-like ring finger protein [Acanthamoeba polyphaga mimivirus] gi|82000061|sp|Q5UQ35|YR811_MIMIV RecName: Full=Putative ariadne-like RING finger protein R811 gi|55417421|gb|AAV51071.1| unknown [Acanthamoeba polyphaga mimivirus] gi|308205062|gb|ADO18863.1| putative ariadne-like ring finger protein [Acanthamoeba polyphaga mimivirus] Length = 990 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 60/188 (31%), Gaps = 34/188 (18%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 M + + + +VVD +GSM ++S L +L + ID+ Sbjct: 1 MDNQAVVDLAIVVDATGSMGTFLSS---------------------LSESLQQIVQIIDI 39 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + ++ + +I Y + I S G K+ + D+ A K A Sbjct: 40 TNVIQN---INIIMYRDYCDSVITASSGWVSKIDDLIPFIRGLRASGGGDTPEAGKTAAN 96 Query: 304 ILTSDKKRS----FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 L K + ++ + F + I L + I++CD Sbjct: 97 NLLDVVKNNTIVIWYADAPPHHKSNARDNFAREINTLI-----GSDKIFDWIELCDTLAA 151 Query: 360 NFIKIVTI 367 I + I Sbjct: 152 RNIIVYPI 159 >gi|298709908|emb|CBJ31633.1| conserved unknown protein [Ectocarpus siliculosus] Length = 304 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 53/170 (31%), Gaps = 35/170 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI--DLLSHVK 249 + +V+D+SGSM + ++ K A ++++ D +V Sbjct: 115 DVVIVLDVSGSMS-------------------QYGRLDLAKEAAETVINTLGADSFVNVV 155 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + T ++ V L T+ A + + IL Sbjct: 156 TFSETARVLLTNSTTLVRATEDNLGELVSLVQNLEFDLANVGTNFGAAFETTFDIL---- 211 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-NFKSNVNTIKI--CDK 356 + S + + I+FLTDG N ++ T K CD Sbjct: 212 EASRTSEETSSNCQTA-------IVFLTDGNTNVGLSTDEVTSKQIACDT 254 >gi|291395333|ref|XP_002714013.1| PREDICTED: integrin, alpha 1 [Oryctolagus cuniculus] Length = 1200 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 69/182 (37%), Gaps = 25/182 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 D ++ + + +G++ Y V + V + +I + T Sbjct: 215 LNDLLERMDIGPKQTQVGIVQYGENVTHEFNL--NKYSSTEEVLVAANKIIQRGGRQT-- 270 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 A I T+ K+ R+G +K ++ +TDGE N K+ Sbjct: 271 -MTALGIDTARKEAFTEARGARRG-------VKKVMVIVTDGE---SHDNHRLKKVIQDC 319 Query: 358 KENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 ++ I+ +I+I S N L +K+ S P ++ +NV + +L+ + + + + Sbjct: 320 EDENIQRFSIAILGSYNRGNLSAEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGE 379 Query: 408 LM 409 + Sbjct: 380 RI 381 >gi|260771476|ref|ZP_05880401.1| hypothetical protein VFA_000095 [Vibrio furnissii CIP 102972] gi|260613602|gb|EEX38796.1| hypothetical protein VFA_000095 [Vibrio furnissii CIP 102972] Length = 407 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 22/167 (13%), Positives = 55/167 (32%), Gaps = 15/167 (8%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 +GI ++ +I ++++ + + F I + + ++++ +AA LA A + + Sbjct: 8 RGIRKQRGLVVVIVTIAMLVLIAVAAFAIDINHAMMNRTKLQNSVDAAALAAAIVLDKDG 67 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + A D R + + + + N+ T Sbjct: 68 T--------------EAQADTIARSTLTKMSTAAGNAELTLDVSNVVNVEVQFSNDPTVF 113 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 ++ SS + Y V+ D F ++ + S + Sbjct: 114 PDSGYSSSPDGDRYVRVVINQLDLESFFFARVMGVT-KRLTASAVAG 159 >gi|291455286|ref|ZP_06594676.1| von Willebrand factor [Streptomyces albus J1074] gi|291358235|gb|EFE85137.1| von Willebrand factor [Streptomyces albus J1074] Length = 422 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 77/223 (34%), Gaps = 31/223 (13%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 RP +ELV+D+SGSM +++M+A K A D+ Sbjct: 27 PSARPAPKVELVLDVSGSM--------------KTRDIDGQSRMSAAKQAFN------DV 66 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + V E+V +G+ + + G + RQ + P D T A K A Sbjct: 67 IDAVPEEVELGIRTLGADYPGEDK-ARGCKDTRQ-------LYPVGPIDRTEA-KTAVAT 117 Query: 305 LTSDKKRSFFTNFFRQGVK-IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L+ + I+ ++DGE+ + + AK + Sbjct: 118 LSPTGWTPIGPALLGAADDLDGDEGGSRRIVLISDGEDTCGPLDPCEVAREIAAKGVDLV 177 Query: 364 IVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 I T+ + + ++ L ++ + V + + L + + Sbjct: 178 IDTLGLVPNAKIRQQLSCIAGATGGTYTAVQHKEDLSDKVKQL 220 >gi|239983463|ref|ZP_04705987.1| hypothetical protein SalbJ_28780 [Streptomyces albus J1074] Length = 423 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 77/223 (34%), Gaps = 31/223 (13%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 RP +ELV+D+SGSM +++M+A K A D+ Sbjct: 28 PSARPAPKVELVLDVSGSM--------------KTRDIDGQSRMSAAKQAFN------DV 67 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + V E+V +G+ + + G + RQ + P D T A K A Sbjct: 68 IDAVPEEVELGIRTLGADYPGEDK-ARGCKDTRQ-------LYPVGPIDRTEA-KTAVAT 118 Query: 305 LTSDKKRSFFTNFFRQGVK-IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L+ + I+ ++DGE+ + + AK + Sbjct: 119 LSPTGWTPIGPALLGAADDLDGDEGGSRRIVLISDGEDTCGPLDPCEVAREIAAKGVDLV 178 Query: 364 IVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 I T+ + + ++ L ++ + V + + L + + Sbjct: 179 IDTLGLVPNAKIRQQLSCIAGATGGTYTAVQHKEDLSDKVKQL 221 >gi|124485081|ref|YP_001029697.1| hypothetical protein Mlab_0254 [Methanocorpusculum labreanum Z] gi|124362622|gb|ABN06430.1| von Willebrand factor, type A [Methanocorpusculum labreanum Z] Length = 313 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 57/175 (32%), Gaps = 43/175 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + +D+S SM D T++ A K + + + S+ Sbjct: 91 VALDVSASMSA---------------SDYSPTRVEAAKGSSEILIRSLSESDT------A 129 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 G++ + + S +V + + S+ T + A ++T+ ++ Sbjct: 130 GVVIFESGASSAAYLSSDKNRVVSRL--EQVSVKTGKTALGDGLALAVDMVTAIPAGTY- 186 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 ++ L+DG +N+ + AK + + + TI + Sbjct: 187 -----------------IVVLLSDGVSNSGMITPQEAA--EYAKNSGVVVYTIGV 222 >gi|24431113|gb|AAN61407.1|AF486289_1 matrilin [Biomphalaria glabrata] Length = 394 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 63/186 (33%), Gaps = 27/186 (14%) Query: 232 KNALLLFLDSID--LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 K A+ + + +S V + +I Y + I + T + V + + Sbjct: 71 KTAIKFLQEFLSQYEISSDPNGVRVSIISYGKGIYPEIGFNLTTYDTKDEVIEAIGRIPH 130 Query: 290 KP---TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 K TD+ A++ ++ ++ K I +TDG N + Sbjct: 131 KAGLRTDTGRAIQYMHEAQLANGVVRPG--------------VTKVSIVITDG---NSQE 173 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T + ++A+++ I + I + LL V N + L +I Sbjct: 174 WKLTKEAAEEARKDNIVMFAIGVGTDIRNSELL-NIAGDQSRVTKVDNYNQL----SSIK 228 Query: 407 QLMVHR 412 + + H+ Sbjct: 229 ESLAHQ 234 >gi|311262926|ref|XP_003129419.1| PREDICTED: anthrax toxin receptor 2-like [Sus scrofa] Length = 241 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 57/216 (26%), Gaps = 49/216 (22%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + F + V+D SGS+ V+ + Sbjct: 32 SAQEQPSCSGAFDLYFVLDKSGSVANNWIEIYNFVHQLTERFVSPQ-------------- 77 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + + I ++++ + + K+ + + + T +K Sbjct: 78 ------------MRLSFIVFSSQATIILPLTGDRGKISEGLDNLKRVSPVGETYIHEGLK 125 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKA 357 A Q K L II LTDG + KI + Sbjct: 126 LA----------------NEQIEKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKI---S 166 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 + ++ + + Q L+ S E + V Sbjct: 167 RSLGARVYCVGVL--DFEQAQLERIADSKEQVFPVT 200 >gi|5726289|gb|AAD48398.1|AF127035_1 calcium-activated chloride channel protein 2 [Homo sapiens] Length = 917 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 78/235 (33%), Gaps = 58/235 (24%) Query: 195 LVVDLSGSMHCA--MNSDPEDVNSAPICQDKKRTKMAALK-NALLLFLDSIDLLSHVKED 251 LV+D SGSM +N + + + + + + ++ ++ + + E Sbjct: 309 LVLDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDE- 367 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 R + + + L T +K A+Q++ + Sbjct: 368 -------------------------RNTLMAGLPTYPLGGTSICSGIKYAFQVIGELHSQ 402 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 + ++ LTDGE+N S C D+ K++ + I++ Sbjct: 403 LDGSE----------------VLLLTDGEDNTASS-------CIDEVKQSGAIVHFIALG 439 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNA--DSLIHVFQNIS---QLMVHRKYSVILKG 420 + + + + ++ + Y A + LI F ++ + + + KG Sbjct: 440 RAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKG 494 >gi|21323788|dbj|BAB98414.1| Hypothetical membrane protein [Corynebacterium glutamicum ATCC 13032] Length = 634 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 43/236 (18%), Positives = 79/236 (33%), Gaps = 43/236 (18%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 IV+ +L + + +V+D SGSM +T++ A K Sbjct: 20 IVALAFSLSPVAKAQANETPTMIVLDNSGSMTAQDAGG--------------QTRIDAAK 65 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A S L++ + + +GL Y T + V + + Sbjct: 66 QA------STQLINDISDRTDVGLTYYGG----------NTGETEADVEMGCQDVTILGG 109 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 S + S + R F LP I+ ++DG N + Sbjct: 110 PSRGNADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGNIVLVSDGIANCTPPD----- 164 Query: 353 ICDKAK---ENFI--KIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHV 401 +C+ A+ ++ I I TI +N P + L+ C++ + + +A SL Sbjct: 165 VCEVAQELAQSGINLVINTIGLNVDPAAREELE-CIAGVGGGTYADASDAQSLTDA 219 >gi|56797994|emb|CAG27564.2| matrilin-3b [Danio rerio] gi|220675932|emb|CAX12091.1| matrilin 3b [Danio rerio] Length = 343 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 63/178 (35%), Gaps = 34/178 (19%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDST 295 + ++ L + + L+ Y + V + +V+Q +R + T + Sbjct: 92 LSEMVNSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDP--LSTGTMTG 149 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI----IFLTDGENNNFKSNVNTI 351 A+K A + + ++ + P +K I I +TDG + V + Sbjct: 150 MAIKTAMEQVFTENA--------------GARPLKKGIGKVAIIVTDG---RPQDKVEEV 192 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 A+ + I+I + + R LK S P ++ + V + L F+ Sbjct: 193 S--AAARASGIEIYAVGV--DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRE 246 >gi|62389907|ref|YP_225309.1| hypothetical protein cg1159 [Corynebacterium glutamicum ATCC 13032] gi|41325243|emb|CAF19723.1| putative secreted protein [Corynebacterium glutamicum ATCC 13032] Length = 634 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 43/236 (18%), Positives = 79/236 (33%), Gaps = 43/236 (18%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 IV+ +L + + +V+D SGSM +T++ A K Sbjct: 20 IVALAFSLSPVAKAQANETPTMIVLDNSGSMTAQDAGG--------------QTRIDAAK 65 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A S L++ + + +GL Y T + V + + Sbjct: 66 QA------STQLINDISDRTDVGLTYYGG----------NTGETEADVEMGCQDVTILGG 109 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 S + S + R F LP I+ ++DG N + Sbjct: 110 PSRGNADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGNIVLVSDGIANCTPPD----- 164 Query: 353 ICDKAK---ENFI--KIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHV 401 +C+ A+ ++ I I TI +N P + L+ C++ + + +A SL Sbjct: 165 VCEVAQELAQSGINLVINTIGLNVDPAAREELE-CIAGVGGGTYADASDAQSLTDA 219 >gi|297460736|ref|XP_599315.5| PREDICTED: collagen, type XXII, alpha 1 [Bos taurus] Length = 1605 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 56/174 (32%), Gaps = 26/174 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +D E +G++ Y+ R E + + T + A++ Sbjct: 65 VDTFEVGPERTRVGVVRYSDRPATAFELGRFGSRAAVRAAARQLAYHGGHTHTGDALRF- 123 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + R G + F++ I LTDG ++ A+ Sbjct: 124 ---------ITRHSFTPRAGGRPGDRAFKQVAILLTDGR----SQDLVLPAA-TAARRAG 169 Query: 362 IKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRK 413 I+I + + + + L+ S P + ++V + F I ++ + Sbjct: 170 IRIFAVGVGEALREE--LEEIASEPTAAHVFHVSD-------FDAIDKIRGKLR 214 >gi|297482250|ref|XP_002692646.1| PREDICTED: collagen, type XXII, alpha 1-like [Bos taurus] gi|296480820|gb|DAA22935.1| collagen, type XXII, alpha 1-like [Bos taurus] Length = 1605 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 56/174 (32%), Gaps = 26/174 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +D E +G++ Y+ R E + + T + A++ Sbjct: 65 VDTFEVGPERTRVGVVRYSDRPATAFELGRFGSRAAVRAAARQLAYHGGHTHTGDALRF- 123 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + R G + F++ I LTDG ++ A+ Sbjct: 124 ---------ITRHSFTPRAGGRPGDRAFKQVAILLTDGR----SQDLVLPAA-TAARRAG 169 Query: 362 IKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRK 413 I+I + + + + L+ S P + ++V + F I ++ + Sbjct: 170 IRIFAVGVGEALREE--LEEIASEPTAAHVFHVSD-------FDAIDKIRGKLR 214 >gi|19552242|ref|NP_600244.1| hypothetical protein NCgl0978 [Corynebacterium glutamicum ATCC 13032] Length = 594 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 72/215 (33%), Gaps = 43/215 (20%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 +V+D SGSM +T++ A K A S L++ + + Sbjct: 1 MIVLDNSGSMTAQDAGG--------------QTRIDAAKQA------STQLINDISDRTD 40 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +GL Y T + V + + S + S + R F Sbjct: 41 VGLTYYGG----------NTGETEADVEMGCQDVTILGGPSRGNADTLIDTINSLQPRGF 90 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK---ENFI--KIVTIS 368 LP I+ ++DG N + +C+ A+ ++ I I TI Sbjct: 91 TPIGKALTDTAAELPEGGNIVLVSDGIANCTPPD-----VCEVAQELAQSGINLVINTIG 145 Query: 369 INASPNGQRLLKTCVS--SPEYHYNVVNADSLIHV 401 +N P + L+ C++ + + +A SL Sbjct: 146 LNVDPAAREELE-CIAGVGGGTYADASDAQSLTDA 179 >gi|301626452|ref|XP_002942405.1| PREDICTED: collagen alpha-6(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 2615 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 58/166 (34%), Gaps = 27/166 (16%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + + ++ + V +GLI +++ ++ K + + + + Sbjct: 1041 YETMKEFMESMVKQAEIGP---DRVQIGLIQFSSETKEEFPL--NRYKRKDEIQSAIRGI 1095 Query: 288 I--LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T A+K + K + ++++I +TDGE + Sbjct: 1096 QQLSQGTLMGEALKYTL--------------PYFSASKGGRVNTKQYLIVITDGEAQDAV 1141 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 N +++ + I I + + N Q LL+ + +Y Sbjct: 1142 GNPA-----KAIRDHGVIIYAIGVQQANNTQ-LLEIAGKQEQVYYE 1181 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 64/175 (36%), Gaps = 22/175 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTT-RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ I + V G++ Y+ R + T+K+ + ++ L T + Sbjct: 862 MIELISMFQVGANRVRFGVVQYSDVRRTEFFISEHNTQKMLKDAISQIEQL-GGGTLTGE 920 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ Q+ + K K+P ++ +TDGE + T + Sbjct: 921 ALTSMKQLFVNAAKDRPH--------KVPQS-----LVVITDGE----SQDRVTEAA-AE 962 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + I I I + + ++ S E + V N DSL + ++++ + Sbjct: 963 IRNDGITIFAIGVK--NAVEEEIRDIAGSNEKMFFVNNFDSLKVIKNDLARELCT 1015 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 55/161 (34%), Gaps = 17/161 (10%) Query: 254 MGLIGYTTRVEKNIEPSWGTE--KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + + S V QY L S+ MK+ ++ K Sbjct: 1420 TSAKTFMKEIVDSFTISENRVRIGVAQYSANPKKEFFLNEYYSSSDMKKQIDSISQLKAT 1479 Query: 312 SFFTNFFRQGVK------IPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAKENFIKI 364 ++ R + +++I +TDG N++ + + + + +KI Sbjct: 1480 TYTGKGLRFVKQFFDPANGGRKNVPQYLIVMTDGMSNDSVNEDAAAL------RSSGVKI 1533 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 +I I + + ++ SP+ Y V +L + + I Sbjct: 1534 FSIGIGLRNSFELVM--IAGSPKNVYEVETFQALDSIKRQI 1572 >gi|269960459|ref|ZP_06174831.1| hypothetical protein VME_12150 [Vibrio harveyi 1DA3] gi|269834536|gb|EEZ88623.1| hypothetical protein VME_12150 [Vibrio harveyi 1DA3] Length = 420 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 18/116 (15%), Positives = 45/116 (38%), Gaps = 2/116 (1%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NLS 73 ++K ++ ++ ++ L + F I + K +++A ++A LAGA + N+S Sbjct: 11 RTQKGITLVLISMVLLILLGMAAFGIDLNHQVLNKTRLQNAVDSAALAGAVVVDENGNVS 70 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 ++ + + + F ++ + S A F + S + Sbjct: 71 AAETAAKATLSSISASDGNAELVFTDSNTAVTFSTDRATFVSAASFTPPASGEYDI 126 >gi|156408866|ref|XP_001642077.1| predicted protein [Nematostella vectensis] gi|156229218|gb|EDO50014.1| predicted protein [Nematostella vectensis] Length = 251 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 22/129 (17%), Positives = 47/129 (36%), Gaps = 15/129 (11%) Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 I T++ A+ A + L K + ++ +TDG +N ++ Sbjct: 132 FIAGTTNTQEALNLAQRELFGKKNSGATPGAIGR------------VLIITDGLSNVQRN 179 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNG-QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 T+ K K +I +++ G L+ S+ + + + L + I Sbjct: 180 --LTLFNAYKLKMAGPEIYVVAVGQYLYGLHELVGLASSTENHLFRAQSMRGLEGAVRLI 237 Query: 406 SQLMVHRKY 414 + ++RKY Sbjct: 238 PKPSMYRKY 246 >gi|109731121|gb|AAI13690.1| Chloride channel accessory 4 [Homo sapiens] gi|109731369|gb|AAI13688.1| Chloride channel accessory 4 [Homo sapiens] gi|313883598|gb|ADR83285.1| chloride channel accessory 4 [synthetic construct] Length = 917 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 78/235 (33%), Gaps = 58/235 (24%) Query: 195 LVVDLSGSMHCA--MNSDPEDVNSAPICQDKKRTKMAALK-NALLLFLDSIDLLSHVKED 251 LV+D SGSM +N + + + + + + ++ ++ + + E Sbjct: 309 LVLDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDE- 367 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 R + + + L T +K A+Q++ + Sbjct: 368 -------------------------RNTLMAGLPTYPLGGTSICSGIKYAFQVIGELHSQ 402 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 + ++ LTDGE+N S C D+ K++ + I++ Sbjct: 403 LDGSE----------------VLLLTDGEDNTASS-------CIDEVKQSGAIVHFIALG 439 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNA--DSLIHVFQNIS---QLMVHRKYSVILKG 420 + + + + ++ + Y A + LI F ++ + + + KG Sbjct: 440 RAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKG 494 >gi|119593591|gb|EAW73185.1| chloride channel, calcium activated, family member 4, isoform CRA_b [Homo sapiens] Length = 918 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 78/235 (33%), Gaps = 58/235 (24%) Query: 195 LVVDLSGSMHCA--MNSDPEDVNSAPICQDKKRTKMAALK-NALLLFLDSIDLLSHVKED 251 LV+D SGSM +N + + + + + + ++ ++ + + E Sbjct: 310 LVLDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDE- 368 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 R + + + L T +K A+Q++ + Sbjct: 369 -------------------------RNTLMAGLPTYPLGGTSICSGIKYAFQVIGELHSQ 403 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 + ++ LTDGE+N S C D+ K++ + I++ Sbjct: 404 LDGSE----------------VLLLTDGEDNTASS-------CIDEVKQSGAIVHFIALG 440 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNA--DSLIHVFQNIS---QLMVHRKYSVILKG 420 + + + + ++ + Y A + LI F ++ + + + KG Sbjct: 441 RAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKG 495 >gi|1705570|sp|P51942|MATN1_MOUSE RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|1163179|gb|AAB06521.1| cartilage matrix protein precursor [Mus musculus] Length = 500 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 74/190 (38%), Gaps = 29/190 (15%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K + +D++D+ + + GL+ Y++ + + G ++ + Sbjct: 291 RPENFELVKKFINQIVDTLDVSDRLAQV---GLVQYSSSIRQEFPL--GRFHSKKDIKAR 345 Query: 284 MD--SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + S + K T + A+K D + + K+ I+F TDG + Sbjct: 346 VRNMSYMEKGTMTGAALKY-----LIDNSFTVSSGARPGAQKVG-------IVF-TDGRS 392 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 ++ ++ KAK+ K+ + + + L+ S P ++++ + ++ Sbjct: 393 QDYINDAA-----RKAKDLGFKMFAVGVG--NAVEEELREIASEPVADHYFYTADFKTIN 445 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 446 QIGKKLQKQI 455 >gi|326505554|dbj|BAJ95448.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326515132|dbj|BAK03479.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 707 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 79/255 (30%), Gaps = 55/255 (21%) Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM---THMANNRLDSSNNTIFYNMDV 150 R + N +L VF E + I + T NN + S Y Sbjct: 161 VIRRLSNSYSGNLLEQLPVFRTPEADIFNDDEHIDVHSETAEENNEVTGSVEIKTYA--- 217 Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 +Q I+ + Q+ ++ + R + V+D+SGSM Sbjct: 218 ------EVQAIQQSVTQKVFSILIHLKAPKSLESVSSRAPLDLVTVLDVSGSM------- 264 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 K K+A LK A+ + ++ + +I +++ + Sbjct: 265 -------------KGAKLALLKKAMCFVIQTLGPND------RLSVIAFSSTARRLFPLR 305 Query: 271 -WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 Q + + T+ + +K+ +++ + +K P Sbjct: 306 QMNVNGRMQAIQAVNSLVDGGGTNISDGLKKGAKVI------------EHRRLKNPVCS- 352 Query: 330 QKFIIFLTDGENNNF 344 II L+DG++ Sbjct: 353 ---IILLSDGQDTYS 364 >gi|149021014|gb|EDL78621.1| rCG55860, isoform CRA_b [Rattus norvegicus] gi|149021015|gb|EDL78622.1| rCG55860, isoform CRA_b [Rattus norvegicus] Length = 919 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 43/222 (19%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D+S SM A K+ + AL+ L+ + Sbjct: 262 VVFVLDISASMVGA--------------------KLQQTREALVTILNDLRPQD------ 295 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ + +R TD A++ A ++L + Sbjct: 296 RFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQTAIKLLNNYVA 355 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---FIKIVTI 367 ++ + IIFLTDG+ NT++I KE I I T+ Sbjct: 356 QNDIEDRSVS-----------LIIFLTDGKPTF--GETNTLRILSNTKEATGGQICIFTV 402 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 403 GIGNDVDFRLLEKLSLENCGLTRRVHEEEKAGAQLIGFYDEI 444 >gi|119593590|gb|EAW73184.1| chloride channel, calcium activated, family member 4, isoform CRA_a [Homo sapiens] Length = 917 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 78/235 (33%), Gaps = 58/235 (24%) Query: 195 LVVDLSGSMHCA--MNSDPEDVNSAPICQDKKRTKMAALK-NALLLFLDSIDLLSHVKED 251 LV+D SGSM +N + + + + + + ++ ++ + + E Sbjct: 309 LVLDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDE- 367 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 R + + + L T +K A+Q++ + Sbjct: 368 -------------------------RNTLMAGLPTYPLGGTSICSGIKYAFQVIGELHSQ 402 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 + ++ LTDGE+N S C D+ K++ + I++ Sbjct: 403 LDGSE----------------VLLLTDGEDNTASS-------CIDEVKQSGAIVHFIALG 439 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNA--DSLIHVFQNIS---QLMVHRKYSVILKG 420 + + + + ++ + Y A + LI F ++ + + + KG Sbjct: 440 RAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKG 494 >gi|150036262|ref|NP_036260.2| calcium-activated chloride channel regulator 4 [Homo sapiens] gi|205831469|sp|Q14CN2|CLCA4_HUMAN RecName: Full=Calcium-activated chloride channel regulator 4; AltName: Full=Calcium-activated chloride channel family member 4; Short=hCLCA4; AltName: Full=Calcium-activated chloride channel protein 2; Short=CaCC-2; Short=hCaCC-2; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 110 kDa form; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 30 kDa form; Flags: Precursor gi|37182063|gb|AAQ88834.1| CLCA4 [Homo sapiens] gi|56203696|emb|CAI22170.1| chloride channel, calcium activated, family member 4 [Homo sapiens] Length = 919 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 78/235 (33%), Gaps = 58/235 (24%) Query: 195 LVVDLSGSMHCA--MNSDPEDVNSAPICQDKKRTKMAALK-NALLLFLDSIDLLSHVKED 251 LV+D SGSM +N + + + + + + ++ ++ + + E Sbjct: 309 LVLDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDE- 367 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 R + + + L T +K A+Q++ + Sbjct: 368 -------------------------RNTLMAGLPTYPLGGTSICSGIKYAFQVIGELHSQ 402 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 + ++ LTDGE+N S C D+ K++ + I++ Sbjct: 403 LDGSE----------------VLLLTDGEDNTASS-------CIDEVKQSGAIVHFIALG 439 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNA--DSLIHVFQNIS---QLMVHRKYSVILKG 420 + + + + ++ + Y A + LI F ++ + + + KG Sbjct: 440 RAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKG 494 >gi|153830633|ref|ZP_01983300.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae 623-39] gi|148873874|gb|EDL72009.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae 623-39] Length = 1426 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 63/193 (32%), Gaps = 25/193 (12%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS------APICQDKKRTKMAALKNAL 235 +E G+ + I L+VD SGSM + + + + ++M +AL Sbjct: 812 SVEPGKN--YNIALIVDTSGSMRYDLAGNQNATYDGWNDSWSQTPEQYAASRMKLTIDAL 869 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDS 294 + + V + LIG+ + + + + + + T+ Sbjct: 870 KVLATQLADHDG---TVNITLIGFNGTAADALPFNNLSAANLSDLIGKINLLIADGGTNY 926 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A +A F Q L F+ FLTDG+ + +T Sbjct: 927 EDAFIEA------------TKWFNTQPGSETDLKFENLTYFLTDGDPTVHNGDSSTGFTT 974 Query: 355 DKAK-ENFIKIVT 366 D A +N I T Sbjct: 975 DYADMQNAIDAFT 987 >gi|180654|gb|AAA63904.1| cartilage matrix protein [Homo sapiens] Length = 340 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 77/191 (40%), Gaps = 31/191 (16%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + +K + +D++D+ + +GL+ Y++ V + G ++ + Sbjct: 131 RPENLELVKKFISQIVDTLDVSD---KLAQVGLVQYSSSVRQEFPL--GRFHTKKDIKAA 185 Query: 284 MD--SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + S + K T + A+K D + + K+ I+F TDG + Sbjct: 186 VRNMSYMEKGTMTGAALKY-----LIDNSFTVSSGARPGAQKVG-------IVF-TDGRS 232 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 ++ ++ KAK+ K+ + + NA + L+ S P E+++ + ++ Sbjct: 233 QDYINDAA-----KKAKDLGFKMFAVGVGNAVEDE---LREIASEPVAEHYFYTADFKTI 284 Query: 399 IHVFQNISQLM 409 + + + + + Sbjct: 285 NQIGKKLQKKI 295 >gi|145298663|ref|YP_001141504.1| hemolysin-type calcium-binding repeat-containing protein [Aeromonas salmonicida subsp. salmonicida A449] gi|142851435|gb|ABO89756.1| hemolysin-type calcium-binding repeat protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 1156 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 47/229 (20%), Positives = 75/229 (32%), Gaps = 35/229 (15%) Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 S ++T + NN +F + V L + + VS + R+ Sbjct: 254 SYSYTLTDNEEHPSGGGNNNLFEDFAVTLVDSDGDSANNTLSVRIVDDVPVSPVDLNTRV 313 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 E P + +++D SGSM D V T+MA K ++L + D Sbjct: 314 EEFSNPGTNLMIILDTSGSMD-----DASGVAGFA-------TRMAIAKASILQLIGDYD 361 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPS---WGTEKVRQYV-TRDMDSLILKPTDSTPAMK 299 + DV + L+G+ + N + W T V D L TD A+ Sbjct: 362 DV----GDVMVRLVGFASSATTNFLGAGDVWLTATQALNVINGITDYLGNGGTDYDDALI 417 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 +A S K Q + FL+DGE + Sbjct: 418 KAMSAYDSAGKIIGG---------------QSVLYFLSDGEPTESTNWP 451 >gi|56797861|emb|CAG27403.1| matrilin-3b [Danio rerio] gi|220675930|emb|CAX12089.1| matrilin 3b [Danio rerio] Length = 299 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 63/178 (35%), Gaps = 34/178 (19%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDST 295 + ++ L + + L+ Y + V + +V+Q +R + T + Sbjct: 92 LSEMVNSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDP--LSTGTMTG 149 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI----IFLTDGENNNFKSNVNTI 351 A+K A + + ++ + P +K I I +TDG + V + Sbjct: 150 MAIKTAMEQVFTENA--------------GARPLKKGIGKVAIIVTDG---RPQDKVEEV 192 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 A+ + I+I + + R LK S P ++ + V + L F+ Sbjct: 193 S--AAARASGIEIYAVGV--DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRE 246 >gi|148655419|ref|YP_001275624.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567529|gb|ABQ89674.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 824 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 54/179 (30%), Gaps = 36/179 (20%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKR--------------TKMAALKNALLLFLD 240 L++D+SGSM + C + + ++ K L F+ Sbjct: 407 LILDVSGSMSWTFDGRGVQNGQVVTCTNPTQGCVSIETAWPNVQERRIYTAKQVLRRFVQ 466 Query: 241 SID----LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ID + V M + +T R+ + + + D L T+ Sbjct: 467 QIDQDRRSGLRPHDTVRM--VTFTGRLGNYVNNEGRVGDNNRALNDLTDVLPAGWTNDRA 524 Query: 297 AMKQAYQI---LTSDKKRSFFTNFFRQGVKIPS-------------LPFQKFIIFLTDG 339 ++ A + D + S + +++ +IF+TDG Sbjct: 525 TLEAAINEAGMVDGDPYMTAGATPSAVAFARASQVFAAAPERAPNGMKYRRVVIFVTDG 583 >gi|293342867|ref|XP_001069890.2| PREDICTED: rCG55860-like [Rattus norvegicus] gi|149021013|gb|EDL78620.1| rCG55860, isoform CRA_a [Rattus norvegicus] Length = 953 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 43/222 (19%), Positives = 76/222 (34%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D+S SM A K+ + AL+ L+ + Sbjct: 296 VVFVLDISASMVGA--------------------KLQQTREALVTILNDLRPQD------ 329 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ + +R TD A++ A ++L + Sbjct: 330 RFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQTAIKLLNNYVA 389 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---FIKIVTI 367 ++ + IIFLTDG+ NT++I KE I I T+ Sbjct: 390 QNDIEDRSVS-----------LIIFLTDGKPTF--GETNTLRILSNTKEATGGQICIFTV 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L L+ C + H LI + I Sbjct: 437 GIGNDVDFRLLEKLSLENCGLTRRVHEEEKAGAQLIGFYDEI 478 >gi|325268974|ref|ZP_08135595.1| aerotolerance protein BatB [Prevotella multiformis DSM 16608] gi|324988595|gb|EGC20557.1| aerotolerance protein BatB [Prevotella multiformis DSM 16608] Length = 330 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 56/176 (31%), Gaps = 45/176 (25%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + +D+S SM D +++ K + + + Sbjct: 95 IALDISNSMLAE---------------DVAPSRLEKSKLLVENLMSRFSED-------KI 132 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSF 313 GLI + + + + ++ SLI + TD A++ A T + K Sbjct: 133 GLIVFAGEAFVQLPITSDYVSAKMFLDNINPSLIGTQGTDIGKALQLAANSFTPNSKAG- 191 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 K II +TDGE+N + + A+ IK+ + I Sbjct: 192 -----------------KAIILITDGEDNEGGAEAMAKQ----ARSKGIKVFILGI 226 >gi|47217883|emb|CAG05005.1| unnamed protein product [Tetraodon nigroviridis] Length = 647 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 71/186 (38%), Gaps = 25/186 (13%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +K ++ + L E +GL+ Y + V+ S T + V + Sbjct: 66 RPRDYEKVKTFIVNL---VQFLEVGPEATRVGLLQYGSVVQPEFSLS--TFSTKAEVEQA 120 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + ++ T + + Y TS + G + L + + +TDG Sbjct: 121 VRNMKHLATGTMTGLAIQYAAETSFTEAD--------GARPAHLHIPRIAVVVTDG---R 169 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSL 398 + V + +A++ I+I I + + LKT S P E+ + V + ++L Sbjct: 170 PQDRVEEVA--AQARQAGIQIFAIGVG--RVDMKTLKTIGSEPHSEHVHLVASFSQMETL 225 Query: 399 IHVFQN 404 + VFQ+ Sbjct: 226 VSVFQS 231 >gi|332221825|ref|XP_003260065.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 2 [Nomascus leucogenys] Length = 684 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 78/235 (33%), Gaps = 58/235 (24%) Query: 195 LVVDLSGSMHC--AMNSDPEDVNSAPICQDKKRTKMAALK-NALLLFLDSIDLLSHVKED 251 LV+D SGSM +N + + + + + + + ++ + + E Sbjct: 72 LVLDKSGSMGGYDRLNRMNQAAKHFLLQTVENGSWVGMVHFDTTATIVNKLIQIKGSDE- 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 R + + + L T +K A+Q++ + Sbjct: 131 -------------------------RNTLMAGLPTYALGGTSICSGIKYAFQVIGELHSQ 165 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 + ++ LTDGE+N S C D+ K++ + I++ Sbjct: 166 LDGSE----------------VVLLTDGEDNTASS-------CIDEVKQSGAIVHFIALG 202 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNA--DSLIHVFQNIS---QLMVHRKYSVILKG 420 ++ + + + ++ + Y A + LI F ++ + + + KG Sbjct: 203 SAADEAVIEMSNITGGSHFYASDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKG 257 >gi|30749469|pdb|1MHP|A Chain A, Crystal Structure Of A Chimeric Alpha1 Integrin I-Domain In Complex With The Fab Fragment Of A Humanized Neutralizing Antibody gi|30749472|pdb|1MHP|B Chain B, Crystal Structure Of A Chimeric Alpha1 Integrin I-Domain In Complex With The Fab Fragment Of A Humanized Neutralizing Antibody Length = 192 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 68/185 (36%), Gaps = 25/185 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + D + + + +G++ Y V + V + ++ + Sbjct: 22 IAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNL--NKYSSTEEVLVAANKIVQRGGRQ 79 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 T A I T+ K+ R+G +K ++ +TDGE N ++ Sbjct: 80 T---MTALGIDTARKEAFTEARGARRG-------VKKVMVIVTDGE---SHDNYRLKQVI 126 Query: 355 DKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQN 404 ++ I+ +I+I N L +K+ S P ++ +NV + +L+ + + Sbjct: 127 QDCEDENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKA 186 Query: 405 ISQLM 409 + + + Sbjct: 187 LGERI 191 >gi|88801582|ref|ZP_01117110.1| hypothetical protein PI23P_02947 [Polaribacter irgensii 23-P] gi|88782240|gb|EAR13417.1| hypothetical protein PI23P_02947 [Polaribacter irgensii 23-P] Length = 330 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 74/227 (32%), Gaps = 49/227 (21%) Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 + + L+ LL + ++ +++ +R I +D+S SM Sbjct: 33 SGFKAALKVTILLLGMTFLIISLTNPKMGSKLKTIKREGVDIVFALDISKSMLAE----- 87 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 D ++ K + +D + +G+I Y + + Sbjct: 88 ----------DIAPNRLEKSKQIISKIIDRLGSD-------RVGIIVYAGNSYPLLPIT- 129 Query: 272 GTEKVRQYV---TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 T+ + + D + + T A++ A +D++ + Sbjct: 130 -TDHAAANMFLQNANPDMVSSQGTAINEALELAKTYYNNDEQTN---------------- 172 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 +F+I L+DGE T ++ N +KI TI + + G Sbjct: 173 --RFLIILSDGE----DHQEETKQVAQNLANNGVKIYTIGVGTARGG 213 >gi|310799477|gb|EFQ34370.1| von Willebrand factor type A domain-containing protein [Glomerella graminicola M1.001] Length = 698 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 18/101 (17%), Positives = 41/101 (40%), Gaps = 9/101 (8%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + I P E + I LV+D+SGSM C P N ++ + + Sbjct: 66 DGLIAKITPPTQPTEPTDHVPCDIVLVIDVSGSMGCN---APVPANPGEKAENYGLSVLD 122 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 +K+A L++++ +G++ + ++ + + + Sbjct: 123 LVKHAARTVLETLNDGD------RLGIVTFASKAKVLQKLT 157 >gi|56797867|emb|CAG27567.1| matrilin-4 [Danio rerio] Length = 548 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 62/169 (36%), Gaps = 25/169 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 +D I L +G++ Y+++V+ K Q V + + L T + Sbjct: 45 MIDIIHELDIGLAATRIGVVQYSSQVQNVFSLK-AFSKTEQMVKAINEIIPLAQGTMTGL 103 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S ++ P++P I+ TDG + + V Sbjct: 104 AIRYAMNVAFSAEE-----------GARPNVPHVAVIV--TDGRPQDRVAEVAAAAR--- 147 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 E+ I+I + + L+ S P ++ + V + LIH F Sbjct: 148 --ESGIEIYAVGVA--RADMTSLRAMASPPFEDHVFLVESF-DLIHQFG 191 >gi|34810098|pdb|1QCY|A Chain A, The Crystal Structure Of The I-Domain Of Human Integrin Alpha1beta1 Length = 193 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 70/188 (37%), Gaps = 26/188 (13%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +++ FL D + + + +G++ Y V + + + Sbjct: 19 DSVTAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAAKKIVQRGG 77 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + A+ T+ K+ R+G +K ++ +TDGE N Sbjct: 78 AQTMTALGTD----TARKEAFTEARGARRG-------VKKVMVIVTDGE---SHDNHRLK 123 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHV 401 K+ ++ I+ +I+I S N L +K+ S P ++ +NV + +L+ + Sbjct: 124 KVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 183 Query: 402 FQNISQLM 409 + + + + Sbjct: 184 VKTLGERI 191 >gi|224078385|ref|XP_002194338.1| PREDICTED: collagen, type XX, alpha 1 [Taeniopygia guttata] Length = 1505 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 55/156 (35%), Gaps = 23/156 (14%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 +K L + + + + +GL Y++ E S R+ V + +L K Sbjct: 280 IKEFLSNLISPFSIAE---DKIRVGLSQYSSDPRTEWELS--AYSTREQVLEAVRNLRYK 334 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 ++ + LT +++ + L +K +I LTDG+ + + N Sbjct: 335 GGNTFTGL-----ALTHVLEQNLKPDAGA------RLEAEKLVILLTDGK---SQDDANL 380 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 K I+I I + + L+ S P Sbjct: 381 AAQ--TLKNLGIEIFAIGVK--NADEAELRQVASEP 412 >gi|123443829|ref|YP_001007800.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. enterocolitica 8081] gi|122090790|emb|CAL13672.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. enterocolitica 8081] Length = 459 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 52/276 (18%), Positives = 95/276 (34%), Gaps = 60/276 (21%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 +H + F + K +E+ I F + + F+ LI + + +K + A A Sbjct: 8 IHKFNHFTLFKK----NEQGTILISFMIILPFFIALIFITFEISHYLQRKAKLSDAIEQA 63 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 LA + + + D + I N+A + ++ L S F I N Sbjct: 64 TLA----LTIENNAIPDEPQQIKNNA----------LVLSYANAYLP--SKEFSV-PIIN 106 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 I +++ + YN V +Y + L N + I A+ Sbjct: 107 INDNTYY----------------LEYNAAVTMAYPAKFLTQTSLTNAITDINITDNGVAI 150 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + + V D SGSM + + + ++ AL++A D Sbjct: 151 KNKAIEASDLTDVIFVADYSGSMLYNFD----------VNEPNDHERINALRSAFRKLHD 200 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 I S++ IGY I SWGT+++ Sbjct: 201 IIMNNSNIN------AIGY-------IPFSWGTKRI 223 >gi|297816770|ref|XP_002876268.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] gi|297322106|gb|EFH52527.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] Length = 672 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 62/166 (37%), Gaps = 43/166 (25%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +I R + V+D+SGSM TK+A LK A+ + + Sbjct: 233 QISQYRRAPVDLVTVLDVSGSMGG--------------------TKLALLKRAMGFVIQN 272 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQ 300 + + +I +++ + + ++ RQ + ++SL+ T+ +++ Sbjct: 273 LGSSD------RLSVIAFSSTARRLFPLTRMSDAGRQQALQAVNSLVANGGTNIFDGLRK 326 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 +++ ++R+ + II L+DG + + Sbjct: 327 GAKVMEDRRERNSVAS----------------IILLSDGRDTYTTN 356 >gi|119913152|ref|XP_616068.3| PREDICTED: integrin, alpha 1 [Bos taurus] gi|297487671|ref|XP_002696385.1| PREDICTED: integrin, alpha 1 [Bos taurus] gi|296475808|gb|DAA17923.1| integrin, alpha 1 [Bos taurus] Length = 1195 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 69/193 (35%), Gaps = 29/193 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TD 293 D + + + +G++ Y V + + + T Sbjct: 207 AFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAAKKIVQRGGRQTM 265 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + + A + ++ + +K ++ +TDGE+++ I+ Sbjct: 266 TALGIDTARKEAFTEA-------------RGARRGVKKVMVIVTDGESHDNHRLNKVIQD 312 Query: 354 CDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQ 403 C+ + I+ +I+I S N L +K+ S P ++ +NV + +L+ + + Sbjct: 313 CE---DESIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVE 369 Query: 404 NISQLMVHRKYSV 416 + + + + +V Sbjct: 370 ALGERIFALEATV 382 >gi|309266594|ref|XP_003086799.1| PREDICTED: collagen alpha-5(VI) chain-like [Mus musculus] Length = 2601 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 80/247 (32%), Gaps = 46/247 (18%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 D R+ F ++ + + + + + + E I +VD SGS+ Sbjct: 584 DERVNFGQNFDSLKSIKN--EIVHRICSEKGCEDMKADIMFLVDSSGSIG---------- 631 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 T +K + + I + + +G++ ++ + + + + Sbjct: 632 ----------PTNFETMKTFMKNLVGKIQI---GADRSQVGVVQFSDYNREEFQLNKYST 678 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 Y D S I + T + A+ F N + K +KF+I Sbjct: 679 HEEIYAAIDRMSPINRNTLTGGAL--------------TFVNEYFDLSKGGRPQVRKFLI 724 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 LTDG+ + + + I ++ + + Q L+ ++V N Sbjct: 725 LLTDGKAQDEVGGPAM-----ALRSKSVTIFSVGVYGANRAQ--LEEISGDGSLVFHVEN 777 Query: 395 ADSLIHV 401 D L + Sbjct: 778 FDHLKAI 784 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 53/344 (15%), Positives = 121/344 (35%), Gaps = 58/344 (16%) Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIVNSSRISM 129 + +S + L A F+ + S G + E+ + R Sbjct: 687 DRMSPINRNTLTGGALTFVNEYFDLSKGGRPQVRKFLILLTDGKAQDEVGGPAMALRSKS 746 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + + +N + S + ++ +HL + K++ + AL + E Sbjct: 747 VTIFSVGVYGANRAQLEEISGDGSLVFHVENFDHL--KAIESKLIFRVCALHDCKRIE-- 802 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I V+D SGS+ P Q+ + L + + + Sbjct: 803 LLDIVFVLDHSGSIG-------------PREQESM----------MNLTIHLVKKADIGR 839 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + V +G + Y+ E + + + +++ R D+ T + A++ + IL + Sbjct: 840 DRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDT--GGETYTAKALQHS-NILFT 896 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 ++ S T RQ +I +TDG ++ + ++ + ++ I I + Sbjct: 897 EEHGSRLTQNVRQ-----------LMIVITDG-VSHDRDKLDEAAR--ELRDKGITIFAV 942 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + Q L+T E +V N D L ++ + + + + Sbjct: 943 GVG--NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 984 >gi|194288834|ref|YP_002004741.1| flp pilus assembly protein [Cupriavidus taiwanensis LMG 19424] gi|193222669|emb|CAQ68672.1| putative flp pilus assembly protein [Cupriavidus taiwanensis LMG 19424] Length = 418 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 14/108 (12%), Positives = 37/108 (34%), Gaps = 6/108 (5%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 +II LS+ + +G + + + K+ ++++ +A LA A + L Sbjct: 19 GAVAIIVGLSLAVLIGFVGLALDLGKLYVTKSELQNSVDACALAAARDVTGATPLLVSEA 78 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 ++ + A + E + + +T ++ Sbjct: 79 AGLTTGTRNA------ALFQGKAVEMFENLNVSYSDTPDNTFYTKDKV 120 >gi|148689167|gb|EDL21114.1| mCG140659 [Mus musculus] Length = 1670 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 80/247 (32%), Gaps = 46/247 (18%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 D R+ F ++ + + + + + + E I +VD SGS+ Sbjct: 372 DERVNFGQNFDSLKSIKN--EIVHRICSEKGCEDMKADIMFLVDSSGSIG---------- 419 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 T +K + + I + + +G++ ++ + + + + Sbjct: 420 ----------PTNFETMKTFMKNLVGKIQI---GADRSQVGVVQFSDYNREEFQLNKYST 466 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 Y D S I + T + A+ F N + K +KF+I Sbjct: 467 HEEIYAAIDRMSPINRNTLTGGAL--------------TFVNEYFDLSKGGRPQVRKFLI 512 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 LTDG+ + + + I ++ + + Q L+ ++V N Sbjct: 513 LLTDGKAQDEVGGPAM-----ALRSKSVTIFSVGVYGANRAQ--LEEISGDGSLVFHVEN 565 Query: 395 ADSLIHV 401 D L + Sbjct: 566 FDHLKAI 572 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 53/344 (15%), Positives = 121/344 (35%), Gaps = 58/344 (16%) Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIVNSSRISM 129 + +S + L A F+ + S G + E+ + R Sbjct: 475 DRMSPINRNTLTGGALTFVNEYFDLSKGGRPQVRKFLILLTDGKAQDEVGGPAMALRSKS 534 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + + +N + S + ++ +HL + K++ + AL + E Sbjct: 535 VTIFSVGVYGANRAQLEEISGDGSLVFHVENFDHL--KAIESKLIFRVCALHDCKRIE-- 590 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I V+D SGS+ P Q+ + L + + + Sbjct: 591 LLDIVFVLDHSGSIG-------------PREQESM----------MNLTIHLVKKADIGR 627 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + V +G + Y+ E + + + +++ R D+ T + A++ + IL + Sbjct: 628 DRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDT--GGETYTAKALQHS-NILFT 684 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 ++ S T RQ +I +TDG ++ + ++ + ++ I I + Sbjct: 685 EEHGSRLTQNVRQ-----------LMIVITDG-VSHDRDKLDEAAR--ELRDKGITIFAV 730 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + Q L+T E +V N D L ++ + + + + Sbjct: 731 GVG--NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 772 >gi|304412560|ref|ZP_07394165.1| von Willebrand factor type A [Shewanella baltica OS183] gi|307303576|ref|ZP_07583329.1| von Willebrand factor type A [Shewanella baltica BA175] gi|304349036|gb|EFM13449.1| von Willebrand factor type A [Shewanella baltica OS183] gi|306912474|gb|EFN42897.1| von Willebrand factor type A [Shewanella baltica BA175] Length = 627 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 72/231 (31%), Gaps = 52/231 (22%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + ++D+SGSM K+ L+ AL L + V Sbjct: 221 GASNLVFLLDVSGSMA-------------------SADKLPLLQTALKLLTAQLSAQDKV 261 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 VY G G ++ + T+ + QAYQ Sbjct: 262 SIVVYAGAAG--VVLDGASG-----NDTQTLTYALEQLSAGGSTNGGQGITQAYQ----L 310 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 K+ F N + +I TDG+ N ++ + I + +K K++ I + T+ Sbjct: 311 AKKHFIPNGINR------------VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTL 358 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNA---------DSLIHVFQNISQLM 409 +L++ +Y ++ D L I++ + Sbjct: 359 GFGLGNYNDQLMEQLADKGNGNYAYIDTLNEARKVLVDELSSTLFTIAKDV 409 >gi|255531386|ref|YP_003091758.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344370|gb|ACU03696.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 344 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 72/232 (31%), Gaps = 45/232 (19%) Query: 139 SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD 198 + T+ M ++ L+F+ +L + +IE +R + +++D Sbjct: 38 GNAYTVKKMMPEVSFGRQGLKFVFFVLAYAALVLGAANPQIGTKIEEAKRSGSDLMILLD 97 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 +S SM D ++ K A+ +D++ +G+I Sbjct: 98 VSNSMLAG---------------DLAPNRLENAKRAISQLIDNLHND-------RIGIII 135 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTR-DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 + + + + ++ D + + T A+ + S Sbjct: 136 FAGEAYVQLPITTDYSAAKLFLNNITTDIVPTQGTAIGAAIDMGMKSFNFVNGTS----- 190 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 K +I +TDGEN+ + + A + I I + Sbjct: 191 -------------KAMILMTDGENHEDDAVSAAKR----ASAKDVAIHVIGV 225 >gi|91082539|ref|XP_973726.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Tribolium castaneum] Length = 842 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 46/335 (13%), Positives = 95/335 (28%), Gaps = 28/335 (8%) Query: 95 KRFIKN----HIKESLSGYSAVFYNTEI---QNIVNSSRISMTHMANNRLDSSNNT---I 144 +F+K+ E EI + + + + +L + T Sbjct: 206 LKFVKSPPLRTGNEISKNDDKTASLAEIKQNNSTSATVKFNPNIERQKQLATGLGTKEEN 265 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 + YD + + + F P+ + + + V+D SGSM Sbjct: 266 GLAGQFVVQYDVERDPKGGEVLLKDGYFVHFFAPSEVEALPKQ-----VIFVLDTSGSMD 320 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 + + + A + K + + + V+ D +G + Sbjct: 321 ---GNRIKQLKEAMNSILSELKKEDVFNIVEFSSIVKVWNVDKVQVDYEVGEDPWPLYDS 377 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 T +V + D K + AY ++ + Sbjct: 378 PEAPQKNKTNQVLPPAYKATDENKEKAKKVVEKLN-AYGGTDIKSALEVGLKLVKKNKEN 436 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTISINASPNGQRLLKT 381 Q I+FLTDGE ++N I + I ++S + L+ Sbjct: 437 KEDAHQPIIVFLTDGEPTMGETNTEKITSAISEMNSGETRAPIFSLSF-GDGADREFLQK 495 Query: 382 CVSS----PEYHYNVVNAD-SLIHVFQNISQLMVH 411 + Y +A L ++ IS +++ Sbjct: 496 ISLKNLGFARHIYEAADASLQLQEFYKQISSPLLN 530 >gi|255602535|ref|XP_002537872.1| conserved hypothetical protein [Ricinus communis] gi|223514758|gb|EEF24510.1| conserved hypothetical protein [Ricinus communis] Length = 120 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 11/45 (24%), Positives = 24/45 (53%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES 55 + G +E F+I+ AL + + ++GF I + + +K ++S Sbjct: 16 LRPGRRAESGAFAIMAALVLPIMIAMLGFAIDLSRVYNRKVELQS 60 >gi|74185538|dbj|BAE30236.1| unnamed protein product [Mus musculus] Length = 411 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 59/167 (35%), Gaps = 33/167 (19%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 GE+ ++V+D SGSM+ + D D + K L ++ + Sbjct: 251 SPGEQQK--RKIVLDPSGSMNIYLVLDGSDSIGSSNFTG--------AKRCLTNLIEKVA 300 Query: 244 LLSHVKEDVYMGLIGYTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 GL+ Y T E++ + W TEK+ Q D + T++ Sbjct: 301 SYGVRPRY---GLLTYATVPKVLVRVSDERSSDVDWVTEKLNQ--ISYEDHKLKSGTNTK 355 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 A++ Y ++ ++ + II +TDG +N Sbjct: 356 RALQAVYSMM----------SWAGDAPPEGWNRTRHVIIIMTDGLHN 392 >gi|319902110|ref|YP_004161838.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417141|gb|ADV44252.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 342 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 63/194 (32%), Gaps = 45/194 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++E +R + + +D+S SM QD + +++ K + +D Sbjct: 81 KLETVKRQGVEVMIALDISNSMLA---------------QDVQPSRLEKAKRLVAQLVDK 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQ 300 + + V G+I + + + + ++ SLI K T A+ Sbjct: 126 MQ-----NDKV--GMIVFAGDAFTQLPITNDYISAKMFLESISPSLISKQGTAIGAAISL 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + T P + II +TDGEN + A E Sbjct: 179 ATRSFT------------------PQEGIGRAIIVITDGEN----HEGGVAEAAKTATEK 216 Query: 361 FIKIVTISINASPN 374 I++ + + Sbjct: 217 GIQVNVLGVGMPDG 230 >gi|296227520|ref|XP_002759384.1| PREDICTED: matrilin-2 [Callithrix jacchus] Length = 973 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 672 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKYM 728 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 729 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRVAIVFTDG---RAQDDV 775 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 776 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTDKHLFYAEDFSTMDEISEKLK 831 Query: 407 QLMVH 411 + + Sbjct: 832 KGICE 836 >gi|218191186|gb|EEC73613.1| hypothetical protein OsI_08104 [Oryza sativa Indica Group] Length = 709 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 73/244 (29%), Gaps = 66/244 (27%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 R + V+D+SGSM TK+A LK A+ + + Sbjct: 259 PTSRAPVDLVTVLDVSGSMAG--------------------TKLALLKRAMGFVIQHLGP 298 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQ 303 + +I +++ + + RQ + ++ L T+ A+K+A + Sbjct: 299 SD------RLSVIAFSSTARRLFHLRRMSHSGRQQALQAVNLLGAGGGTNIADALKKAAK 352 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTI----------- 351 ++ K P II L+DG + N SNV Sbjct: 353 VIEDRN------------YKNPVCS----IILLSDGQDTYNISSNVRGTRPDYRSLVPSS 396 Query: 352 ----KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-IS 406 IC + + A + L SS + + + F I Sbjct: 397 ILNHTICT------VPVHGFGFGADHDSDALHSIAESSGGTFSFIEDESVIQDAFAQCIG 450 Query: 407 QLMV 410 L+ Sbjct: 451 GLLS 454 >gi|126303712|ref|XP_001380869.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 628 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 52/169 (30%), Gaps = 33/169 (19%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD----SLILKPTDSTPAMKQAYQI 304 D +G + YT +G +K + T + A+ A + Sbjct: 479 DTDTRIGAVQYTYEQ----RLEFGFDKYSTKQDILNAIKRVNYWSGGTSTGAAINYALEH 534 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L K + +K +I +TDG +V + A +N + Sbjct: 535 LFKKSKPNK----------------RKLMILITDG---RSYDDVRIPAM--AAHQNGVIT 573 Query: 365 VTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 +I I Q L+ + P ++ + V D L I Q + Sbjct: 574 YSIGIA--WAAQDELEVIATHPTKDHSFFVDEFDDLYKSVPKIIQNICT 620 >gi|51244491|ref|YP_064375.1| hypothetical protein DP0639 [Desulfotalea psychrophila LSv54] gi|50875528|emb|CAG35368.1| hypothetical membrane protein (BatB) [Desulfotalea psychrophila LSv54] Length = 566 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 58/183 (31%), Gaps = 44/183 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +R I +D S SM QD K ++ + A++ F+ ++ Sbjct: 91 QRRGIDILFAIDTSRSMLS---------------QDLKPNRLERARYAVMDFVATL---- 131 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +GLI + + + TDS A+ + Sbjct: 132 ---GGDRVGLIPFAGSSYLMCPLTLDYQAF---------------TDSLKALDT---KII 170 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + + K +I LTDGEN +K D AK+N + I T Sbjct: 171 PRRGTNIAKVIALAEKTVADSSNHKILIILTDGENLQGD----VLKAADLAKKNGLTIYT 226 Query: 367 ISI 369 I + Sbjct: 227 IGV 229 >gi|115447343|ref|NP_001047451.1| Os02g0619600 [Oryza sativa Japonica Group] gi|47847560|dbj|BAD21612.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|47847788|dbj|BAD21564.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|113536982|dbj|BAF09365.1| Os02g0619600 [Oryza sativa Japonica Group] gi|215701433|dbj|BAG92857.1| unnamed protein product [Oryza sativa Japonica Group] gi|222623257|gb|EEE57389.1| hypothetical protein OsJ_07557 [Oryza sativa Japonica Group] Length = 709 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 73/244 (29%), Gaps = 66/244 (27%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 R + V+D+SGSM TK+A LK A+ + + Sbjct: 259 PTSRAPVDLVTVLDVSGSMAG--------------------TKLALLKRAMGFVIQHLGP 298 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQ 303 + +I +++ + + RQ + ++ L T+ A+K+A + Sbjct: 299 SD------RLSVIAFSSTARRLFHLRRMSHSGRQQALQAVNLLGAGGGTNIADALKKAAK 352 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTI----------- 351 ++ K P II L+DG + N SNV Sbjct: 353 VIEDRN------------YKNPVCS----IILLSDGQDTYNISSNVRGTRPDYRSLVPSS 396 Query: 352 ----KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-IS 406 IC + + A + L SS + + + F I Sbjct: 397 ILNHTICT------VPVHGFGFGADHDSDALHSIAESSGGTFSFIEDESVIQDAFAQCIG 450 Query: 407 QLMV 410 L+ Sbjct: 451 GLLS 454 >gi|156409367|ref|XP_001642141.1| predicted protein [Nematostella vectensis] gi|156229282|gb|EDO50078.1| predicted protein [Nematostella vectensis] Length = 193 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 28/142 (19%), Positives = 56/142 (39%), Gaps = 12/142 (8%) Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GLI Y++R V Q + R I T + AMK A + L S + Sbjct: 42 RIGLIVYSSRSYLVGGFRRYRNLRSVLQAIKRIR--YIRGGTYTGKAMKYALRKLFS--R 97 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 R+ + + + + K ++ +TDG + V T + + K+ + I ++ + Sbjct: 98 RAGYHHARVRLFRSSRKGAAKILVMITDG---ISQDRVTTPAL--RLKKMGVVIFSVGVG 152 Query: 371 ASPNGQRLLKTCVSSPEYHYNV 392 ++L++ S P + Sbjct: 153 KRYRLKQLMQI-ASRPRLVFTA 173 >gi|149018699|gb|EDL77340.1| rCG25821 [Rattus norvegicus] Length = 1513 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 73/235 (31%), Gaps = 48/235 (20%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + + E I +VD SGS+ + +KM L Sbjct: 605 VVQEICAEEACRDMKADIMFLVDSSGSIGL-----------------ENFSKMKVFMKNL 647 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + V +G++ ++ + + T + + +D + + Sbjct: 648 ------VSKSQIGADRVQIGVVQFSHENREEFQL--NTFMSQNDIANAIDQMAHIGETTL 699 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + ++ N +KF+I +TDGE + + Sbjct: 700 TGSALTFVSQYFSPEKGARPN------------VRKFLILITDGEAQDIVKDPAV----- 742 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 +++ + I ++ + S Q L+ PE + V N + Q+I +V Sbjct: 743 ALRKDGVIIYSVGVFGSNVTQ--LEEISGKPEMVFYVEN----FDILQHIEDDLV 791 Score = 36.8 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 45/343 (13%), Positives = 110/343 (32%), Gaps = 37/343 (10%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + L ++ I +A + + + I ++ S+ Sbjct: 85 NRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRDKKQFPPILVVLASAESED 144 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE---HLLNQRYNQKIVSFIP---ALLRI 183 ++ + ++ V + + L+ + N R + + F P +++ Sbjct: 145 DVEEASKALREDGVKIISVGVQKASEEDLKAMATSQFHFNLRTARDLSMFAPNMTQIIKD 204 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 R +++V+ C S + V + + + + LK L +S+ Sbjct: 205 VTQYREAMTDDIIVE-----ACQGPSVADVVFLLDMAINGSQENLDHLKAFLG---ESVS 256 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 L + + +GL+ Y+ S G + V + + D +P + QAY Sbjct: 257 ALDIKENCMRVGLVAYSNETRVISSLSMGVN--KTEVLQRIQ-------DLSPHVGQAYT 307 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + +++ F Q + + + +T + S+ N K + + Sbjct: 308 G--AALRKTRKEVFSAQRGSRKNQGVPQIAVLVT-----HRASDDNVTKAAVNLRREGVT 360 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + T+ + + Q L+ S Y S + F ++ Sbjct: 361 VFTMGVEGANPEQ--LEKIAS-----YPAEQFTSKLSNFSELA 396 >gi|124006669|ref|ZP_01691501.1| domain of unknown function protein [Microscilla marina ATCC 23134] gi|123987824|gb|EAY27515.1| domain of unknown function protein [Microscilla marina ATCC 23134] Length = 3238 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 47/153 (30%), Gaps = 31/153 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P + V+D+SGSM P D +T ++ + A + + L+ Sbjct: 2205 EPPIEVIYVLDMSGSMKW----------EYPKTDDAGKT-ISRFRAAQDALIYANSALAQ 2253 Query: 248 VKEDVYMGLIGYTTRVEK-NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 LI + + + +++ V T + M A ++L Sbjct: 2254 QGMSSRSALIVFNDTTAQVMSGFTNNFQQLNSIVENLGA--PNGGTPMSKGMLSAKELL- 2310 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 K S + ++ +TDG Sbjct: 2311 ----------------KTRSADKKPVVVLITDG 2327 >gi|115555|sp|P05099|MATN1_CHICK RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|833607|emb|CAA30915.1| cartilage matrix protein [Gallus gallus] Length = 493 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 63/174 (36%), Gaps = 26/174 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDST 295 I+ L +G+I Y + V+ T + + + + + + T + Sbjct: 62 LSRVIEGLDVGPNSTRVGVINYASAVKNEFSLK--THQTKAELLQAVQRIEPLSTGTMTG 119 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A SD + G ++ S K I +TDG + +V+ Sbjct: 120 LAIQFAISRAFSDTE----------GARLRSPNINKVAIVVTDGRPQDGVQDVS-----A 164 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 +A++ I+I I + L+ S P ++ V + + L H FQ Sbjct: 165 RARQAGIEIFAIGVG--RVDMHTLRQIASEPLDDHVDYVESYSVIEKLTHKFQE 216 >gi|118348690|ref|XP_001007820.1| U-box domain containing protein [Tetrahymena thermophila] gi|89289587|gb|EAR87575.1| U-box domain containing protein [Tetrahymena thermophila SB210] Length = 790 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 67/168 (39%), Gaps = 25/168 (14%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 ++ E +R I V+D+SGSM + + NS + T + +K+++ Sbjct: 124 SIKTPEGQQRSACDICCVIDVSGSMS----DEAKIKNSKGDIESNGLTILDLVKHSVKTI 179 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTRDMDSLILKPTDS 294 ++++D + L+ + T K + + G + + + + L T+ Sbjct: 180 INNLDERD------RLSLVAFHTNAYKITDLTPMNENGRNHAIKELEKLI---PLDSTNI 230 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + QA +++ + +++S Q V I+ TDG+ N Sbjct: 231 WDGIYQALEVVKAGQQQS--IQKGEQRVAFSQ------ILLFTDGQPN 270 >gi|27376088|ref|NP_767617.1| hypothetical protein bll0977 [Bradyrhizobium japonicum USDA 110] gi|27349227|dbj|BAC46242.1| bll0977 [Bradyrhizobium japonicum USDA 110] Length = 754 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 36/244 (14%), Positives = 66/244 (27%), Gaps = 69/244 (28%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + + P + + V+D SGSM T + Sbjct: 334 DYLLAFVTPPSAEQATQKPLPREVVFVIDNSGSMGG--------------------TSIV 373 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-----WGTEKVRQYVTRDM 284 K +LL L + +I + ++ S + +V+ Sbjct: 374 QAKASLLYALGRLQPAD------RFNVIRFDDTMDVLFPASVPADAAHVGEATSFVSALQ 427 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T+ PAM+ A KI + ++FLTDG N Sbjct: 428 A---RGGTEMVPAMRAALTD------------------KIGDTGMVRQVVFLTDGAIGNE 466 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + TI A ++ + I S+P + ++ F + Sbjct: 467 QQLFETI----TAMRGRSRVFMVGI-------------GSAPNTYLMTRASELGRGAFTH 509 Query: 405 ISQL 408 I + Sbjct: 510 IGSV 513 >gi|270007560|gb|EFA04008.1| hypothetical protein TcasGA2_TC014157 [Tribolium castaneum] Length = 805 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 46/335 (13%), Positives = 95/335 (28%), Gaps = 28/335 (8%) Query: 95 KRFIKN----HIKESLSGYSAVFYNTEI---QNIVNSSRISMTHMANNRLDSSNNT---I 144 +F+K+ E EI + + + + +L + T Sbjct: 147 LKFVKSPPLRTGNEISKNDDKTASLAEIKQNNSTSATVKFNPNIERQKQLATGLGTKEEN 206 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 + YD + + + F P+ + + + V+D SGSM Sbjct: 207 GLAGQFVVQYDVERDPKGGEVLLKDGYFVHFFAPSEVEALPKQ-----VIFVLDTSGSMD 261 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 + + + A + K + + + V+ D +G + Sbjct: 262 ---GNRIKQLKEAMNSILSELKKEDVFNIVEFSSIVKVWNVDKVQVDYEVGEDPWPLYDS 318 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 T +V + D K + AY ++ + Sbjct: 319 PEAPQKNKTNQVLPPAYKATDENKEKAKKVVEKLN-AYGGTDIKSALEVGLKLVKKNKEN 377 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTISINASPNGQRLLKT 381 Q I+FLTDGE ++N I + I ++S + L+ Sbjct: 378 KEDAHQPIIVFLTDGEPTMGETNTEKITSAISEMNSGETRAPIFSLSF-GDGADREFLQK 436 Query: 382 CVSS----PEYHYNVVNAD-SLIHVFQNISQLMVH 411 + Y +A L ++ IS +++ Sbjct: 437 ISLKNLGFARHIYEAADASLQLQEFYKQISSPLLN 471 >gi|163849427|ref|YP_001637471.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222527431|ref|YP_002571902.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163670716|gb|ABY37082.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222451310|gb|ACM55576.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 851 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 34/237 (14%), Positives = 68/237 (28%), Gaps = 50/237 (21%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPAL-----LRIEMGERPIFLIELVVDLSGSMHCAMNS 209 D + + IV ++ +I +RP +++D +GSM N Sbjct: 367 DRQGTACSATPPPPEDTSIVRLTGSVELWPEYQIVANDRPPVQYVVILDATGSMSMNFNG 426 Query: 210 D---PEDVNSA------------PICQDKK---RT-----KMAALKNALLLFLDSIDL-- 244 P C T ++ K A+ L + ++ Sbjct: 427 QGKLNGQTRQCITGPPGWPFPDDPSCDGSPNFAWTPREERRIYVAKEAVRLLIKQTNMPG 486 Query: 245 ---LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM---------DSLILKPT 292 + M ++ + V ++ G + R + L T Sbjct: 487 NPGYDPTRPIDQMAIVWFNQDVPESRIMLGGFSSDPAALDRAVLDAGKVNNDPYLSQGGT 546 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 ++ A + Q+L + +++ II +TDG N F + N Sbjct: 547 NTAGAFYRVGQLLGRAPTGTTQLGREW--------TYRRAIILVTDGLANIFLNTSN 595 >gi|297683708|ref|XP_002819511.1| PREDICTED: collagen alpha-1(XXII) chain-like [Pongo abelii] Length = 259 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 57/161 (35%), Gaps = 23/161 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMK 299 +D + +G++ Y+ R E G ++ V L T++ A++ Sbjct: 40 VDTFEVGPDRTRVGVVRYSDRPTTAFEL--GLFGSQEEVKAAARRLAYHGGNTNTGDALR 97 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 RSF R G + +++ I LTDG + + + Sbjct: 98 Y-------ITARSFSP---RTGGRPGDRAYKQVAILLTDGRSQDLVLDAAAAA-----HR 142 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 I+I + + + L+ S P + ++V + +++ Sbjct: 143 AGIRIFAVGVG--EALKEELEEIASEPKSAHVFHVSDFNAI 181 >gi|218198427|gb|EEC80854.1| hypothetical protein OsI_23472 [Oryza sativa Indica Group] Length = 604 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 75/209 (35%), Gaps = 32/209 (15%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D+SGSM+ + + PE + + T++ LK ++ + +D Sbjct: 46 IDVVAVLDVSGSMNDPVAASPE--------SNLQATRLDVLKASMKFIIRKLDDGD---- 93 Query: 251 DVYMGLIGYTT---RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + ++ + + + V+ D S+ K D A + L Sbjct: 94 --RLSIVAFNDGPVKEYSSGLLD---------VSGDGRSIAGKKIDRLQARGGSGSALML 142 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + RQG + FI+ LTDG++ + K + T Sbjct: 143 ELQEAVKILDERQGNSRNRVG---FILLLTDGDDTTGFRWSRDVIHGAVGK---YPVHTF 196 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNAD 396 ++ A+ + + LL S + V + + Sbjct: 197 ALGAAHDPEALLHIAQESRGTYSFVDDGN 225 >gi|241554201|ref|YP_002979414.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240863507|gb|ACS61169.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 706 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 58/188 (30%), Gaps = 43/188 (22%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 I P + ++D+SGSM + K+ LK+A L ++ + Sbjct: 330 IAPATTPRANLVFLIDVSGSMD-------------------EPDKLPLLKSAFRLMVNRL 370 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 V Y G K+ + R ++ AY Sbjct: 371 KADDTVSIVTYAG----NAGTVLAPTRVAEKSKILSAIDRLEPGGSTGG---AEGIEAAY 423 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENF 361 + +QG + ++ TDG+ N S+ + +I ++ +++ Sbjct: 424 -------------DLAKQGFVKDGVNR---VMLATDGDFNVGPSSDGDLKRIIEEKRKDG 467 Query: 362 IKIVTISI 369 I + + Sbjct: 468 IFLTVLGF 475 >gi|311268548|ref|XP_003132103.1| PREDICTED: collagen alpha-5(VI) chain-like [Sus scrofa] Length = 2519 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 39/299 (13%), Positives = 91/299 (30%), Gaps = 49/299 (16%) Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 N + L + ++ R + + +N T + Q Sbjct: 539 NKVPCYLIVLTDGMSMDDVLEPAEKLRAENIAVHAIGIGEANRTQLQQIAGKEERVSFGQ 598 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 + L N + + ++ + E I +VD SGS+ Sbjct: 599 NFDSLKNIKNE-----VLHSICTEKGCEDMKADIMFLVDSSGSIG--------------- 638 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K+ + L I + + +G++ ++ ++ + T ++ Sbjct: 639 -----PENFRTMKSFMKNLLAKIQI---GLDKTRIGVVQFSDVTKEEFKLD--TYFTQKE 688 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 ++ ++ + ++Q LT S F K +KF+I +TDG Sbjct: 689 ISDAIERM--------SPIEQ--NTLTGKALTSIEPYFTES--KGARSMVRKFLILITDG 736 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 E + N + ++ + I + + + Q L+ + V + L Sbjct: 737 EAQDDVRNPAKV-----LRDKGVVIFAVGVFRANRTQ--LEEISGDSSLVFQVESFSDL 788 >gi|222635792|gb|EEE65924.1| hypothetical protein OsJ_21784 [Oryza sativa Japonica Group] Length = 578 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 75/209 (35%), Gaps = 32/209 (15%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D+SGSM+ + + PE + + T++ LK ++ + +D Sbjct: 46 IDVVAVLDVSGSMNDPVAASPE--------SNLQATRLDVLKASMKFIIRKLDDGD---- 93 Query: 251 DVYMGLIGYTT---RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + ++ + + + V+ D S+ K D A+ L Sbjct: 94 --RLSIVAFNDGPVKEYSSGLLD---------VSGDGRSIAGKKIDRLQAVVAVALRLCP 142 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + RQG + FI+ LTDG++ + K + T Sbjct: 143 ELQEAVKILDERQGNSRNRVG---FILLLTDGDDTTGFRWSRDVIHGAVGK---YPVHTF 196 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNAD 396 ++ A+ + + LL S + V + + Sbjct: 197 ALGAAHDPEALLHIAQESRGTYSFVDDGN 225 >gi|41407305|ref|NP_960141.1| hypothetical protein MAP1207 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118463234|ref|YP_882479.1| hypothetical protein MAV_3297 [Mycobacterium avium 104] gi|81414471|sp|Q740Y5|Y1207_MYCPA RecName: Full=UPF0353 protein MAP_1207 gi|41395657|gb|AAS03524.1| hypothetical protein MAP_1207 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118164521|gb|ABK65418.1| protein Nfa34780 [Mycobacterium avium 104] Length = 335 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 42/234 (17%), Positives = 71/234 (30%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D +MAA + A F D + + Sbjct: 99 VMLVIDVSQSM---------------RATDVAPNRMAAAQEAAKQFADELTPG------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ E + + + + T + + A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR---TATGEGIFTALQAIATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNTIKICDKAKENFIKIVTISI 369 + P I+ +DG+ N + AK+ + I TIS Sbjct: 194 ---------IGGGDKPPPARIVLFSDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISF 244 Query: 370 NAS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P LK S YN + L V+ + Q + Sbjct: 245 GTPYGFVEINDQRQPVPVDDETLKKVAQLSGGNAYNAASLQELKSVYATLQQQI 298 >gi|163848161|ref|YP_001636205.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526064|ref|YP_002570535.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669450|gb|ABY35816.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222449943|gb|ACM54209.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 905 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 61/216 (28%), Gaps = 47/216 (21%) Query: 185 MGERPIFLIEL--VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 F + L V+D SGSM ++ T++ + A Sbjct: 404 PSREERFDLALTLVIDRSGSMSELVDGLR--------------TQLDLAREAA------F 443 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + + +I + + + + + T+ M A Sbjct: 444 QASLGLSRQDQLSIIAFDSVADVILPLQ-PLPDLATIEDALSRLSAGGGTNIRSGMALAA 502 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + R + +I LTDG S + + + Sbjct: 503 ETIATADARI------------------RHVILLTDG-----VSETEYADLVANLRAQGV 539 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + T++I + + L + +Y V A++L Sbjct: 540 TVSTVAIGLNTD-PELERVAQIGGGKYYVVRQAEAL 574 >gi|254492197|ref|ZP_05105371.1| type I secretion target GGXGXDXXX repeat protein domain protein [Methylophaga thiooxidans DMS010] gi|224462522|gb|EEF78797.1| type I secretion target GGXGXDXXX repeat protein domain protein [Methylophaga thiooxydans DMS010] Length = 2740 Score = 41.0 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 51/297 (17%), Positives = 100/297 (33%), Gaps = 45/297 (15%) Query: 58 NAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE 117 ++ L+ + ++ GD ++I + LI ++ +S + VF T Sbjct: 1952 DSVSLSTTADTTGLMTHDGDAVDTIWDSGSNTLIG----YVAGT--DSSDAANQVFTVTL 2005 Query: 118 IQNIVNSSRISMTHM--ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 + + +T + + + N + + ++V + + I N + + Sbjct: 2006 TNINASGADYDVTLLQAVKHPDADNENNVDFTVNVSVLDNDGSEGITSF-NVSIDDDVPV 2064 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 P E + +++D+SGSM+ + N + T++ + + Sbjct: 2065 ASPEENSGEATLEVNTNLMMILDVSGSMNDSANF-------------QGMTRLQVMIKSS 2111 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L LD D DV + +I + T PS G V Q + T+ Sbjct: 2112 LELLDQYDAY----GDVMVNIITFAT---SASNPSGGWVTVDQAKAIILGLTAGGNTNYD 2164 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTI 351 A+ A K N F++DGE N+N SN T+ Sbjct: 2165 DALNDAINAFALGGKLGDGQNISY---------------FMSDGEPNSNNVSNSATV 2206 >gi|62088356|dbj|BAD92625.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) variant [Homo sapiens] Length = 699 Score = 41.0 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 52/160 (32%), Gaps = 35/160 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + AL+ LD + V Sbjct: 260 VVFVIDKSGSMSGR--------------------KIQQTREALIKILDDLSPRDQFNLIV 299 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + R + K R + L T+ AM A Q+L S + Sbjct: 300 FSTEATQ-WRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDSSNQ-- 353 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + S+ II LTDG+ ++N +I+ Sbjct: 354 ------EERLPEGSVSL---IILLTDGDPTVGETNPRSIQ 384 >gi|291436333|ref|ZP_06575723.1| von Willebrand factor [Streptomyces ghanaensis ATCC 14672] gi|291339228|gb|EFE66184.1| von Willebrand factor [Streptomyces ghanaensis ATCC 14672] Length = 424 Score = 41.0 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 42/230 (18%), Positives = 77/230 (33%), Gaps = 32/230 (13%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 +V+F E R + L++D+SGSM +++MAA K Sbjct: 22 LVAFPAGAAADEPTGRDAPKVNLLLDVSGSMRAR--------------DIDGQSRMAAAK 67 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A L ++V +G+ + + + Y + P Sbjct: 68 QAFNEVL------DATPKEVELGIRTLGANYAGDDRKEGCKDTAQLY--------PVGPL 113 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 D T A K A LT + + + K I+ ++DGE+ + + Sbjct: 114 DRTEA-KAAVATLTPTGWTPIGPSLLKAADDLEGGNGSKRIVLISDGEDTCAPLDPCEVA 172 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 AK + I T+ + + R L C++ + +V + D L Sbjct: 173 REIAAKGIGLTIDTLGLVPNAKLSRQLS-CIAEATGGTYASVEHQDELTD 221 >gi|239928001|ref|ZP_04684954.1| secreted protein [Streptomyces ghanaensis ATCC 14672] Length = 417 Score = 41.0 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 42/230 (18%), Positives = 77/230 (33%), Gaps = 32/230 (13%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 +V+F E R + L++D+SGSM +++MAA K Sbjct: 15 LVAFPAGAAADEPTGRDAPKVNLLLDVSGSMRAR--------------DIDGQSRMAAAK 60 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A L ++V +G+ + + + Y + P Sbjct: 61 QAFNEVL------DATPKEVELGIRTLGANYAGDDRKEGCKDTAQLY--------PVGPL 106 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 D T A K A LT + + + K I+ ++DGE+ + + Sbjct: 107 DRTEA-KAAVATLTPTGWTPIGPSLLKAADDLEGGNGSKRIVLISDGEDTCAPLDPCEVA 165 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 AK + I T+ + + R L C++ + +V + D L Sbjct: 166 REIAAKGIGLTIDTLGLVPNAKLSRQLS-CIAEATGGTYASVEHQDELTD 214 >gi|149601522|ref|XP_001515070.1| PREDICTED: similar to integrin alpha 2 subunit, partial [Ornithorhynchus anatinus] Length = 329 Score = 41.0 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 65/190 (34%), Gaps = 30/190 (15%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 NA+ FL+ + L +GLI Y + + V + Sbjct: 127 NAVKNFLEKFVQGLDIGPTKTQVGLIQYGNEPRVVFNM-NKFKTKEEMVQETSRTNQNGG 185 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T++ A++ A Q S K ++ +TDGE+++ + Sbjct: 186 SLTNTFKAIEFARQNAFSLA-------------SGGRPHATKVMVVVTDGESHDGSNLKK 232 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLI 399 I C+ ++ I I++ L +K SSP +Y +NV + +L+ Sbjct: 233 VIDQCE---QDNIIRFGIAVLGYLLRNELDTKNLITEIKAIASSPTEKYFFNVSDEAALL 289 Query: 400 HVFQNISQLM 409 + + + Sbjct: 290 EKAGTLGERI 299 >gi|146284567|ref|YP_001165520.1| von Willebrand factor, type A [Enterobacter sp. 638] gi|145320700|gb|ABP62846.1| von Willebrand factor, type A [Enterobacter sp. 638] Length = 401 Score = 41.0 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 36/282 (12%), Positives = 90/282 (31%), Gaps = 43/282 (15%) Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQ-----FIEHLLNQRYNQKIVSFIPALLRIEM 185 + + + S ++ + + L L Q I + + Sbjct: 136 AVPVSPVASKGRRWWWLLLALLVLLAFLLLRGCMPTAKLPVQGAPAPAPLPIKQICPAKR 195 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPED-----VNSAPICQDKKRTKMAALKNALLLFLD 240 ++ + L+ D SGSM +M+ P++ + D++ +++ + +D Sbjct: 196 TKQQAPEMVLIFDASGSMSISMDITPDELRRLMQDRPVKNFDREPRRISLAHRSAKQLID 255 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + +D+ + L+ T + ++ P+ + + + + T A+++ Sbjct: 256 EV------PKDMDISLVSAATCQQVSVTPALSFAQRDELKYAIDNIQPVGKTALAEALEK 309 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A G + + I+ +TDGE + + K Sbjct: 310 A-------------------GKLVDGVDRDAIIVLITDGEETCGGDPCVVAQQLKQQKPR 350 Query: 361 FIKIVTISINASPNGQRLLKTCVSS--PEYHYNVVNADSLIH 400 +++ + I + G C++S Y V N Sbjct: 351 -LQVNVVDIMNTGAG-----NCIASQTGGSVYAVNNTHEFNE 386 >gi|118353830|ref|XP_001010180.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291947|gb|EAR89935.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 544 Score = 40.6 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 33/229 (14%), Positives = 76/229 (33%), Gaps = 52/229 (22%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 A+ + + RP + V+D SGSM K+ +K L Sbjct: 109 AVKQESLENRPNLDLICVIDNSGSMSGE--------------------KIQNVKKTLEYL 148 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP--SWGTEK--VRQYVTRDMDSLILKPTDS 294 L+ + + LI + + + + + K ++ + + TD Sbjct: 149 LELLGDND------RLCLILFNSYSTRLCHLMKTNNSNKPAFKEIINKIQA---TGGTDI 199 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 M+ A+++L K ++ ++ I L+DG+ + +++ + Sbjct: 200 NSGMELAFRVLKERKYQNPVSS----------------IFLLSDGQ--DGSADLRVRQSL 241 Query: 355 DK-AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ + I + + +G + K C Y V + + F Sbjct: 242 ERHLPQECFTIHSFGFGSDHDGPLMNKICSLKDGNFYYVEKINQVDEFF 290 >gi|160896215|ref|YP_001561797.1| von Willebrand factor type A [Delftia acidovorans SPH-1] gi|160361799|gb|ABX33412.1| von Willebrand factor type A [Delftia acidovorans SPH-1] Length = 536 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 45/266 (16%), Positives = 86/266 (32%), Gaps = 44/266 (16%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM-HCAMN 208 S + L + I + + A RP I V+D+SGSM + Sbjct: 301 AQASPALPTAPVVELSFPNRLEVIDAVLSAYQSDL--RRPATSI-FVLDVSGSMKGARLA 357 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE-KNI 267 E + + ++ A A L L+ + GL+G RV+ Sbjct: 358 QMKEALKLLSGAEASAASQRYAAFQARERVL----LIP------FSGLVGQPARVQFAAG 407 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + +V Y + T A+ A Q Q + Sbjct: 408 DLQAASAQVLAYADSLVAD---GGTAIYDALTLAQQ----------------QARQELRA 448 Query: 328 PFQKF--IIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTISINASPNGQRLLKTC 382 ++F I+ LTDG N + + A++ +++ I + +G+ ++ Sbjct: 449 DPERFVSIVLLTDGANTAGRDWAAFEREQRMARDGGAPLVRVFPIIFGEAQSGE--MQAL 506 Query: 383 VS-SPEYHYNVVNADS--LIHVFQNI 405 + + ++ N L VF+ I Sbjct: 507 AALTGGRAFDARNTGKSGLPLVFKEI 532 >gi|269125771|ref|YP_003299141.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268310729|gb|ACY97103.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 601 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 36/231 (15%), Positives = 75/231 (32%), Gaps = 35/231 (15%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 +L+ R + LV+D SGSM ++K K A L Sbjct: 397 ILKTWAELRKPANVLLVIDRSGSMQ-------------QTVPGTGKSKGDLAKEAAAEAL 443 Query: 240 DSIDLLSHVKEDVYMGLIGYTTR-VEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPA 297 V V+ R ++ + TE R + R + + T Sbjct: 444 AEFRGQDQVGLWVFSAARRQGERDWQEVVPLGRMTEAHRSLLRERLLGLTLSGGTGLYNT 503 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-NVN-TIKICD 355 AY+ +T ++ ++ +TDG+N +++ I Sbjct: 504 TAAAYEKMTGSRRGDAINA----------------VVVMTDGKNERPGGLDLDGLIAKLG 547 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 +E +++ TI Q +L+ ++ Y+ + +++ +F + Sbjct: 548 ARREESVRVFTIGY-GEDADQNVLRRIAEAADGAAYDSSDPNTIGDIFTEV 597 >gi|212634223|ref|YP_002310748.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212555707|gb|ACJ28161.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 342 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 30/214 (14%), Positives = 71/214 (33%), Gaps = 29/214 (13%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 R+Q + LL+ ++ + + ++ + + VDLS SM P + S Sbjct: 59 RIQALALLLSWIVITLCIARPVLIGEPIVTQKAGRDMMIAVDLSQSMEQKDYLLPSEEPS 118 Query: 217 A--PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + +++ ALK+ L F ++ +GLI + + + + Sbjct: 119 QDGQLKAASNVSRLVALKSLLSSFSQQ-------RDGDRLGLIVFGSGAYLQVPFTEDVR 171 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + + + + T A+ + + Q+ ++ Sbjct: 172 LWQTLLEQMDTQMAGPATAIGDAVGLSIRAFERSNTS------------------QRILL 213 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 +TDG + + + + A I+I T+ Sbjct: 214 LVTDGSDTSSRLDPVDAARVAAA--EGIEIFTLG 245 >gi|327193756|gb|EGE60633.1| hypothetical protein RHECNPAF_136001 [Rhizobium etli CNPAF512] Length = 433 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 66/204 (32%), Gaps = 45/204 (22%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++ K++ I P + ++D+SGSM + K Sbjct: 42 NHDTKLMHVAIKGYDIAPATAPHANLVFLIDVSGSMD-------------------EPDK 82 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + LK+A L ++ + V Y G G V + + + + +D L Sbjct: 83 LPLLKSAFRLLVNRLKADDTVSIVTYAGNAG---TVLEPTRVA-----EKSKILSAIDKL 134 Query: 288 ILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFK 345 T ++ AY L N ++ TDG+ N Sbjct: 135 EAGGSTGGAEGIEAAYD-LAKKAFVKDGVNR---------------VMLATDGDFNVGPS 178 Query: 346 SNVNTIKICDKAKENFIKIVTISI 369 S+ + +I ++ +++ I + + Sbjct: 179 SDEDLKRIIEEKRKDGIFLTVLGF 202 >gi|149773083|emb|CAO01891.1| collagen typeVI alpha 5 [Mus musculus] Length = 1212 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 81/247 (32%), Gaps = 46/247 (18%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 D R+ F ++ + + + + + + E I +VD SGS+ Sbjct: 625 DERVNFGQNFDSLKSIKN--EIVHRICSEKGCEDMKADIMFLVDSSGSIG---------- 672 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 T +K + + I + + +G++ ++ + + + + Sbjct: 673 ----------PTNFETMKTFMKNLVGKIQI---GADRSQVGVVQFSDYNREEFQLNKYST 719 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 Y D S I + T + A+ F N + K +KF+I Sbjct: 720 HEEIYAAIDRMSPINRNTLTGGAL--------------TFVNEYFDLSKGGRPQVRKFLI 765 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 LTDG+ + T + + I ++ + + Q L+ ++V N Sbjct: 766 LLTDGKAQDEVGGPAT-----ALRSKSVTIFSVGVYGANRAQ--LEEISGDGSLVFHVEN 818 Query: 395 ADSLIHV 401 D L + Sbjct: 819 FDHLKAI 825 Score = 37.2 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 50/342 (14%), Positives = 114/342 (33%), Gaps = 54/342 (15%) Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIVNSSRISM 129 + +S + L A F+ + S G + E+ + R Sbjct: 728 DRMSPINRNTLTGGALTFVNEYFDLSKGGRPQVRKFLILLTDGKAQDEVGGPATALRSKS 787 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + + +N + S + ++ +HL + K++ + AL + E Sbjct: 788 VTIFSVGVYGANRAQLEEISGDGSLVFHVENFDHL--KAIESKLIFRVCALHDCKRIE-- 843 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I V+D SGS+ P Q+ + L + + + Sbjct: 844 LLDIVFVLDHSGSIG-------------PREQESM----------MNLTIHLVKKADVGR 880 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + V +G + Y+ E + + T + A++ + +L +++ Sbjct: 881 DRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NVLFTEE 939 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 S T RQ +I +TDG ++ + ++ + ++ I I + + Sbjct: 940 HGSRLTQNVRQ-----------LMIVITDG-VSHDRDKLDEAAR--ELRDKGITIFAVGV 985 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 Q L+T E +V N D L ++ + + + + Sbjct: 986 G--NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 1025 >gi|291527684|emb|CBK93270.1| Mg-chelatase subunit ChlD [Eubacterium rectale M104/1] Length = 1237 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 78/218 (35%), Gaps = 48/218 (22%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM + D+ ++ + +L+ ++ E+ + Sbjct: 613 LVIDDSGSM---------------VSNDRYNQRLTVAQ----------NLIDNLPENSKV 647 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 G++ +T+ K ++ + T +M A S + + Sbjct: 648 GVVKFTSSTTKLTTSLTSDKETAKSYLTTSYFRSSGGT----SMYTAINSSFSMFEATDD 703 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 K +I L+DG + + + + A N +KI T+ + +S + Sbjct: 704 -------------NILKMMIVLSDGATSYTYLHSSVVT---TANNNGVKIYTVGLGSSSS 747 Query: 375 GQ--RLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLM 409 + LK ++ Y +A L ++++I++ + Sbjct: 748 SYFTQYLKPLANNTGGAFYLASDASQLEDIYKDINKKI 785 >gi|225449026|ref|XP_002273050.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 710 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 79/247 (31%), Gaps = 49/247 (19%) Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ 167 + + NSS + NN + + + + F + + Sbjct: 176 NDDESLDHQPVPAERNSSNGN--AAENNPVRTVEIKTYPEVSAAPRSKSYDNFTVLVHLK 233 Query: 168 RYNQKIVSFIPALLRIEM----GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 I + R + V+D+SGSM Sbjct: 234 AAVANTGQNIQRNMSNSPLNSHNPRAPVDLVTVLDISGSMAG------------------ 275 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 TK+A LK A+ + ++ + +I +++ + T+ RQ + Sbjct: 276 --TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLRRMTDAGRQQALQA 327 Query: 284 MDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 ++SL+ T+ +++ +++ K+R+ ++ II L+DG++ Sbjct: 328 VNSLVANGGTNIAEGLRKGAKVMEDRKERNPVSS----------------IILLSDGQDT 371 Query: 343 NFKSNVN 349 + + Sbjct: 372 YTVNGSS 378 >gi|222623880|gb|EEE58012.1| hypothetical protein OsJ_08791 [Oryza sativa Japonica Group] Length = 759 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 30/236 (12%), Positives = 69/236 (29%), Gaps = 54/236 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + V+D+SGSM TK+A LK A+ + ++ Sbjct: 313 TRAPVDLITVLDVSGSMAG--------------------TKLALLKRAMGFVIQNLGSSD 352 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + +I +++ + T +Q + T+ +++ +++ Sbjct: 353 ------RLSVIAFSSSARRLFPLRRMTETGRQQSLQAVYSLTSNGGTNIAEGLRKGSKVI 406 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-----IC-----D 355 + ++ + II L+DG++ S + C Sbjct: 407 EDRQAKNPVCS----------------IILLSDGQDTYTVSPTAGVHKAAPEYCSLLPYT 450 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV 410 + + A + L +S + ++ F I L+ Sbjct: 451 SNGCQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAIQDAFAQCIGGLLS 506 >gi|15451571|gb|AAK98695.1|AC069158_7 Hypothetical protein protein containing a von Willebrand factor type A domain [Oryza sativa Japonica Group] Length = 714 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 30/236 (12%), Positives = 69/236 (29%), Gaps = 54/236 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + V+D+SGSM TK+A LK A+ + ++ Sbjct: 268 TRAPVDLITVLDVSGSMAG--------------------TKLALLKRAMGFVIQNLGSSD 307 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + +I +++ + T +Q + T+ +++ +++ Sbjct: 308 ------RLSVIAFSSSARRLFPLRRMTETGRQQSLQAVYSLTSNGGTNIAEGLRKGSKVI 361 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-----IC-----D 355 + ++ + II L+DG++ S + C Sbjct: 362 EDRQAKNPVCS----------------IILLSDGQDTYTVSPTAGVHKAAPEYCSLLPYT 405 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV 410 + + A + L +S + ++ F I L+ Sbjct: 406 SNGCQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAIQDAFAQCIGGLLS 461 >gi|48425220|pdb|1PT6|A Chain A, I Domain From Human Integrin Alpha1-Beta1 gi|48425221|pdb|1PT6|B Chain B, I Domain From Human Integrin Alpha1-Beta1 Length = 213 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 70/188 (37%), Gaps = 26/188 (13%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +++ FL D + + + +G++ Y V + + + Sbjct: 24 DSVTAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAAKKIVQRGG 82 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + A+ T+ K+ R+G +K ++ +TDGE N Sbjct: 83 RQTMTALGTD----TARKEAFTEARGARRG-------VKKVMVIVTDGE---SHDNHRLK 128 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHV 401 K+ ++ I+ +I+I S N L +K+ S P ++ +NV + +L+ + Sbjct: 129 KVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 188 Query: 402 FQNISQLM 409 + + + + Sbjct: 189 VKTLGERI 196 >gi|115449371|ref|NP_001048450.1| Os02g0806700 [Oryza sativa Japonica Group] gi|47497349|dbj|BAD19389.1| zinc finger-like [Oryza sativa Japonica Group] gi|113537981|dbj|BAF10364.1| Os02g0806700 [Oryza sativa Japonica Group] gi|215701428|dbj|BAG92852.1| unnamed protein product [Oryza sativa Japonica Group] Length = 723 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 30/236 (12%), Positives = 69/236 (29%), Gaps = 54/236 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + V+D+SGSM TK+A LK A+ + ++ Sbjct: 277 TRAPVDLITVLDVSGSMAG--------------------TKLALLKRAMGFVIQNLGSSD 316 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + +I +++ + T +Q + T+ +++ +++ Sbjct: 317 ------RLSVIAFSSSARRLFPLRRMTETGRQQSLQAVYSLTSNGGTNIAEGLRKGSKVI 370 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-----IC-----D 355 + ++ + II L+DG++ S + C Sbjct: 371 EDRQAKNPVCS----------------IILLSDGQDTYTVSPTAGVHKAAPEYCSLLPYT 414 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV 410 + + A + L +S + ++ F I L+ Sbjct: 415 SNGCQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAIQDAFAQCIGGLLS 470 >gi|326505132|dbj|BAK02953.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 521 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 24/236 (10%), Positives = 69/236 (29%), Gaps = 53/236 (22%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + + R + +V+D+SGSM + K+ ++K Sbjct: 40 VTAVVELNATSSTAVREGLDLVVVLDVSGSM--------------------RGEKLQSMK 79 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KP 291 A+ + + + + ++ +++ ++ T+ + + +D L+ Sbjct: 80 RAMQFVIMKLTPVD------RLSVVSFSSSATRHCPLRSVTQAAQAELKGIVDGLVANGG 133 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T+ + A ++ T + ++DG+ + + Sbjct: 134 TNIKAGLDTALAVIAGRATTKARTPN---------------VFLMSDGQQTDGDARQVD- 177 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVFQNI 405 + + T LL S +V + ++ F + Sbjct: 178 -------PGNVAVYTFGF-GKDADHALLSDVAKKSPGGTFNSVPDGGNVSAPFSQL 225 >gi|291401551|ref|XP_002717040.1| PREDICTED: anthrax toxin receptor 2 [Oryctolagus cuniculus] Length = 486 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 68/240 (28%), Gaps = 52/240 (21%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + R F + V+D SGS+ ++ L Sbjct: 34 QEQPSCRTAFDLYFVLDKSGSVA------------------NNWIEIYDFVQKLTERF-- 73 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 V ++ + I ++++ + + K+ + + + T +K A Sbjct: 74 ------VSPEMRLSFIVFSSQATIILPLTGDRGKITKGLEDLKSVSPVGETYIHEGLKLA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKE 359 Q K L II LTDG + KI ++ Sbjct: 128 ----------------NEQIQKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKI---SRS 168 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + Q L+ S E + V + I ++ + + IL+ Sbjct: 169 LGASVYCVGVL--DFEQAQLEKIADSKEQVFPVKGGFQ---ALKGIINSILAQSCTEILE 223 >gi|237748397|ref|ZP_04578877.1| conserved hypothetical protein [Oxalobacter formigenes OXCC13] gi|229379759|gb|EEO29850.1| conserved hypothetical protein [Oxalobacter formigenes OXCC13] Length = 736 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 63/173 (36%), Gaps = 37/173 (21%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL-LSHVK 249 + + LV+D+S SM+ + D T++A+ AL +DSI + Sbjct: 149 YNLSLVLDVSNSMYAGL--------------DNGNTRLASSVEALNNLIDSIAAQTAQGG 194 Query: 250 EDVYMGLIGYTTRVEKNIEPSW------GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 V + L+G+ + E SW +++ YV + + S + A++ Sbjct: 195 ITVNIQLVGFHS---SAFENSWLSVTQDNVNELKSYVNELANHIRDGG-ASNTNYQAAFE 250 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFII---FLTDGENNNFKSNVNTIKI 353 + +F Q +P + + F++DG N N I Sbjct: 251 EV---------CQWFAQQSAVPPVQGTDVVNKVFFISDGVPNAITVNGAVINY 294 >gi|284799403|ref|ZP_06390123.1| PilC protein [Neisseria subflava NJ9703] gi|284797751|gb|EFC53098.1| PilC protein [Neisseria subflava NJ9703] Length = 1126 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 35/77 (45%), Gaps = 3/77 (3%) Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVN 394 +D ++ + + K D +K+ ++ T+ S G+ L+ S P++++N Sbjct: 343 SDAAGKSWDGDPSDPKGIDYSKQL-VQTFTVGFGEGISEVGREYLEKGASRPDWYFNAAK 401 Query: 395 ADSLIHVFQNISQLMVH 411 + L+ F+ I + + Sbjct: 402 KEDLLEAFKTIVDNIEN 418 >gi|224090451|ref|XP_002195054.1| PREDICTED: similar to Integrin alpha-2 [Taeniopygia guttata] Length = 1178 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 59/185 (31%), Gaps = 20/185 (10%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +A+ FL + L +GLI Y T + + V + M+ K Sbjct: 184 DAVRAFLKKFVQGLDIGLNKTQVGLIQYANDPRVVFNL--NTYQTKDEVVKAMEKTFQKG 241 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 D T K + K ++ +TDGE+++ + I Sbjct: 242 GDLTNTFKA----------IDNARQYAFSAESGGRPTATKVMVVVTDGESHDGSNLKTVI 291 Query: 352 KIC--DKAKENFIKIVTISINASPNGQRLLKTCVSSPEY-----HYNVVNADSLIHVFQN 404 C D I ++ I + + L+K + +NV + +L+ Sbjct: 292 GKCNEDNITRFGIAVLGYLIRHELDTKNLIKEIKGIASHPTEKYFFNVSSEAALLEEAGT 351 Query: 405 ISQLM 409 + + + Sbjct: 352 LGERI 356 >gi|147815707|emb|CAN70517.1| hypothetical protein VITISV_016246 [Vitis vinifera] Length = 715 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 79/247 (31%), Gaps = 49/247 (19%) Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ 167 + + NSS + NN + + + + F + + Sbjct: 181 NDDESLDHQPVPAERNSSNGN--AAENNPVRTVEIKTYPEVSAAPRSKSYDNFTVLVHLK 238 Query: 168 RYNQKIVSFIPALLRIEM----GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 I + R + V+D+SGSM Sbjct: 239 AAVANTGQNIQRNMSNSPLNSHNPRAPVDLVTVLDISGSMAG------------------ 280 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 TK+A LK A+ + ++ + +I +++ + T+ RQ + Sbjct: 281 --TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLRRMTDAGRQQALQA 332 Query: 284 MDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 ++SL+ T+ +++ +++ K+R+ ++ II L+DG++ Sbjct: 333 VNSLVANGGTNIAEGLRKGAKVMEDRKERNPVSS----------------IILLSDGQDT 376 Query: 343 NFKSNVN 349 + + Sbjct: 377 YTVNGSS 383 >gi|332255311|ref|XP_003276777.1| PREDICTED: collagen alpha-1(XXII) chain-like, partial [Nomascus leucogenys] Length = 695 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 57/161 (35%), Gaps = 23/161 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMK 299 +D + +G++ Y+ R E G ++ V L T++ A++ Sbjct: 65 VDTFEVGPDRTRVGVVRYSDRPTTAFEL--GLFGSQEEVKAAARRLAYHGGNTNTGDALR 122 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 RSF R G + +++ I LTDG + + + Sbjct: 123 Y-------ITARSFSP---RAGGRPGDRAYKQVAILLTDGRSQDLVLDAAAAA-----HR 167 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 I+I + + + L+ S P + ++V + +++ Sbjct: 168 AGIRIFAVGVG--EALKEELEEIASEPKSAHVFHVSDFNAI 206 >gi|307254358|ref|ZP_07536196.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307258816|ref|ZP_07540548.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|306862657|gb|EFM94613.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306867167|gb|EFM99023.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 11 str. 56153] Length = 538 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 39/269 (14%), Positives = 80/269 (29%), Gaps = 26/269 (9%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 ++ I E ++++ L + L LI + + + + A+L+ ++ Sbjct: 6 LSQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAE 65 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + + SN + D + + ++ + F + +N I Sbjct: 66 NNNGRKDNDYKLSGSSNKENDSF--DISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPI 123 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 T N +T + S I+H V IP + + Sbjct: 124 CKTV---NNTSGKGHTSSSEVTCTVSGT-----IKHKSWFPLKVGTVEVIPQQVDVASKS 175 Query: 188 RPI--------FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + +V DLSGSM +D + +K+ L+ L Sbjct: 176 KAFKKNTFNIPIDLMVVADLSGSM--------KDGIKGEKLKGGTNSKIYILREVLKELA 227 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 D E +G+ + E E Sbjct: 228 DKSLFTQEANEYNRIGITAFAMGAEHPKE 256 >gi|296227162|ref|XP_002759255.1| PREDICTED: hypothetical protein LOC100397584 [Callithrix jacchus] Length = 1319 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 57/161 (35%), Gaps = 23/161 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMK 299 +D + +G++ Y+ + E G R+ V L T++ A++ Sbjct: 65 VDTFEVGPDRTRVGVVRYSDQPTTAFEL--GLFGSREEVKAAARRLAYHGGNTNTGDALR 122 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 RSF + G + +++ I LTDG + + + Sbjct: 123 Y-------ITARSFSPHA---GGRPGDRAYKQVAILLTDGRSQDLVLDAAAAA-----HR 167 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 I+I + + + L+ S P + ++V + +++ Sbjct: 168 AGIRIFAVGVG--EALKEELEEIASEPKSAHVFHVSDFNAI 206 >gi|290956351|ref|YP_003487533.1| hypothetical protein SCAB_18411 [Streptomyces scabiei 87.22] gi|260645877|emb|CBG68968.1| putative secreted protein [Streptomyces scabiei 87.22] Length = 422 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 78/247 (31%), Gaps = 34/247 (13%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 RL +L I A+ P + L++D+SGSM Sbjct: 5 QRLATGAFVLLAVLTAGIAVPTTAVADEISATAPK--VNLLLDVSGSMRAK--------- 53 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 +++M+A K A L ++V +G+ + + + Sbjct: 54 -----DIDGQSRMSAAKQAFNEVL------DATPKEVQLGIRTLGADYPGDDRKTGCKDT 102 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + Y + P D T A K A L + + K I+ Sbjct: 103 AQLY--------PVGPLDRTEA-KTAVATLAPTGWTPIGPALLKAADDLEGGEGTKRIVL 153 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVV 393 ++DGE+ + + AK + I T+ + R+ +C++ + ++ Sbjct: 154 ISDGEDTCAPLDPCEVAREIAAKGIGLTIDTLGL-VPNAKLRVQLSCIAEATGGTYTSIE 212 Query: 394 NADSLIH 400 + D L Sbjct: 213 HRDELTD 219 >gi|242065788|ref|XP_002454183.1| hypothetical protein SORBIDRAFT_04g026250 [Sorghum bicolor] gi|241934014|gb|EES07159.1| hypothetical protein SORBIDRAFT_04g026250 [Sorghum bicolor] Length = 703 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 54/166 (32%), Gaps = 27/166 (16%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 ++ + F + + D+ + Q + A R + V Sbjct: 207 VEIKTYSEFSAIQQSSQDDFAVLIHLKAPYANPEQVTGRSVSATSVGYPTSRAPVDLVTV 266 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 +D+SGSM TK+A LK A+ + + + + Sbjct: 267 LDVSGSMAG--------------------TKLALLKRAMGFVIQHLGPSD------RLSV 300 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQA 301 I +++ + + RQ + ++SL T+ A+K+A Sbjct: 301 IAFSSTARRLFHLRRMSHSGRQQALQAVNSLGASGGTNIADALKKA 346 >gi|271963053|ref|YP_003337249.1| hypothetical protein Sros_1513 [Streptosporangium roseum DSM 43021] gi|270506228|gb|ACZ84506.1| hypothetical protein Sros_1513 [Streptosporangium roseum DSM 43021] Length = 605 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 73/223 (32%), Gaps = 45/223 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + +V+D+SGSM + D R+K+ K A + L V Sbjct: 414 VLMVIDVSGSMGAGV-------------PDTGRSKLDLAKQAAINALPQFGPHDKV---- 456 Query: 253 YMGLIGYTTRVEKNI------EPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQIL 305 GL ++T+ + R+ + +D L T AYQ + Sbjct: 457 --GLWMFSTKRDGEKDHLELAPLDTVDAAQRKTLRTRLDGLTPDGGTGLYDTALAAYQHV 514 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG---ENNNFKSNVNTIKICDKAKENFI 362 +IFLTDG +NN+ + ++ E + Sbjct: 515 RDRHSGEAINA----------------VIFLTDGKNEDNNSLSLENLLPDLRAESAEESV 558 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 ++ TI+ + L + ++ Y+ + S+ VF + Sbjct: 559 RMFTIAYGQDADLGVLKQISETTNAAAYDSRESGSIDQVFTAV 601 >gi|15965798|ref|NP_386151.1| putative signal peptide protein [Sinorhizobium meliloti 1021] gi|307311332|ref|ZP_07590975.1| putative signal peptide protein [Sinorhizobium meliloti BL225C] gi|307318865|ref|ZP_07598297.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|15075067|emb|CAC46624.1| Hypothetical signal peptide protein [Sinorhizobium meliloti 1021] gi|306895586|gb|EFN26340.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|306899633|gb|EFN30261.1| putative signal peptide protein [Sinorhizobium meliloti BL225C] Length = 444 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 11/57 (19%), Positives = 28/57 (49%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 FR + ++ + NF+++ A++ + + I +++ ++SA +AA L Sbjct: 5 TFRLHLRRLVRDRDGNFAVLGAIAFVPIIGAAALAIDFAGAYFEAEKIQSALDAAAL 61 >gi|329963582|ref|ZP_08301061.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528571|gb|EGF55542.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 342 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 74/225 (32%), Gaps = 45/225 (20%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 ++ Y ++F + +++ ++E +R + + +D+S SM Sbjct: 50 VSKYRPDVKFWLVFVAIGLFAVLLARPQFGSKLETVKRQGVEVMIALDISNSMLA----- 104 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 QD + +++ K + +D + + V G+I + + + Sbjct: 105 ----------QDVQPSRLEKAKRLVAQLVDKMQ-----NDKV--GMIVFAGDAFTQLPIT 147 Query: 271 WGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + ++ SLI K T A+ A + T P Sbjct: 148 SDYISAKMFLESIDPSLISKQGTAIGAAINLATRSFT------------------PQEGV 189 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 + II +TDGEN+ + A E I++ + + Sbjct: 190 GRAIIVITDGENHEGGAVEAAKA----ATEKGIQVNVLGVGMPDG 230 >gi|218191772|gb|EEC74199.1| hypothetical protein OsI_09355 [Oryza sativa Indica Group] Length = 723 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 30/236 (12%), Positives = 69/236 (29%), Gaps = 54/236 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + V+D+SGSM TK+A LK A+ + ++ Sbjct: 277 TRAPVDLITVLDVSGSMAG--------------------TKLALLKRAMGFVIQNLGSSD 316 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + +I +++ + T +Q + T+ +++ +++ Sbjct: 317 ------RLSVIAFSSSARRLFPLRRMTETGRQQSLQAVYSLTSNGGTNIAEGLRKGSKVI 370 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-----IC-----D 355 + ++ + II L+DG++ S + C Sbjct: 371 EDRQAKNPVCS----------------IILLSDGQDTYTVSPTAGVHKAAPEYCSLLPYT 414 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV 410 + + A + L +S + ++ F I L+ Sbjct: 415 SNGCQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAIQDAFAQCIGGLLS 470 >gi|91216720|ref|ZP_01253685.1| aerotolerance-related membrane protein [Psychroflexus torquis ATCC 700755] gi|91185189|gb|EAS71567.1| aerotolerance-related membrane protein [Psychroflexus torquis ATCC 700755] Length = 349 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 61/189 (32%), Gaps = 45/189 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++E +R I +D+S SM D ++ K + ++ Sbjct: 81 KMETIKREGVDIVFALDVSKSMLAE---------------DIAPNRLEKSKRIITEIVNK 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQ 300 + +GL+GY + + + ++ D + + T + A+ Sbjct: 126 L-TAD------RVGLVGYAGSAFPQVPITTDYASTKTFLQSMNTDMVSSQGTAISQAIDL 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A D + + K +I L++GE + N + + A Sbjct: 179 AKSYYNDDDQTN------------------KVLIILSEGE----DHDSNVESMAETAAAE 216 Query: 361 FIKIVTISI 369 IKI TI + Sbjct: 217 GIKIYTIGV 225 >gi|301772356|ref|XP_002921600.1| PREDICTED: integrin alpha-1-like, partial [Ailuropoda melanoleuca] Length = 1160 Score = 40.6 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 72/184 (39%), Gaps = 25/184 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D ++ + + +G++ Y V + V + +I + T Sbjct: 172 AFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL--NKYSSTEEVLVAANKIIQRGGRQT 229 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A I T+ K+ R+G +K ++ +TDGE+++ I+ C+ Sbjct: 230 ---MTALGIDTARKEAFTEARGARRG-------VKKVMVIVTDGESHDNHQLNKVIQDCE 279 Query: 356 KAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + I+ +I+I S N L +K+ S P ++ +NV + +L+ + + + Sbjct: 280 ---DENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEAL 336 Query: 406 SQLM 409 + + Sbjct: 337 GERI 340 >gi|300022610|ref|YP_003755221.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299524431|gb|ADJ22900.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 638 Score = 40.6 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 69/224 (30%), Gaps = 50/224 (22%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 L+VD SGSM + D + K+ ++ + + + Sbjct: 32 MLIVDGSGSMWGRLAPD-------------NKPKIDVVR-------EKLATILQTPSSTR 71 Query: 254 MGLIGYTTRVEKN----IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 +GL+ + R + + + + P T A++ A + + + Sbjct: 72 VGLVSFGHRRRGDCNDVELIASPDSERAALLGPLAKLNPRGPGPVTAALEIAADAIGTSR 131 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN-FIKIVTIS 368 II + DG +N + + D AK + + I Sbjct: 132 PAQ--------------------IIIVGDGADNCQQDSCAAAN--DFAKSAPGVAVQVIG 169 Query: 369 INASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 I + + CV+ +Y+V +A L +QL + Sbjct: 170 IGVPATERPRI-ACVAQATGGRYYDVTDAAGLNAALDEATQLAI 212 >gi|86360183|ref|YP_472072.1| hypothetical protein RHE_PC00139 [Rhizobium etli CFN 42] gi|86284285|gb|ABC93345.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 671 Score = 40.6 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 41/291 (14%), Positives = 92/291 (31%), Gaps = 48/291 (16%) Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNN 142 + ++ + A IK+ + +S +SA + + S + R++ N Sbjct: 193 DPSRERFANAAVNPIKSVAADPVSTFSADVDSASYSFVRRSLTAGAMPDPQSVRVEEMIN 252 Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQ--RYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 Y+ D + ++ ++ +++ I P + ++D+S Sbjct: 253 YFPYDWPGPEKADQPFKATVTVMPTPWNHDTQLMHVAIKGYDIAPATAPHANLVFLIDVS 312 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 GSM + K+ LK+A L ++ + V Y G G Sbjct: 313 GSMD-------------------EPDKLPLLKSAFRLLVNKLKADDTVSIVTYAGNAG-- 351 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFR 319 V + + + + +D L T + AY L N Sbjct: 352 -TVLEPTRVA-----EKSKILSAIDRLEAGGSTGGAEGIAAAYD-LAKKAFVKDGVNR-- 402 Query: 320 QGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENFIKIVTISI 369 ++ TDG+ N + + +I ++ +++ I + + Sbjct: 403 -------------VMLATDGDFNVGPSIDEDLKRIIEEKRKDGIFLTVLGF 440 >gi|219848163|ref|YP_002462596.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219542422|gb|ACL24160.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 914 Score = 40.6 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 60/212 (28%), Gaps = 45/212 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ERP + LV+D SGSM + D RT++ + A+ + Sbjct: 407 ERPDLALVLVIDRSGSMRELV--------------DDGRTQLDLAREAVYQASRGLTQRD 452 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + LI + + + + + T+ + A + + Sbjct: 453 ------QIALIAFDSIADTLLPLQPLPGLFTIE-DALSRLVAGGGTNIRSGIALAAETIA 505 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + R + +I LTDG S + + I + Sbjct: 506 TSQARI------------------RHVILLTDG-----VSETEYADLVADLRAQGITVSA 542 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 I+I L + +Y V L Sbjct: 543 IAIGL-DTDPALERVAQIGGGKYYLVQRVPDL 573 >gi|88857796|ref|ZP_01132439.1| von Willebrand factor type A domain protein [Pseudoalteromonas tunicata D2] gi|88820993|gb|EAR30805.1| von Willebrand factor type A domain protein [Pseudoalteromonas tunicata D2] Length = 608 Score = 40.6 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 69/204 (33%), Gaps = 45/204 (22%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++++ IE E + ++D+SGSM K Sbjct: 219 NNQRQLLKIGIKGFDIEKAELKAANLVFLLDVSGSM-------------------NAPDK 259 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + LK++L + +D V VY G G K E Q ++ +++L Sbjct: 260 LPLLKSSLTMLTKQLDENDSVAIVVYAGAAGLVLPATKGNEY--------QVISNALNNL 311 Query: 288 ILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ ++ AYQ Q K + +I TDG+ N S Sbjct: 312 SAGGSTNGAQGIELAYQ-------------IASQNFKKEGINR---VILATDGDFNVGMS 355 Query: 347 NVN-TIKICDKAKENFIKIVTISI 369 +V+ K+ ++ I + T+ Sbjct: 356 SVDALKKLIANKRKTGIALTTLGF 379 >gi|297626137|ref|YP_003687900.1| ChlD, Mg-chelatase subunit ChlD [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921902|emb|CBL56462.1| ChlD, Mg-chelatase subunit ChlD [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 324 Score = 40.6 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 43/237 (18%), Positives = 75/237 (31%), Gaps = 55/237 (23%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R I LV+D+S SM D +++AA K+A F+ ++ + Sbjct: 92 RERATIVLVLDISQSMMA---------------TDVSPSRLAAEKDAATKFVAALPAQYN 136 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILT 306 V G T R V + + +L L T ++ + L Sbjct: 137 VSVVTLSG---------HPNTLVPPTTD-RAPVNQGIKTLELADGTAIASSIDVGLEALK 186 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 ++ L+DG V + DKAK+ + I T Sbjct: 187 QAPAGDDGKQAPG------------LMVLLSDGSETGGGDPVASA---DKAKQQNVPIYT 231 Query: 367 ISI-----NASPNGQRL--------LKTCV-SSPEYHYNVVNADSLIHVFQNISQLM 409 I+ +GQR LK +S + +A L V++ ++ + Sbjct: 232 IAFGTQNGYVDLDGQRFNVAPDTDMLKRIADASSGKALDAASASQLDDVYKTLTSDV 288 >gi|192359695|ref|YP_001982895.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190685860|gb|ACE83538.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 660 Score = 40.6 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 21/114 (18%), Positives = 39/114 (34%), Gaps = 10/114 (8%) Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-------- 352 A + Q V F++ ++ LTDG + V I+ Sbjct: 112 AINSVALHTHIGAALEKAAQDVVAGDDGFRRNLVLLTDGVVDIDPEAVVNIQERKRILTE 171 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 + + K + TI+++ Q L+K ++ +AD L+ F I Sbjct: 172 LLPQLKAAGYVVHTIALS-QDADQELMKKLALTTDGVFAVAQSADELMQAFLTI 224 >gi|212715236|ref|ZP_03323364.1| hypothetical protein BIFCAT_00127 [Bifidobacterium catenulatum DSM 16992] gi|212661917|gb|EEB22492.1| hypothetical protein BIFCAT_00127 [Bifidobacterium catenulatum DSM 16992] Length = 1192 Score = 40.6 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 25/149 (16%), Positives = 48/149 (32%), Gaps = 39/149 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV+D SGSM ++++ + A L + ++ Sbjct: 492 DIVLVMDKSGSMKGELDNNAK----------------EAANALAKKLLTDKNSTLPSEQQ 535 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V M ++ ++T+ + ++ T+ A+KQA Sbjct: 536 VQMAVVTFSTKA----TIEQNFTTDVLKINNAVEGDPDGGTNWEAALKQA---------- 581 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + +K IIFL+DG+ Sbjct: 582 ---------NILSGRSNVKKHIIFLSDGD 601 >gi|156523144|ref|NP_001095986.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus] gi|187609595|sp|A2VE29|ITIH5_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor gi|126010782|gb|AAI33545.1| ITIH5 protein [Bos taurus] Length = 940 Score = 40.6 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 42/222 (18%), Positives = 75/222 (33%), Gaps = 48/222 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K+AL L + H Sbjct: 296 VVFVLDSSASMVG--------------------TKLRQTKDALFTILHDLRPQDHFN--- 332 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++G++ R++ K+ S +R TD A+++ Q+L Sbjct: 333 ---IVGFSNRIKVWKDHLVSVTPNSIRDGKVYIHHMSPSGGTDINGALQRGIQLLNDYVA 389 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 + + ++FLTDG+ +T KI + +E + I T+ Sbjct: 390 HNDIEDRSVS-----------LVVFLTDGK--PTVGETHTFKILNNTREATRGRVCIFTV 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNI 405 I A + + L L+ C + H + LI + I Sbjct: 437 GIGADVDFKLLEKLSLENCGLTRRVHEDHDARAQLIGFYDEI 478 >gi|117620923|ref|YP_857205.1| structural toxin protein RtxA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117562330|gb|ABK39278.1| structural toxin protein RtxA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 4260 Score = 40.6 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 38/273 (13%), Positives = 88/273 (32%), Gaps = 53/273 (19%) Query: 114 YNTEIQNIVNSSRISMTHMAN-NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 N+ N+++ + + A +D N+T + + ++F + + + N Sbjct: 3135 DNSNDNNLISLANGKVLLSATVTVVDGDNDTATGTVSADLGGN--IRFEDDVPTAKDNSV 3192 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 +++ P F + +V+D SGSM + + ++ K Sbjct: 3193 VITEAGL---------PPFNLVMVIDTSGSMLWQIGTSTNG----------SPNRLELAK 3233 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI-----EPSWGTEKVRQYVTRDMDSL 287 +AL +DS V +G+ T ++ + + + ++ Sbjct: 3234 DALNHMIDS---------YVALGVPLVFTVIDFASGAVLIPQTSDPDVAKASISGLP--T 3282 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ + A LT+D + + FL+DG N Sbjct: 3283 DGGGTNYNAPLVLAQNQLTADLANPALAGYETKVY------------FLSDGAPNEGNVP 3330 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 + N +++ + +N S N + + Sbjct: 3331 AGWTSFVN---SNNVEVYAVGLNVSGNATAIAQ 3360 >gi|307323133|ref|ZP_07602343.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890622|gb|EFN21598.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 422 Score = 40.6 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 38/186 (20%), Positives = 61/186 (32%), Gaps = 36/186 (19%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 +S A+ + P +ELV+D+SGSM T+MAA K Sbjct: 23 ISPPAAMAGEPAPDSPK--VELVLDVSGSMRARDVDGD--------------TRMAAAKQ 66 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 A L E+V +G+ + + + Y + D Sbjct: 67 AFNEVL------DATPEEVRLGIRTLGANYPGKDRVAGCRDSEQLY--------PVGQVD 112 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 T A K A L + S + I+ +TDGE++ + + Sbjct: 113 RTEA-KAAVATLRPTGWTPIGLALRGASKDLSSGEGTRRIVLITDGEDSCGQPDP----- 166 Query: 354 CDKAKE 359 CD A+E Sbjct: 167 CDVARE 172 >gi|296282333|ref|ZP_06860331.1| von Willebrand factor type A domain-containing protein [Citromicrobium bathyomarinum JL354] Length = 571 Score = 40.6 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 35/250 (14%), Positives = 75/250 (30%), Gaps = 55/250 (22%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + ++V A ERP + ++D+SGSM K+ Sbjct: 195 DTRLVRIGLAGYEAPKAERPAANLVFLLDVSGSMS-------------------SADKLP 235 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 +K A+ + + V VY G G + R+ + Sbjct: 236 LVKTAMKTLVGQLTPKDRVSIVVYAGAAGLVLEPTSDS---------REIMAALDQLQAG 286 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNV 348 T ++ AY++ + K +I TDG+ N N Sbjct: 287 GSTAGGAGLELAYKVAEASKVDGINR-----------------VILATDGDFNVGLSDND 329 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---------DSLI 399 ++ + ++N I + + + L++ +Y +++ + L Sbjct: 330 KLLEYVEDKRKNGIAMSVLGFGRGNINEALMEQIADKGNGNYGYIDSAIEARKVLGEQLG 389 Query: 400 HVFQNISQLM 409 I++ + Sbjct: 390 ATLYTIAKDV 399 >gi|163758555|ref|ZP_02165642.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] gi|162283845|gb|EDQ34129.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] Length = 549 Score = 40.6 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 69/218 (31%), Gaps = 49/218 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + +V+D S SM ++ + K+ K + + + D + V Sbjct: 16 VMIVLDGSNSMWGQVDGE---------------AKITIAKEVMTDLITNWD------DSV 54 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL Y R + + D++ + + A+ Q +T K Sbjct: 55 DLGLTVYGHRRKGD--------------CADIEVVAMPGKVDRQALIDKVQSITPRGKTP 100 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAKENFIK-----IVT 366 + + + ++ ++DG E N C +AK I + Sbjct: 101 ISKTLSLAALSVGFFSGKSSVVLVSDGLETCNADP-------CAQAKSLGIINPGFDVHV 153 Query: 367 ISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQ 403 I + + + L+ + + + NA+ L + Sbjct: 154 IGFDVTEEEFKSLQCIATETGGKFFRANNAEELKDALR 191 >gi|221044732|dbj|BAH14043.1| unnamed protein product [Homo sapiens] Length = 560 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 52/160 (32%), Gaps = 35/160 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + AL+ LD + V Sbjct: 187 VVFVIDKSGSMSGR--------------------KIQQTREALIKILDDLSPRDQFNLIV 226 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + R + K R + L T+ AM A Q+L S + Sbjct: 227 FSTEATQ-WRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDSSNQ-- 280 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + S+ II LTDG+ ++N +I+ Sbjct: 281 ------EERLPEGSVSL---IILLTDGDPTVGETNPRSIQ 311 >gi|283779907|ref|YP_003370662.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283438360|gb|ADB16802.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 1040 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 19/82 (23%), Positives = 37/82 (45%), Gaps = 7/82 (8%) Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS-PNGQRLLKTC 382 P+ K +I ++DG+ + I ++ K+ IKI T+++ P G L+ Sbjct: 550 KPNPASVKHMIIISDGDPSPPSG-----TILNQYKQAGIKITTVAVGTHGPAGSTPLQNI 604 Query: 383 V-SSPEYHYNVVNADSLIHVFQ 403 ++ +Y N +L +FQ Sbjct: 605 ANATGGKYYVATNPKALPRIFQ 626 >gi|115438797|ref|NP_001043678.1| Os01g0640200 [Oryza sativa Japonica Group] gi|20805117|dbj|BAB92788.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|113533209|dbj|BAF05592.1| Os01g0640200 [Oryza sativa Japonica Group] gi|125527021|gb|EAY75135.1| hypothetical protein OsI_03030 [Oryza sativa Indica Group] gi|125571342|gb|EAZ12857.1| hypothetical protein OsJ_02777 [Oryza sativa Japonica Group] Length = 589 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 29/228 (12%), Positives = 69/228 (30%), Gaps = 52/228 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 +R + V+D+SGSM ++ +K AL + Sbjct: 58 GSSSSTDRAGLDLVAVIDVSGSMDGD--------------------RIDKVKTALQFVIR 97 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMK 299 + L + ++ + T + + T + + +D L T+ ++ Sbjct: 98 KLSDLD------RLCIVTFCTNATRLCPLRFVTAAAQAELKALVDGLKAYGDTNMKGGLE 151 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 ++ + ++ ++DG N+ + + + Sbjct: 152 TGMSVVDGRSLAAGRAVS---------------VMLMSDGYQNHG-GDARDVHL------ 189 Query: 360 NFIKIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVFQNI 405 + + T S AS + LL+ S V ++ +L F + Sbjct: 190 KNVPVYTFSFGASHDS-NLLEAIARKSLGGTFNYVADSANLTGPFSQL 236 >gi|281339017|gb|EFB14601.1| hypothetical protein PANDA_010505 [Ailuropoda melanoleuca] Length = 1153 Score = 40.6 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 72/184 (39%), Gaps = 25/184 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D ++ + + +G++ Y V + V + +I + T Sbjct: 173 AFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL--NKYSSTEEVLVAANKIIQRGGRQT 230 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A I T+ K+ R+G +K ++ +TDGE+++ I+ C+ Sbjct: 231 ---MTALGIDTARKEAFTEARGARRG-------VKKVMVIVTDGESHDNHQLNKVIQDCE 280 Query: 356 KAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + I+ +I+I S N L +K+ S P ++ +NV + +L+ + + + Sbjct: 281 ---DENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEAL 337 Query: 406 SQLM 409 + + Sbjct: 338 GERI 341 >gi|297157667|gb|ADI07379.1| hypothetical protein SBI_04258 [Streptomyces bingchenggensis BCW-1] Length = 528 Score = 40.6 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 67/255 (26%), Gaps = 69/255 (27%) Query: 176 FIPALLRIEMGERPIFLIEL--------VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 P + G + EL V+D SGSM + Sbjct: 314 PFPGTRSVADGLLAAYENELRRPSRTVYVLDTSGSMEGD--------------------R 353 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-- 285 + LK AL + G + + VR + + D Sbjct: 354 LDRLKAALTQL--------AGADGAATGERFRDREEVTLMPFGSEVKAVRTHTVPEDDPG 405 Query: 286 ------------SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 T +++ AY+ L + + F I Sbjct: 406 KALAAIRADAKALTADGETAIFSSLRAAYRHL------------AERASALGDDRFTS-I 452 Query: 334 IFLTDGENNNFKS--NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHY 390 + +TDGEN S + + + + I S G+ L+ S + + Sbjct: 453 VLMTDGENTAGDSADDFESFYRRLPGAQRTTPVFPILFGDSDRGE--LENIASLTGGRLF 510 Query: 391 NVVNADSLIHVFQNI 405 + SL F+ I Sbjct: 511 DATK-GSLDQAFEEI 524 >gi|297626138|ref|YP_003687901.1| Von Willebrand factor, type A [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921903|emb|CBL56463.1| Von Willebrand factor, type A [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 321 Score = 40.6 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 41/236 (17%), Positives = 79/236 (33%), Gaps = 50/236 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R I + +D+S SM D ++ A K+ F+DS+ + Sbjct: 86 RERATIVVTIDVSRSMEA---------------TDVTPNRLDAAKSGAKDFVDSLPSAFN 130 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V L+ + + P+ +++ + + T + Y L Sbjct: 131 V------ALVTFAGTANVKMPPTTDRTQLKAAIDAIRLAP---STAIGEGI---YTSLDV 178 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 +K + + I+ L+DG N + + + +AK+ + I TI Sbjct: 179 LEKLAPQDPDHPDDPAPGA------IVLLSDGATNMGRDSADAAT---EAKKKNVPIYTI 229 Query: 368 SINAS-----PNGQ---------RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + S NGQ L + S Y+ + +L V+Q IS+ + Sbjct: 230 AYGTSTGYVVENGQRQTVAVNHAELSQVAKLSGGKKYSADSMKNLQAVYQTISRQI 285 >gi|319639064|ref|ZP_07993821.1| PilC protein [Neisseria mucosa C102] gi|317399642|gb|EFV80306.1| PilC protein [Neisseria mucosa C102] Length = 1123 Score = 40.6 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 14/76 (18%), Positives = 34/76 (44%), Gaps = 3/76 (3%) Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNA 395 D ++ + + K D +K+ ++ T+ S G+ L+ S P++++N Sbjct: 342 DDAGKSWDGDPSDPKGVDYSKQL-VQTFTVGFGEGISEVGREYLEKGASRPDWYFNAAKK 400 Query: 396 DSLIHVFQNISQLMVH 411 + L+ F+ I + + Sbjct: 401 EDLLEAFKTIVDNIEN 416 >gi|296474801|gb|DAA16916.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Bos taurus] Length = 916 Score = 40.6 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 52/165 (31%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 276 VIFVIDKSGSMMGR--------------------KIKQTREA---LIKILDDLSPHDQF- 311 Query: 253 YMGLIGYTTRVEKNIEP-----SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI +++ + + + Y T T+ AM A Q+L Sbjct: 312 --DLISFSSEATTWKPLLVPASTENVNEAKSYATGIQAQ---GGTNINDAMLMAVQLLEK 366 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + S+ II LTDG+ ++N I+ Sbjct: 367 ANQEEL--------LPEGSITL---IILLTDGDPTVGETNPLNIQ 400 >gi|291528739|emb|CBK94325.1| von Willebrand factor type A domain [Eubacterium rectale M104/1] Length = 410 Score = 40.6 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 21/93 (22%), Positives = 40/93 (43%), Gaps = 5/93 (5%) Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 N K S + +I +TDG NN +S ++ + + A E + I + + + Sbjct: 7 GINSATTEFKNYSTDASRIMIVVTDGYNN--QSGASSATVINNAIEENVIIYCVGVGSVN 64 Query: 374 NGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 + +LK S+ +Y + L +F+NI Sbjct: 65 S--TVLKNISESTGGCYYYINQFSQLNGIFENI 95 >gi|224113057|ref|XP_002316375.1| predicted protein [Populus trichocarpa] gi|222865415|gb|EEF02546.1| predicted protein [Populus trichocarpa] Length = 714 Score = 40.6 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 35/246 (14%), Positives = 88/246 (35%), Gaps = 51/246 (20%) Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI---FYNMDVMTSYDYRLQFIEHLLNQ 167 +VF + E ++ +S + N+ + T+ Y S L++ Sbjct: 177 SVFDDDESLDLQPASSERSSGNKNDAGHNPAKTVEIKTYPEVSAASCSNSYDNFTVLVHI 236 Query: 168 RYNQKIVSFIP-----ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + P +L ++ R + V+D+SGSM Sbjct: 237 KAAATVGRLNPRGNQASLPQLSQTPRAPVDLVTVLDISGSMAG----------------- 279 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 TK+A LK A+ + ++ + +I +++ + ++ RQ+ + Sbjct: 280 ---TKLALLKRAMGFVIQNLGSND------RLSVIAFSSTARRLFPLRRMSDTGRQHALQ 330 Query: 283 DMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 +++L+ T+ +++ +++ ++++ + II L+DG++ Sbjct: 331 AVNALVANGGTNIAEGLRKGAKVMEDRREKNPVAS----------------IILLSDGQD 374 Query: 342 NNFKSN 347 S Sbjct: 375 TYTVSG 380 >gi|109077202|ref|XP_001094788.1| PREDICTED: integrin alpha-1 [Macaca mulatta] Length = 1179 Score = 40.6 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 69/190 (36%), Gaps = 30/190 (15%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +++ FL D ++ + + +G++ Y V + + + Sbjct: 187 DSVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAAKKIVQRGG 245 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + + A + ++ + +K ++ +TDGE N Sbjct: 246 RQTMTALGIDTARKEAFTEA-------------RGARRGVKKVMVIVTDGE---SHDNHR 289 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLI 399 K+ ++ I+ +I+I S N L +K+ S P ++ +NV + +L+ Sbjct: 290 LKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALV 349 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 350 TIVKTLGERI 359 >gi|75832116|ref|NP_001015590.2| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Bos taurus] gi|122140331|sp|Q3T052|ITIH4_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; Flags: Precursor gi|74267794|gb|AAI02562.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Bos taurus] Length = 916 Score = 40.6 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 52/165 (31%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 276 VIFVIDKSGSMMGR--------------------KIKQTREA---LIKILDDLSPHDQF- 311 Query: 253 YMGLIGYTTRVEKNIEP-----SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI +++ + + + Y T T+ AM A Q+L Sbjct: 312 --DLISFSSEATTWKPLLVPASTENVNEAKSYATGIQAQ---GGTNINDAMLMAVQLLEK 366 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + S+ II LTDG+ ++N I+ Sbjct: 367 ANQEEL--------LPEGSITL---IILLTDGDPTVGETNPLNIQ 400 >gi|283852082|ref|ZP_06369356.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] gi|283572472|gb|EFC20458.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] Length = 442 Score = 40.6 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 59/447 (13%), Positives = 139/447 (31%), Gaps = 89/447 (19%) Query: 36 LIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAK 95 G + + + N +++A +AA LAG+ ++ + + ++ DA Sbjct: 3 AAGVAVDLSRVYVAHNQLQNAVDAAALAGSLQLPDDPDVTNGKVKAAVTANLALNDPDAT 62 Query: 96 RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM-THMANNRLDSSNNTIFYNMDVMTSY 154 + + S ++ + I + + N I + + ++ Sbjct: 63 DI---QVTSGGATRSVCVDAKANVDMTLTKVIGIGDTTVTAEACAGYNDIELVLVLDSTG 119 Query: 155 DYRLQFIEHL--LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 + I+ + I+ + R ++G P F ++ +D S + N D Sbjct: 120 SMKGSPIDSAKDAARDLVNLIMPASTSSTRSKIGLVP-FQGKVRIDGSDPVTAERNPDGV 178 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE---- 268 P C++ T + K + + + + G+ +T + + Sbjct: 179 G----PGCRNADGT-LNTGKLKV-EYSRTATSTNIFYGYTLSGVSTFTDKTCSGMSPIRA 232 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 S + + + T + +K ++L+ + + + Sbjct: 233 LSSDKNTILNNIEAINAGAVTSGTLISEGIKWGRKVLSPEAPYVEGSTDKK--------- 283 Query: 329 FQKFIIFLTDGEN------NNFKSNVNTIKI----------------------------- 353 +K +I LTDG+ NF S T+ Sbjct: 284 VRKIMIVLTDGDTEDGRCGGNFASASKTVNTYWTNAYFGQGLKPDTATSPYATLSTATAT 343 Query: 354 ------C--------------DKAKEN---FIKIVTISINASPN-GQRLLKTCVSS---- 385 C D AK + ++I ++ AS + L++ SS Sbjct: 344 LAQIPDCKDGGKLNQFVLDEADAAKNDLNYPVEIFSVRFGASDATDKSLMQKIASSKPGT 403 Query: 386 PEYHYNVVNADSLIHVFQNISQLMVHR 412 +++Y+ ++ + +F+ I Q + R Sbjct: 404 TDHYYDAPSSTGIQDMFKKIGQQLGQR 430 >gi|254775742|ref|ZP_05217258.1| hypothetical protein MaviaA2_13890 [Mycobacterium avium subsp. avium ATCC 25291] Length = 335 Score = 40.6 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 72/234 (30%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D +MAA + A F D + + Sbjct: 99 VMLVIDVSQSM---------------RATDVAPNRMAAAQEAAKQFADELTPG------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ E + + + + T + + A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR---TATGEGIFTALQAIATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNTIKICDKAKENFIKIVTISI 369 + P I+ +DG+ N + AK+ + I TIS Sbjct: 194 ---------IGGGDKPPPARIVLFSDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISF 244 Query: 370 N--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + L K S YN + L V+ + Q + Sbjct: 245 GTPYGFVEINDQRQPVPVDDETLKKVAQLSGGNAYNAASLQELKSVYATLQQQI 298 >gi|67968521|dbj|BAE00622.1| unnamed protein product [Macaca fascicularis] Length = 480 Score = 40.6 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 7/76 (9%) Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 II +TDGE + +T++ KA++ + T+ + Q + SPE+ + V Sbjct: 25 IIAMTDGE-LVAHAFQDTLREAQKARKLGANVYTVGVADYKLDQ--ITAIADSPEHVFAV 81 Query: 393 VN----ADSLIHVFQN 404 N + + Sbjct: 82 ENGFKAMRDTVDALTS 97 >gi|297675239|ref|XP_002815595.1| PREDICTED: integrin alpha-1-like [Pongo abelii] Length = 1179 Score = 40.6 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 69/190 (36%), Gaps = 30/190 (15%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +++ FL D ++ + + +G++ Y V + + + Sbjct: 187 DSVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAAKKIVQRGG 245 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + + A + ++ + +K ++ +TDGE N Sbjct: 246 RQTMTALGIDTARKEAFTEA-------------RGARRGVKKVMVIVTDGE---SHDNHR 289 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLI 399 K+ +E I+ +I+I S N L +K+ S P ++ +NV + +L+ Sbjct: 290 LKKVIQDCEEENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALV 349 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 350 TIVKTLGERI 359 >gi|194390782|dbj|BAG62150.1| unnamed protein product [Homo sapiens] Length = 1173 Score = 40.2 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 69/190 (36%), Gaps = 30/190 (15%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +++ FL D ++ + + +G++ Y V + + + Sbjct: 187 DSVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAAKKIVQRGG 245 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + + A + ++ + +K ++ +TDGE N Sbjct: 246 RQTMTALGIDTARKEAFTEA-------------RGARRGVKKVMVIVTDGE---SHDNHR 289 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLI 399 K+ ++ I+ +I+I S N L +K+ S P ++ +NV + +L+ Sbjct: 290 LKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALV 349 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 350 TIVKTLGERI 359 >gi|114600333|ref|XP_517769.2| PREDICTED: integrin alpha-1 [Pan troglodytes] Length = 1179 Score = 40.2 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 69/190 (36%), Gaps = 30/190 (15%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +++ FL D ++ + + +G++ Y V + + + Sbjct: 187 DSVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAAKKIVQRGG 245 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + + A + ++ + +K ++ +TDGE N Sbjct: 246 RQTMTALGIDTARKEAFTEA-------------RGARRGVKKVMVIVTDGE---SHDNHR 289 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLI 399 K+ ++ I+ +I+I S N L +K+ S P ++ +NV + +L+ Sbjct: 290 LKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALV 349 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 350 TIVKTLGERI 359 >gi|31657142|ref|NP_852478.1| integrin alpha-1 precursor [Homo sapiens] gi|124056463|sp|P56199|ITA1_HUMAN RecName: Full=Integrin alpha-1; AltName: Full=CD49 antigen-like family member A; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1; AltName: CD_antigen=CD49a; Flags: Precursor gi|187951605|gb|AAI37122.1| Integrin, alpha 1 [Homo sapiens] gi|187957526|gb|AAI37123.1| Integrin, alpha 1 [Homo sapiens] Length = 1179 Score = 40.2 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 69/190 (36%), Gaps = 30/190 (15%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +++ FL D ++ + + +G++ Y V + + + Sbjct: 187 DSVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAAKKIVQRGG 245 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + + A + ++ + +K ++ +TDGE N Sbjct: 246 RQTMTALGIDTARKEAFTEA-------------RGARRGVKKVMVIVTDGE---SHDNHR 289 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLI 399 K+ ++ I+ +I+I S N L +K+ S P ++ +NV + +L+ Sbjct: 290 LKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALV 349 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 350 TIVKTLGERI 359 >gi|302652520|ref|XP_003018108.1| hypothetical protein TRV_07884 [Trichophyton verrucosum HKI 0517] gi|291181717|gb|EFE37463.1| hypothetical protein TRV_07884 [Trichophyton verrucosum HKI 0517] Length = 240 Score = 40.2 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 56/147 (38%), Gaps = 11/147 (7%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + IVS P L I LV+D+SGSM NS +D + Sbjct: 46 PNKDSMIVSIQPPLKPGNDVPHVPCDIVLVIDISGSM----NSAAPIPTGEKGGEDTGLS 101 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + K+A +++++ + ++ + T V E + ++ + V +D Sbjct: 102 ILDLTKHAAKTIIETLNEKD------RLAVVTFCTEVNVAFELDYMNKENKSTVLSAVDK 155 Query: 287 LILK-PTDSTPAMKQAYQILTSDKKRS 312 L K T+ +K+ +L ++ + Sbjct: 156 LYGKSSTNLWHGIKKGLNVLATNPAQG 182 >gi|254427565|ref|ZP_05041272.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] gi|196193734|gb|EDX88693.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] Length = 684 Score = 40.2 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 53/327 (16%), Positives = 104/327 (31%), Gaps = 63/327 (19%) Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 ++ + +L +A + E NS + A N ++ T+ ++ V Sbjct: 207 IQQLGPQQWQVTLDNRTATASSEEE---TNSENPNFAPPAANGQPANAFTLDQDIVVYWR 263 Query: 154 YDYRLQFIEHL-----LNQRYNQKIVSFIPA--LLRIEMGERPIFLIELVVDLSGSMHCA 206 + L L + ++S P L I G +F V+D+SGSM Sbjct: 264 HQQDLPGSVDLVAYKAPGKDRGTFMLSITPGDDLPPITTGSDWVF----VLDISGSMSAK 319 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 + + + V+ AL I L E++ G + T Sbjct: 320 LATLGDGVSQ-------------ALGKLRGGDRFRIVLFDDRAEELTSGFVDAT------ 360 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 +RQY + M T+ + A L +D+ Sbjct: 361 ------PNNIRQYTKKVMQLQSRGGTNLFGGLSLALNPLDADRPTG-------------- 400 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 I+ +TDG N K D + + +++ T + + + +L ++ Sbjct: 401 ------IVLVTDGVANVGK--TQQKDFIDLLENHDVRLFT-FVMGNSANRPMLTAMTNAS 451 Query: 387 EYH-YNVVNADSLIHVFQNISQLMVHR 412 +V N+D + N + + H+ Sbjct: 452 NGFAISVSNSDDIAGQILNATAKLTHQ 478 >gi|145497681|ref|XP_001434829.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124401957|emb|CAK67432.1| unnamed protein product [Paramecium tetraurelia] Length = 648 Score = 40.2 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 40/259 (15%), Positives = 83/259 (32%), Gaps = 47/259 (18%) Query: 148 MDVMTSYDY-RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 + V YDY +L+ E I I ++ + V+D SGSM Sbjct: 184 LRVQAQYDYCKLKKSESQRLPAMVSIITKDIEQYVKNNSSIEAGIDLLCVIDKSGSMEGK 243 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 K+A+++ +L+ LD + + LI + ++ Sbjct: 244 --------------------KIASVQQSLVQLLDFLSEKD------RLCLITFDGSAQRL 277 Query: 267 IEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 T+ + Y + + S+ T+ + A+ + K ++ T+ Sbjct: 278 TPLKTLTQDNKNYFKKAIYSIRASGQTNIAKGTEIAFNQIQQRKMKNQVTS--------- 328 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 I L+DG+ + I+ E+ + I + + + + K C Sbjct: 329 -------IFLLSDGQ---DQGAAEYIQRQKDVVEDIVTIHSFGYGSDHDAALMSKICKVG 378 Query: 386 PEYHYNVVNADSLIHVFQN 404 Y + + L F + Sbjct: 379 QGSFYYIEDVKLLDEFFAD 397 >gi|119575263|gb|EAW54868.1| hCG2002731, isoform CRA_e [Homo sapiens] Length = 1177 Score = 40.2 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 69/190 (36%), Gaps = 30/190 (15%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +++ FL D ++ + + +G++ Y V + + + Sbjct: 187 DSVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAAKKIVQRGG 245 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + + A + ++ + +K ++ +TDGE N Sbjct: 246 RQTMTALGIDTARKEAFTEA-------------RGARRGVKKVMVIVTDGE---SHDNHR 289 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLI 399 K+ ++ I+ +I+I S N L +K+ S P ++ +NV + +L+ Sbjct: 290 LKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALV 349 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 350 TIVKTLGERI 359 >gi|82702351|ref|YP_411917.1| von Willebrand factor, type A [Nitrosospira multiformis ATCC 25196] gi|82410416|gb|ABB74525.1| von Willebrand factor, type A [Nitrosospira multiformis ATCC 25196] Length = 888 Score = 40.2 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 52/171 (30%), Gaps = 39/171 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + RP + LV+D SGSM N D D +K+ L+ A +F+ Sbjct: 394 IAANTTARPRSAVALVLDRSGSM----NEDAGD----------GISKVQKLREAANVFIS 439 Query: 241 SIDLLSHVKEDVYMGLIGYTT------RVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTD 293 ++ +GL+ + +++ GT + D T Sbjct: 440 AMQPADG------IGLVRFNEAAQRLMEIQEAGAAPGGTGRTIALEHIAGSDIDPAGATS 493 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + Q+L + + ++ LTDG N Sbjct: 494 IGDGVVNGKQMLDDAQATAGTPYDVTA------------MVVLTDGMWNRP 532 >gi|257453795|ref|ZP_05619073.1| von Willebrand factor type A domain protein [Enhydrobacter aerosaccus SK60] gi|257448722|gb|EEV23687.1| von Willebrand factor type A domain protein [Enhydrobacter aerosaccus SK60] Length = 260 Score = 40.2 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 68/189 (35%), Gaps = 29/189 (15%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+DLSGSM + S D ++ L + F + + V + Sbjct: 46 LVLDLSGSMA--IRSGNGDKR-----------RIDMLNEGIEAFYHDLMKDETARNRVRL 92 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 ++ + T+ + + + ++ T M A ++ Sbjct: 93 AIVIVGGVNDTAELMMDWTDAIDFFPIKFREN---GMTPLGQGMLLALNLI--------- 140 Query: 315 TNFFRQGVKIPSLPFQK-FIIFLTDGENNNFKS-NVNTIKICDKAKENF-IKIVTISINA 371 R ++ + + + ++I +TDG + + I C +A++N I I+I+A Sbjct: 141 -EQERINLRDNGINYTRPWVIAMTDGLPTDSQDVWQAAINQCHQAEQNNQCIIYPIAIDA 199 Query: 372 SPNGQRLLK 380 ++LK Sbjct: 200 GVQEVKMLK 208 >gi|115374040|ref|ZP_01461329.1| thrombospondin type 3 repeat family [Stigmatella aurantiaca DW4/3-1] gi|310819324|ref|YP_003951682.1| MtsD protein involved in cell-cell cohesion [Stigmatella aurantiaca DW4/3-1] gi|115368930|gb|EAU67876.1| thrombospondin type 3 repeat family [Stigmatella aurantiaca DW4/3-1] gi|309392396|gb|ADO69855.1| MtsD protein involved in cell-cell cohesion [Stigmatella aurantiaca DW4/3-1] Length = 655 Score = 40.2 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 62/163 (38%), Gaps = 32/163 (19%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDK--------KRTKMAALKNALLLFLDSIDL 244 + LV+D SGSM + + V+ D + ++ ALK + F + Sbjct: 60 VVLVIDQSGSMCVSDPPGSQGVDGFCEQVDDILLPPGVLEPARVRALKRLVNQFRQQPN- 118 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-------QYVTRDMDSLILKPTDSTPA 297 V + ++ + T V+ P+ + Y+ + L K TD A Sbjct: 119 -------VQISIVPFETNVKNVWPPATTGNRFARPDASLDTYIRGLQNQL-GKGTDYQGA 170 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF-IIFLTDG 339 + AY ++ SD +N LP ++ ++FLTDG Sbjct: 171 VGYAYSLIASDINAVSASNPEV-------LPRTRYVVVFLTDG 206 >gi|113866742|ref|YP_725231.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] gi|113525518|emb|CAJ91863.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] Length = 417 Score = 40.2 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 15/102 (14%), Positives = 37/102 (36%), Gaps = 6/102 (5%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 A ++ +II LS+ + +G + + + K+ ++++ +A LA A + L Sbjct: 15 ARQRGAVAIIVGLSLAVLIGFVGLALDLGKLYVTKSELQNSVDACALAAARDVTGATPLL 74 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE 117 ++ A + E + + +T Sbjct: 75 VSEAAGLATGTSNA------ALFQGKAVEMFENLNVSYSDTP 110 >gi|283455087|ref|YP_003359651.1| fimbriae protein with LPXTG motif and von Willebrand factor typeA domain [Bifidobacterium dentium Bd1] gi|283101721|gb|ADB08827.1| Fimbriae protein with LPXTG motif and von Willebrand factor typeA domain [Bifidobacterium dentium Bd1] Length = 1256 Score = 40.2 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 37/272 (13%), Positives = 76/272 (27%), Gaps = 50/272 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 V A + LV+D+ ++ K Sbjct: 568 NTYTVNVDVTGAANSSTITTTQSVDFTLVLDV-----------SSSMSDEMDSDQGSIKK 616 Query: 228 MAALKNALLLFLDSIDLLS--HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY----VT 281 M ALK+A+ FL ++ E + +GL+ + + + TE Y + Sbjct: 617 MTALKSAVNNFLGEAAEINEQSGSELIRVGLVKFAGKESSKVGNETYTEGRFVYNYSQIV 676 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + + + A++ + + ++ +IF TDG Sbjct: 677 SPLTADMSDLKNKVSALRH-----NGATRADLGFKHASTVMSGARTDAKRVVIFFTDGTP 731 Query: 342 NNFKS-----NVNTIKICDKAKENFIKIVTISI-------NASPNGQRLLKTCVSS---- 385 + + K++ + +I + + N + VSS Sbjct: 732 TKVSDFDKDVANSAVTYAKSLKDSGATVYSIGVFDGANPSSIEENQKNQFMNAVSSNYPH 791 Query: 386 ------------PEYHYNVVNADSLIHVFQNI 405 Y+ V N L +F+ I Sbjct: 792 ATAYDKLGTGSNAGYYKVVSNVSDLKSIFEKI 823 >gi|217974408|ref|YP_002359159.1| von Willebrand factor type A [Shewanella baltica OS223] gi|217499543|gb|ACK47736.1| von Willebrand factor type A [Shewanella baltica OS223] Length = 627 Score = 40.2 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 72/231 (31%), Gaps = 52/231 (22%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + ++D+SGSM K+ L+ AL L + V Sbjct: 221 GASNLVFLLDVSGSMAST-------------------DKLPLLQTALKLLTAQLSAQDKV 261 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 VY G G ++ + T+ + QAYQ Sbjct: 262 SIVVYAGAAG--VVLDGASG-----NDTQTLTYALEQLSAGGSTNGGQGITQAYQ----L 310 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 K+ F N + +I TDG+ N ++ + I + +K K++ I + T+ Sbjct: 311 AKKHFIPNGINR------------VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTL 358 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNA---------DSLIHVFQNISQLM 409 +L++ +Y ++ D L I++ + Sbjct: 359 GFGLGNYNDQLMEQLADKGNGNYAYIDTLNEARKVLVDELSSTLFTIAKDV 409 >gi|254421496|ref|ZP_05035214.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7335] gi|196188985|gb|EDX83949.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7335] Length = 410 Score = 40.2 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 70/222 (31%), Gaps = 49/222 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G + V+D SGSM T + +K A +D + Sbjct: 35 GVSAPLNVCFVLDRSGSMMG--------------------TPLQTVKQAASRIVDRLSNR 74 Query: 246 SHVKEDVYMGLIGYTTRVEK--NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + +I + + E + E + + +++ + T +K + Sbjct: 75 D------RISIIAFDHKAEVLISNELASDPQAIKRRINSLRAG---GGTCIDDGLKAGIE 125 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L S K+ + ++ LTDGEN + N IK+ D A + Sbjct: 126 QLASGKE-----------------GYISQLLLLTDGENEHG-DNSRAIKLADVAIGYNLT 167 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + T+ N L + + + +A+ I F + Sbjct: 168 VNTLGFGDHWNQDVLEQIADAGGGSLSYIEHAEEAIATFGRL 209 >gi|8569518|pdb|1QC5|A Chain A, I Domain From Integrin Alpha1-Beta1 Length = 192 Score = 40.2 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 71/188 (37%), Gaps = 26/188 (13%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +++ FL D ++ + + +G++ Y V + + + Sbjct: 20 DSVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAAKKIVQRGG 78 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + A+ T+ K+ R+G +K ++ +TDGE N Sbjct: 79 RQTMTALGTD----TARKEAFTEARGARRG-------VKKVMVIVTDGE---SHDNHRLK 124 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHV 401 K+ ++ I+ +I+I S N L +K+ S P ++ +NV + +L+ + Sbjct: 125 KVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDEIALVTI 184 Query: 402 FQNISQLM 409 + + + + Sbjct: 185 VKTLGERI 192 >gi|332970076|gb|EGK09074.1| D-amino acid dehydrogenase large subunit [Desmospora sp. 8437] Length = 454 Score = 40.2 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 42/238 (17%), Positives = 76/238 (31%), Gaps = 49/238 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 +E+++D SGSM + KM K A+ F+ + + + V Sbjct: 144 VEILLDASGSMAGRIRDGV---------------KMDLAKEAIENFVSDMPENAKISLRV 188 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKV---RQYVT----RDMDSL-ILKPTDSTPAMKQAYQI 304 Y G G + ++ + TE V YV + ++S T AM++A Q Sbjct: 189 Y-GHKGSNRKQDQKESCA-STEVVYPHGSYVKGKFGKALNSFEPTGWTPLAAAMEEARQD 246 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAKENFIK 363 L + I ++DG E + + + + Sbjct: 247 L----------------KPYAGEDAENIIYVVSDGIETCGGDPVKAAKSLYNSDIQAVVN 290 Query: 364 IVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQ----NISQLMVHRKYSV 416 I I + GQ+ LK + + + L F I + + Y+V Sbjct: 291 I--IGFDVDDAGQQALKKVAEAGGGEYKTANTREELNQSFGIDWDEIEKEVSKVWYNV 346 >gi|159041653|ref|YP_001540905.1| hypothetical protein Cmaq_1087 [Caldivirga maquilingensis IC-167] gi|157920488|gb|ABW01915.1| conserved hypothetical protein [Caldivirga maquilingensis IC-167] Length = 448 Score = 40.2 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 35/295 (11%), Positives = 93/295 (31%), Gaps = 45/295 (15%) Query: 54 ESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 + + I + S + N+ ++ D + A+ + I + + Sbjct: 145 RNMLDREIRSLISFYMGNMKKVNDTINKARSVLGPAIGHEVAELILDTDIDPYRARLVNM 204 Query: 114 YNTEIQNIVNSSRI---SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 N+ ++ + SSR+ + + ++ + ++ + + + Y Sbjct: 205 LNSLLKLVTESSRMYDEGILNEMLDKGVMTGIKRMDKVNEVKDLTPTNRALAKFAKPIYA 264 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 K+ + + +M +P I LV+D SGSM + ++ + +S + Sbjct: 265 YKLATGSLTVKERKMMRKPK--IYLVIDKSGSMFYTVMNNIFEYSSVSKITWAAALAIVM 322 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + E V + ++ + + + + + L Sbjct: 323 VMK--------------GHEVV---ARFFDQQIYQL------MTNKKDIIKTLLSLVPLG 359 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ T A++ AY + + ++ +TDGE++ Sbjct: 360 GTNITSAIRVAYDDAHRNPALRNYK-----------------LVLITDGEDDELD 397 >gi|224065911|ref|XP_002191398.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Taeniopygia guttata] Length = 809 Score = 40.2 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 36/173 (20%), Positives = 56/173 (32%), Gaps = 44/173 (25%) Query: 184 EMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 E P+F + V+D SGSM K+ ++ALL L Sbjct: 236 APQEMPVFPKNVIFVIDRSGSMTGR--------------------KIEQTRDALLKILQD 275 Query: 242 IDLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + H I + +V K+ E V TD + A+ Sbjct: 276 LRQEDHFS------FITFNNKVVEWKSSLLPATEENVASAAALVQTLTARGGTDISGALL 329 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 A +L +G+ S+ II LTDG+ + + NV I+ Sbjct: 330 AAVGVL-----------EKAEGLPERSISM---IILLTDGQPTSGEKNVEVIQ 368 >gi|113475004|ref|YP_721065.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110166052|gb|ABG50592.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 412 Score = 40.2 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 35/241 (14%), Positives = 80/241 (33%), Gaps = 49/241 (20%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 Q +Q+ V+ +++ ++ + L++D SGSM Sbjct: 18 QSLSQRQVALSISVMANQLERTVPLNLCLILDHSGSMEGR-------------------- 57 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT--RVEKNIEPSWGTEKVRQYVTRDM 284 + +K A + ++ + + ++ + +V + + ++ + + Sbjct: 58 PLETVKQAAVQLVEKLKEGD------RLSVVAFDHQAQVIVPNQMINDSASIKGKINKLR 111 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 S T +K + L +K S Q FI LTDGEN + Sbjct: 112 AS---GGTAIDKGLKLGIEELNKGRKESIS---------------QAFI--LTDGENEHG 151 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +++ +K+ A + I + ++ N L K + + + I F Sbjct: 152 DNDL-CLKLAKLATDYNITLNSLGFGDDWNQDVLEKIADAGGGNLSYIQQPEQAIEEFSK 210 Query: 405 I 405 + Sbjct: 211 L 211 >gi|315122409|ref|YP_004062898.1| hypothetical protein CKC_03305 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495811|gb|ADR52410.1| hypothetical protein CKC_03305 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 411 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 59/435 (13%), Positives = 136/435 (31%), Gaps = 61/435 (14%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M F F + +F I+ A+ + SFL ++ + D +N ++S+ + A Sbjct: 1 MFYYRNFLSVFSRLNHCTNGSFLIVSAVLLSSFLTIMDIMRDYTDMIRVRNMLQSSIDYA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 + S+ +R ++ K+ I + + G + +I+ Sbjct: 61 L------------HNNPNELSVGTIKQREML--IKKRIGYFLDSNYKG--TLLTEEQIKL 104 Query: 121 IVNSSRISMT----------HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 IVN S +S+T + ++ +M + D+ + + L ++ Sbjct: 105 IVNQSTVSITERSFYPQQFHINIELHKNIQLKSLILHMAMNPKKDFNISQRKSSLYKKNV 164 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 +V +F + V S + + +K+ Sbjct: 165 ALMVVPFTWTGEWIPP--SLFTTQFTV----SQDLLPSDLKTEHFKKTEYFNKRNQFFKM 218 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ----YVTRDMDS 286 FL I + + I Y + + + + T + YV + Sbjct: 219 -------FLSKIKENNLCIAPYHYSAIVYWSEGIFSYKLPFSTTFLYSFRDIYVKQYSTI 271 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 +KP++ L + + + +KF++ + G + + Sbjct: 272 WDMKPSNYIL-------DLFAGAELHSNRLTPADPCFRRGVIQKKFMLIIAAGNQISDRK 324 Query: 347 NV----NTIKICDKA-KENF------IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 N C K I + ++ I+ P+ +R L C P+ +Y + + Sbjct: 325 NSAEYFKMKHGCTLMGKNMGKNPQEEITVYSLGISPDPDTKRDLIQCTRHPDRYYEIQSY 384 Query: 396 DSLIHVFQNISQLMV 410 + V + + + Sbjct: 385 KDIAPVIDRLERNIS 399 >gi|297567245|ref|YP_003686217.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296851694|gb|ADH64709.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 467 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 51/166 (30%), Gaps = 30/166 (18%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK-------KRTKMAALKNAL 235 RP ++ VVD SGSM + E + K ++K+ + AL Sbjct: 39 EATRSRPQLVVAFVVDTSGSMREVVTEPTERTGQSVRVDGKDYEVVRGAKSKIDLVIEAL 98 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L S L + ++ + E + EK R + + T Sbjct: 99 QNLLSSPQLQPSD----RLAIVKFDDVAEVVQPFTPANEKARLVAAAERLTQYSGGTQMG 154 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 M++ ++L + +I LTDG+ Sbjct: 155 AGMREGMRLL-------------------EREAGSRRLILLTDGQT 181 >gi|145502106|ref|XP_001437032.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404179|emb|CAK69635.1| unnamed protein product [Paramecium tetraurelia] Length = 556 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 49/321 (15%), Positives = 110/321 (34%), Gaps = 38/321 (11%) Query: 83 SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN 142 ++ +IDD ++ I + K+ + N + ++ + + N Sbjct: 14 GEVKQKLIIDDPQKQINDLKKQIFKNFLENGINIDENQVLLQDEEGFILNESETI--GNL 71 Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 ++ + + + + Q+ + + P+ I ++ D+SGS Sbjct: 72 LKTNDIVKVVQQEQKQNIVNQQQQQQIVAQEEQQEQLQQDLNQNNDPVEAIVVLFDISGS 131 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 M ++++ +++ A+ NA L V L+ + + Sbjct: 132 MGGMY------------FKEEELSRIGAV-NAFFSAFADKTLAFEFNHIVK--LVWFESF 176 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 + + + + V D+ T A+ A + L KK Sbjct: 177 ITDKCDFTNDFNNFIKLVD---DASPRGGTKCYDAIAYAIEQLKEIKK------------ 221 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 K P++ + II LTDG++N K N ++ ++ EN I I + +N G + L Sbjct: 222 KYPNIILR--IIALTDGDDNQSKENPQSLV--NRIFENQIIIDSFVVNNDCVGLKTLTH- 276 Query: 383 VSSPEYHYNVVNADSLIHVFQ 403 ++ Y + +F+ Sbjct: 277 -ATNGRCYCPQTLAEGMSLFE 296 >gi|56797873|emb|CAG27570.1| matrilin-4 [Danio rerio] Length = 428 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 63/169 (37%), Gaps = 25/169 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 +D I L+ +G++ Y+++V+ K Q V + + L T + Sbjct: 45 MIDIIHELNIGLAATRIGVVQYSSQVQNVFSLK-AFSKTEQMVKAINEIIPLAQGTMTGL 103 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + S ++ P++P I+ TDG + + V Sbjct: 104 AIRYAMNVAFSAEE-----------GARPNVPHVAVIV--TDGRPQDRVAEVAAAAR--- 147 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 E+ I+I + + L+ S P ++ + V + LIH F Sbjct: 148 --ESGIEIYAVGVA--RADMTSLRAMASPPFEDHVFLVESF-DLIHQFG 191 >gi|294508602|ref|YP_003572661.1| aerotolerance-related membrane protein [Salinibacter ruber M8] gi|294344931|emb|CBH25709.1| aerotolerance-related membrane protein [Salinibacter ruber M8] Length = 358 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 65/196 (33%), Gaps = 34/196 (17%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + ER + + +D+S SM QD +++ KN + +D Sbjct: 80 EVRTVERRGLDLVVALDVSASM---------------RAQDVPPSRLRRAKNEIRTLVDD 124 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDSTPAMKQ 300 + +GL+ + + R ++ D + TD + A+ Sbjct: 125 L-SGD------RVGLVLFAGSGFVQSPLTTDYGAFRLFLDAAAPDQISTPGTDVSAAVDA 177 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 Q + + T P P + ++ ++DGEN+ + +A+E Sbjct: 178 GLQAFGAPRPTDDTT-------AAPEEPRPRALLIVSDGENHAGDLDAAR----QRAEEA 226 Query: 361 FIKIVTISINASPNGQ 376 + ++T + + Sbjct: 227 GVTLLTAGVGTEDGAR 242 >gi|260834334|ref|XP_002612166.1| hypothetical protein BRAFLDRAFT_88905 [Branchiostoma floridae] gi|229297540|gb|EEN68175.1| hypothetical protein BRAFLDRAFT_88905 [Branchiostoma floridae] Length = 815 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 59/174 (33%), Gaps = 23/174 (13%) Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 +D ++ +G++ Y+ R K + S T + A+ Sbjct: 334 VDMVNSFDVSPAATRVGVLQYSNRNTLVFNLGNKVNKPTTVSAINSISYQGGGTRTGAAL 393 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + N + +P K +I LTDG++ + S + + D+ Sbjct: 394 QY------------IRGNAAWRRGNVP-----KVLIVLTDGKSEDSVSGPSQNLVSDR-- 434 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 +++ I ++ + + LL+ + + + ++L I+Q + Sbjct: 435 ---VEVYAIGVSNFDH-EELLQIVNNKQSNVIELNDFNALATKIDEIAQDVCSY 484 >gi|83814223|ref|YP_446667.1| putative batB protein [Salinibacter ruber DSM 13855] gi|83755617|gb|ABC43730.1| putative batB protein [Salinibacter ruber DSM 13855] Length = 350 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 65/196 (33%), Gaps = 34/196 (17%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + ER + + +D+S SM QD +++ KN + +D Sbjct: 72 EVRTVERRGLDLVVALDVSASM---------------RAQDVPPSRLRRAKNEIRTLVDD 116 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDSTPAMKQ 300 + +GL+ + + R ++ D + TD + A+ Sbjct: 117 L-SGD------RVGLVLFAGSGFVQSPLTTDYGAFRLFLDAAAPDQISTPGTDVSAAVDA 169 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 Q + + T P P + ++ ++DGEN+ + +A+E Sbjct: 170 GLQAFGAPRPTDDTT-------AAPEEPRPRALLIVSDGENHAGDLDAAR----QRAEEA 218 Query: 361 FIKIVTISINASPNGQ 376 + ++T + + Sbjct: 219 GVTLLTAGVGTEDGAR 234 >gi|320094784|ref|ZP_08026529.1| hypothetical protein HMPREF9005_1141 [Actinomyces sp. oral taxon 178 str. F0338] gi|319978281|gb|EFW09879.1| hypothetical protein HMPREF9005_1141 [Actinomyces sp. oral taxon 178 str. F0338] Length = 352 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 53/171 (30%), Gaps = 32/171 (18%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P++ + +V+D +GSM D K+ ++ ++ + Sbjct: 55 PSVEGEATQVSSSVEVYMVIDRTGSM-------------VAEDWDGKKPRIDGVRQDVAT 101 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 L+ + +I + + V + + + V Y+ + L S+P Sbjct: 102 ILEKMA-------GSRFSIISWDSGVRTELPLTTDSTAVTSYMATFVQELSESSQGSSP- 153 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 D+ S + + + +F TDGE +N Sbjct: 154 ----------DRPASHLATILEKNKQKHPQNLRTLFVF-TDGETSNQDHWS 193 >gi|148680070|gb|EDL12017.1| mCG120740 [Mus musculus] Length = 752 Score = 40.2 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 38/233 (16%), Positives = 82/233 (35%), Gaps = 54/233 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D+SGSM ++ + A FL I ++ ++ Sbjct: 311 LVLDVSGSMA-------------------SYDRLDLMNRAAKHFLSQI-----IENRSWV 346 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + E + R + + + + T +K A+Q+ + + Sbjct: 347 GMVHFNHLANIKSELIQMNSNIERNQLLQTLPTSADGGTSICSGIKAAFQVFKNGGYETD 406 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 T I+ L+DGE++ T K C D+ K++ + I++ S Sbjct: 407 GTE----------------ILLLSDGEDS-------TAKDCIDEVKDSGAIVHFIALGPS 443 Query: 373 PNGQRLLKTCVSSPEYHYNVVNA--DSLIHVFQNISQ---LMVHRKYSVILKG 420 + + ++ ++ Y A + LI F ++ + + + KG Sbjct: 444 ADLAVTNMSILTGGKHMYASDEAQNNGLIDAFVALASENVDVTQKSLQLESKG 496 >gi|118353832|ref|XP_001010181.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291948|gb|EAR89936.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 542 Score = 40.2 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 32/229 (13%), Positives = 76/229 (33%), Gaps = 52/229 (22%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 A+ + + RP + V+D SGSM K+ +K L Sbjct: 121 AVKQESLENRPNLDLICVIDNSGSMSGE--------------------KIQNVKKTLEYL 160 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP--SWGTEK--VRQYVTRDMDSLILKPTDS 294 L+ + + LI + + + + + K ++ + + TD Sbjct: 161 LELLGDND------RLCLILFNSYATRLCHLMKTNNSNKPAFKEIINKI---YSTGGTDI 211 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 M+ A+++L K ++ ++ + L+DG+ + +++ + Sbjct: 212 NSGMELAFRVLKDRKYQNPVSS----------------VFLLSDGQ--DGSADLRVRQSL 253 Query: 355 DK-AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ + I + + +G + K C Y V + + F Sbjct: 254 ERHLPQECFTIHSFGFGSDHDGPLMNKICSLKDGNFYYVEKINQVDEFF 302 >gi|290982629|ref|XP_002674032.1| von Willebrand factor, type A domain-containing protein [Naegleria gruberi] gi|284087620|gb|EFC41288.1| von Willebrand factor, type A domain-containing protein [Naegleria gruberi] Length = 286 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 48/144 (33%), Gaps = 8/144 (5%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + L++D SGSM S + T+ + A +SI + Sbjct: 74 LQLVLIIDHSGSMGSFDEDATGQNRSKGLVDSNHWTRYDNVIQAAKYLSESIFQYDKDGK 133 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 LI + + V + I + + + V + T+ A++ A++ +D + Sbjct: 134 V---PLIFFDSNVREVI-----VDSIPRLVAAFEKNQPNSSTNLLGALELAFKNHVNDHE 185 Query: 311 RSFFTNFFRQGVKIPSLPFQKFII 334 F F P K +I Sbjct: 186 NVLFIVFTDGSPNGGQEPKIKQLI 209 >gi|262199272|ref|YP_003270481.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082619|gb|ACY18588.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 430 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 20/135 (14%), Positives = 40/135 (29%), Gaps = 25/135 (18%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 R + LV+D SGSM K++ +K A L L+++ Sbjct: 44 APATRAPLDLALVIDRSGSMSGD--------------------KLSDVKTAALELLETLQ 83 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + Y + + + R+ + T P + +A + Sbjct: 84 PEDTITLVSYSSDVSMHLMRTRADD-----AGQREARRALLALQARGGTALGPGLFRALE 138 Query: 304 ILTSDKKRSFFTNFF 318 L R+ ++ Sbjct: 139 ALEGASDRTRMSHLM 153 >gi|296139788|ref|YP_003647031.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] gi|296027922|gb|ADG78692.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] Length = 327 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 70/239 (29%), Gaps = 51/239 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L +D+S SM D + ++ A K A F+ + Sbjct: 87 RNRATVVLAIDVSLSMEAR---------------DVEPDRLTAAKEAAKKFVTEL----- 126 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V +G++ + T + + + T + + + Q + + Sbjct: 127 -PNGVNLGIVSF---AGTASLLVSPTPDRTLALNAVDKLELAQRTATGEGIYTSIQSIKN 182 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS--NVNTIKICD-KAKENFIKI 364 + + II +DG+ + KAKE I I Sbjct: 183 IRDVLGGEDNAPPAR----------IILESDGKQTVPTDLDDPRGGFTAARKAKEEGIPI 232 Query: 365 VTISI-----NASPNGQRL--------LKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 TIS + + GQ + LK S + + + L + ++ + Sbjct: 233 STISFGTTSGSVNIGGQNIPVPVDDASLKRIAELSGGQFFAASSLNDLNEAYGSLRDEI 291 >gi|268610218|ref|ZP_06143945.1| von Willebrand factor, type A [Ruminococcus flavefaciens FD-1] Length = 565 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 77/216 (35%), Gaps = 44/216 (20%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 N + N + + L + V+D SGSM A Sbjct: 366 NIKVNGNSWTQMQKLWKTNKNSGKPIAAVFVLDTSGSMSGA------------------- 406 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +LK +L + I+ + Y+G++ Y++ V ++E + + Y +D Sbjct: 407 -PLNSLKASLRNSIKYINSSN------YIGVVSYSSNVNVDLELAKFDLNQQAYFMGAVD 459 Query: 286 SLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 SL T + A+ QA +L K + + + L+DG++N+ Sbjct: 460 SLTASGNTATFSALSQAMIMLRDFTKDNPNVSPM--------------VFLLSDGQSNSG 505 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 + A+ I I TI NA+ N + + Sbjct: 506 SEFSDIDGAIATAQ---IPIYTIGYNANLNELKAIS 538 >gi|160858157|emb|CAP19998.1| collagen type VI alpha 5 [Homo sapiens] Length = 609 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 40/309 (12%), Positives = 101/309 (32%), Gaps = 56/309 (18%) Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVFYNTEIQNIV 122 + D + +S + L A F+ + S + ++++ Sbjct: 342 QEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGARLGAKKFLILITDGVAQDDVRDPA 401 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 R + + + ++N + + +S + ++ +HL + +K++ + AL Sbjct: 402 RILRGKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFDHL--KALERKLIFRVCAL-- 457 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + + V+D SGS+ QD + L + + Sbjct: 458 HDCKRITLLDVVFVLDHSGSI-------------KKQYQDHM----------INLTIHLV 494 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--LILKPTDSTPAMKQ 300 ++ V G + Y+ + T R + ++ T + A+K Sbjct: 495 KKADVGRDRVQFGALKYSDQPNILFYL--NTYSNRSAIIENLRKRRDTGGNTYTAKALKH 552 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A L ++ + ++ +I +TDGE+++ +T + + Sbjct: 553 A-NALFTE-----------EHGSRIKQNVKQMLIVITDGESHDHDQLNDTAL---ELRNK 597 Query: 361 FIKIVTISI 369 I I + + Sbjct: 598 GITIFAVGV 606 >gi|197118219|ref|YP_002138646.1| hypothetical protein Gbem_1835 [Geobacter bemidjiensis Bem] gi|197087579|gb|ACH38850.1| conserved hypothetical protein [Geobacter bemidjiensis Bem] Length = 356 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 40/251 (15%), Positives = 77/251 (30%), Gaps = 18/251 (7%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 K I SEK + A+ +M +G + + +++A +AA L GA + + Sbjct: 6 KLIRSEKGMVIVYVAILLMMMFGFLGLAVDGGHLFKVRGELQNAADAAALKGAWHLYTRP 65 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + A++ I + ++ + A+ N + + Sbjct: 66 TD----PTQLPTLQWEVARFQAQQMITENSSDNTALKDAMVDVGYWNTNSNILQPTTLPT 121 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 +D + + + F + + Y+Q A+L P Sbjct: 122 PVPGVDIPAVRVIASRSDGNNGGPVKNFFMQIFGKDYSQVSSRAAVAMLGFPYTVPPAVP 181 Query: 193 IELV-VDLSGSMHCAMNSD-------PEDVNSAP------ICQDKKRTKMAALKNALLLF 238 EL + LS M S PE S+P C + T A N + Sbjct: 182 AELFPLALSKCMTDQYFSQVPMPDPPPEIRISSPYIPGGDTCYSGQWTSFKADTNDVRTI 241 Query: 239 LDSIDLLSHVK 249 D + + Sbjct: 242 KDLMYKGNPEP 252 >gi|107028246|ref|YP_625341.1| hypothetical protein Bcen_5496 [Burkholderia cenocepacia AU 1054] gi|116687157|ref|YP_840404.1| hypothetical protein Bcen2424_6782 [Burkholderia cenocepacia HI2424] gi|105897410|gb|ABF80368.1| conserved hypothetical protein [Burkholderia cenocepacia AU 1054] gi|116652872|gb|ABK13511.1| conserved hypothetical protein [Burkholderia cenocepacia HI2424] Length = 423 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 15/134 (11%), Positives = 52/134 (38%), Gaps = 6/134 (4%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+ +R+ ++ + ++ +II L++ + +G + + + ++ ++++ +A Sbjct: 1 MNRAARYPKVTRRSLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADAC 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L+ A + S +S + I A F + + + + F ++ Sbjct: 61 ALSAARDLTSAISLSVAEADGI------AAGHLNFVFFQKTSVQMSTNANVTFSDSLTNP 114 Query: 121 IVNSSRISMTHMAN 134 + + ++ Sbjct: 115 FLTKNAVTTPANIK 128 >gi|312886237|ref|ZP_07745851.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] gi|311301262|gb|EFQ78317.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] Length = 348 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 29/213 (13%), Positives = 72/213 (33%), Gaps = 45/213 (21%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 L+FI ++ + + ++E +R + +++D+S SM Sbjct: 57 LKFILFIIAYGFLIVAAADPQVGSKMEEVKRKGADLMILLDVSNSMLS------------ 104 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 QD ++ K A+ +D++ D +G+I + + + + + Sbjct: 105 ---QDLSPNRLENAKRAISQLIDNL-------HDDRIGIIVFAGQAYVQLPITTDYSAAK 154 Query: 278 QYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 ++ + + + T A+ Q K +I + Sbjct: 155 LFLNTINTNMVPTQGTAIGAAIDLGMQSFDFKN------------------GMSKAMIVI 196 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 TDGEN+ + + + A++ + + I + Sbjct: 197 TDGENHEDDA----VSAANHARDKDVTVNVIGV 225 >gi|238020799|ref|ZP_04601225.1| hypothetical protein GCWU000324_00689 [Kingella oralis ATCC 51147] gi|237867779|gb|EEP68785.1| hypothetical protein GCWU000324_00689 [Kingella oralis ATCC 51147] Length = 554 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 84/236 (35%), Gaps = 47/236 (19%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 Q + I I A P L+ LV D+SGSM + Sbjct: 164 PWQADAKLIKIAIKAKEIRSSALPPANLVFLV-DVSGSM-------------------QA 203 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + K+ +K L + + V LI Y + + + P+ G + + + + + Sbjct: 204 QDKLPLVKKTLRILTKRLRAEDKVT------LITYASNEKLVLPPTSGKD--KDTILQAI 255 Query: 285 DSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + L T A++ AY + ++++ N + I+ TDG+ N Sbjct: 256 NQLEAGGSTAGEQALQMAY----AQAQKAYIKNGINR------------ILLATDGDFNV 299 Query: 344 FKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADS 397 ++ NT+K + ++ I + T+ ++L++ + + + N Sbjct: 300 GITDFNTLKDTVAEKRKAGISLTTLGFGTGNYNEQLMEQLADAGDGNYSYIDNETE 355 >gi|90409149|ref|ZP_01217268.1| hypothetical protein PCNPT3_10636 [Psychromonas sp. CNPT3] gi|90309757|gb|EAS37923.1| hypothetical protein PCNPT3_10636 [Psychromonas sp. CNPT3] Length = 226 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 62/184 (33%), Gaps = 36/184 (19%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 FI L + I I ++ + L +DLSGSM+ Sbjct: 56 FILLLAWTLVIVALAKPIYYGEPIRAQQQSR-NMILSLDLSGSMN----------EVDMR 104 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + T+++ +K+ L F+ + ++ +GLI + ++ + + Q Sbjct: 105 LDGQSVTRLSLVKSLLKKFVAT-------RQGDRLGLILFADHAYLQTPLTFDLKTIAQR 157 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + L+ T ++ A + +K + Q+ +I LTDG Sbjct: 158 IEETQIGLVGIRTAIGESIAIAIKRFVKNKNK------------------QRILILLTDG 199 Query: 340 ENNN 343 N Sbjct: 200 SNTA 203 >gi|281338025|gb|EFB13609.1| hypothetical protein PANDA_007564 [Ailuropoda melanoleuca] Length = 901 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 51/165 (30%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM KM + A + +D LS + Sbjct: 244 VIFVIDKSGSMSGR--------------------KMQQTREA---LIKILDDLSPKDQF- 279 Query: 253 YMGLIGYTTRVEKNIEP-----SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++ + + + R Y TD A+ A Q+L S Sbjct: 280 --NLISFSGDAAQWKPLLVPASAENVNQARSYAAGIQA---HGGTDINEAVLMAVQLLNS 334 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 K++ II LTDG+ ++N I+ Sbjct: 335 AKQKELMPEGTVS-----------LIILLTDGDPTMGETNPARIQ 368 >gi|282877523|ref|ZP_06286341.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] gi|281300347|gb|EFA92698.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] Length = 332 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 42/246 (17%), Positives = 71/246 (28%), Gaps = 72/246 (29%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM D + ++ A KN F+ Sbjct: 88 IDIMLAMDVSTSMLAE---------------DLRPNRLEAAKNVAAEFI----SGRPNDN 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVR-----QYVTRDMDS--LILKPTDSTPAMKQAYQ 303 +GL + + T+ + V D+ + LI T + A Sbjct: 129 ---IGLTIFAGESFTQCPMT--TDHASLLNLLRNVRTDIAARGLISDGTAVGMGLANAVS 183 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K +S K +I +TDG NN + T A+ I+ Sbjct: 184 RLKDSKAKS------------------KVVILITDGSNNMGDISPMTSAQ--IAQSLGIR 223 Query: 364 IVTISI----NAS-----------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + TI + A + + L + Y N L ++ Sbjct: 224 VYTIGVGTNKVAPYPMNVGGTTQYVNIPVEIDSKTLSDIAAVTEGNFYRATNNKELKQIY 283 Query: 403 QNISQL 408 +I +L Sbjct: 284 NDIDKL 289 >gi|11498366|ref|NP_069594.1| hypothetical protein AF0760 [Archaeoglobus fulgidus DSM 4304] gi|2649856|gb|AAB90485.1| predicted coding region AF_0760 [Archaeoglobus fulgidus DSM 4304] Length = 959 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 41/219 (18%), Positives = 66/219 (30%), Gaps = 52/219 (23%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI----EPSWGTEKVRQYVT 281 +M A K A + F + + V GL +TT E+ + T+ Sbjct: 609 KRMDAAKLAAITFNNMLGEGDFV------GLATFTTYAERISVNQTPLKYMTK------D 656 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK-FIIFLTDGE 340 + + ++ + A A L K Q + + +I LTDGE Sbjct: 657 KLRVNNEIEGLYAKLATDHA-DALYWGVKVFPIWPDETQNNCTECINNTRPLMILLTDGE 715 Query: 341 N-----NNFKSNVNTIKIC----------------DKAKEN------FIKIVTISINAS- 372 N N C D K N I I TI Sbjct: 716 TTTCDKNEDYFNNTCKNKCVRDNGHYGAQQALCVADYIKRNIKVNGFNIPICTIGFGTDI 775 Query: 373 -PNGQRLLKTCVSS-PEY----HYNVVNADSLIHVFQNI 405 +GQ L+ S P+ ++ ++ LI ++ I Sbjct: 776 GSDGQAFLRDIASPRPDNGEACYFFATTSEELIEAYKTI 814 >gi|77359874|ref|YP_339449.1| pilin biogenesis protein [Pseudoalteromonas haloplanktis TAC125] gi|76874785|emb|CAI86006.1| conserved protein of unknown function ; putative pilin biogenesis protein [Pseudoalteromonas haloplanktis TAC125] Length = 1056 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 68/213 (31%), Gaps = 49/213 (23%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSD-------------------------PEDVNSA 217 +E E+P + ++ D SGSM ++ E Sbjct: 32 VETKEKPR--VMMIFDTSGSMAWDVSDGDACYMRSGNRYYEVDCFQSKNSYNRNEQCYKR 89 Query: 218 PICQDKKRTKMAA-LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + + T + LK A I L + ED+ GL+ + + G +K Sbjct: 90 SSYYNYEATCSDSRLKVAQNA----IKQLVNDNEDIEFGLMRFNGDNGGYVLARVGADK- 144 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK----KRSFFTNFFRQGVKIPSLPFQKF 332 + + + + T T + +AY +T K + + + + PF Sbjct: 145 KSLLEKIEELPANGATPLTETLWEAYLYITGQKVYYGNNTNDRDKTAESSGYYNSPFVPV 204 Query: 333 ------------IIFLTDGENNNFKSNVNTIKI 353 II +TDG+ +N + N I Sbjct: 205 TGEPLRCDNSINIILMTDGDPSNDSNRNNDIYY 237 >gi|315181058|gb|ADT87972.1| hypothetical protein vfu_A02859 [Vibrio furnissii NCTC 11218] Length = 406 Score = 40.2 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 56/167 (33%), Gaps = 15/167 (8%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 +GI ++ +I +++++ + + F I + + ++++ +AA LA A + + Sbjct: 8 RGIRKQRGLVVVIVTIAMLALIAVAAFAIDINHAMMNRTKLQNSVDAAALAAAIVLDKDG 67 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + A D R + + +++ N+ T Sbjct: 68 T--------------EAQADTIARSTLTKMSTAAGNAELTLDVSDVVNVEVQFSNDPTVF 113 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 ++ SS + Y V+ D F L + S + Sbjct: 114 PDSGYSSSPDGDRYVRVVINQLDLESFFFARALGVT-KRLTASAVAG 159 >gi|315266493|gb|ADT93346.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 627 Score = 40.2 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 76/232 (32%), Gaps = 54/232 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + ++D+SGSM K+ L+ AL L + V Sbjct: 221 GASNLVFLLDVSGSMA-------------------SVDKLPLLQTALKLLTAQLSAQDKV 261 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTS 307 VY G G ++ Q + ++ L T+ + QAYQ Sbjct: 262 SIVVYAGAAG--VVLDGAS------GNDTQTLNYALEQLSAGGSTNGGQGITQAYQ---- 309 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVT 366 K+ F N + +I TDG+ N ++ + I + +K K++ I + T Sbjct: 310 LAKKHFIPNGINR------------VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTT 357 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNA---------DSLIHVFQNISQLM 409 + +L++ +Y ++ D L I++ + Sbjct: 358 LGFGLGNYNDQLMEQLADKGNGNYAYIDTLNEARKVLVDELSSTLFTIAKDV 409 >gi|257892784|ref|ZP_05672437.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,408] gi|257829163|gb|EEV55770.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,408] Length = 677 Score = 40.2 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 42/319 (13%), Positives = 95/319 (29%), Gaps = 53/319 (16%) Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY-NQKIVSFIPALLRIEMGERPIFL 192 N + T +YDY + + Y N + +F+ ++ G + +F Sbjct: 39 QNNAEGEFLTNGVKEGDTYNYDYGNAPEANFDEENYINYENSAFVKKSVKKAEGIQGLFD 98 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L + + + V K++ L F + +D S ++ Sbjct: 99 VTLAIK-----GNQLKKPIDLVMVIDYSSSMTGEKLSNALKGLQEFGEELDD-SLESGNI 152 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK---------------------- 290 +G++ Y V + ++ +Y R+ Sbjct: 153 RIGIVAYNRFVYSTDDFLTDINQL-EYFLRNTAESHTGTFMQKGLLEGQSLLEEKSRPEA 211 Query: 291 -----------PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 S + A S + + Q V +K+ T G Sbjct: 212 EKMLVHIGDDSANRSYLPKENAQVFHNSGEIVDYNGYHTDQYVTEFQTDSEKY---QTSG 268 Query: 340 ENNNFKSNVNTIKICDKA--------KENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + + + + + + A K IK +++ S G+ + + SSP + + Sbjct: 269 SSTDPNAVSVSSSLINDATLGTIISIKNAGIKCYSVATAPSSRGEYIGRNLASSPNNYLS 328 Query: 392 V-VNADSLIHVFQNISQLM 409 + N L + + I+ + Sbjct: 329 IDENLTGLGNALKEIANGI 347 >gi|315647020|ref|ZP_07900133.1| D-amino acid dehydrogenase, large subunit [Paenibacillus vortex V453] gi|315277222|gb|EFU40551.1| D-amino acid dehydrogenase, large subunit [Paenibacillus vortex V453] Length = 471 Score = 40.2 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 67/210 (31%), Gaps = 29/210 (13%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I +++D SGSM +N ++KM + K A+ F + Sbjct: 163 LHISILLDASGSMKAQING---------------KSKMDSAKEAIQTF------ADKLPN 201 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + + L Y + G +K ++ + + D T +K A Q + Sbjct: 202 NAEVSLRVYGHKGT-------GDQKDKRVSCDSTEEIFHGQGDQTNQIKTALQKVEPAGW 254 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 I + ++DG K ++K I + + + Sbjct: 255 TPIANALQSVKKDINPETTDSVVYVVSDGIETCGGKPAQVAKELHQSKVKTI-VNIVGFD 313 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 GQ+LL+ +S + VN D + Sbjct: 314 VDNEGQKLLRQIAASGGGEFMSVNNDEALK 343 >gi|254436533|ref|ZP_05050027.1| von Willebrand factor type A domain protein [Octadecabacter antarcticus 307] gi|198251979|gb|EDY76293.1| von Willebrand factor type A domain protein [Octadecabacter antarcticus 307] Length = 613 Score = 40.2 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 59/178 (33%), Gaps = 44/178 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +RP + ++D SGSM + K+ L+ + L LD++ Sbjct: 250 DRPPLNLVFLIDTSGSM-------------------ESADKLPLLRQSFRLMLDNLRPED 290 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQIL 305 V Y G + E R + +++L T+ ++QAY + Sbjct: 291 EVAIVTYAGSTSIALEPTQASE--------RATIIAALNALNAGGSTNGQGGLEQAYALA 342 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + K + +I TDG+ N S+ ++ + K + + Sbjct: 343 ETMKTAGDVSR----------------VILATDGDFNVGLSDPRGLQAYIEDKRDDAQ 384 >gi|205374347|ref|ZP_03227145.1| hypothetical protein Bcoam_14574 [Bacillus coahuilensis m4-4] Length = 1083 Score = 40.2 bits (92), Expect = 0.66, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 54/173 (31%), Gaps = 44/173 (25%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ER + V D SGSM + S + ++ + K+A +++ Sbjct: 70 ERDPIDLVFVFDKSGSMDFKVAS------------NSSVKRIDSAKSA---MTNALMFFD 114 Query: 247 HVKEDVYMGLIGYTTRVEKNI-----EPSWGT-----EKVRQYVTRDMDSLILKPTDSTP 296 G + +++ ++ WG+ K++ + M T+ T Sbjct: 115 GQNTSDRFGFVPFSSNANTDVVSLTDSSGWGSSSYTNSKLQTIHNKTMGLSASGGTNYTE 174 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 A+ A L S K IIFLTDG S+ Sbjct: 175 ALDVA-SKLFD------------------SSSKDKNIIFLTDGTPTFSFSDEK 208 >gi|118617118|ref|YP_905450.1| hypothetical protein MUL_1447 [Mycobacterium ulcerans Agy99] gi|118569228|gb|ABL03979.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99] Length = 733 Score = 40.2 bits (92), Expect = 0.66, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 50/170 (29%), Gaps = 39/170 (22%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 E P + +V+D SGSM KM A + A +D + Sbjct: 262 AEPSSAPR-DVVVVLDRSGSMGGW--------------------KMVAARRAAGRIVDML 300 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D ++ + R+E G R S L S A Sbjct: 301 DAGD------RFCVLAFDDRIETPPAMPDGLVPASDR-NRFAASSWLGSLRSRGGTVMA- 352 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 Q LT+ + + RQ ++ + DG+ + + ++ Sbjct: 353 QPLTNAVEMLADSGEDRQAS----------VVLVADGQISGEDHLLRSLA 392 >gi|118353826|ref|XP_001010178.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291945|gb|EAR89933.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 547 Score = 40.2 bits (92), Expect = 0.67, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 72/217 (33%), Gaps = 46/217 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP + V+D SGSM K+ +K L L+ + Sbjct: 121 RPNLDLVCVIDRSGSMSGN--------------------KIENVKKTLEYLLELLGEND- 159 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILT 306 + LI + + V + + + + ++ + T+ M+ A+++L Sbjct: 160 -----RLCLIAFDSCVSRRCHLMKTNSSNKPNLIKIINEIHCHGGTNINSGMELAFRVL- 213 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK-AKENFIKIV 365 K+R ++ I L+DG++ +++ + +K + I Sbjct: 214 --KERKYYNPVSS-------------IFLLSDGQDGG--ADLRVRQSLEKHLSQECFTIH 256 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + +G + K C Y V + + F Sbjct: 257 SFGFGSDHDGPLMNKICSLKDGNFYYVEKINQVDEFF 293 >gi|310830437|ref|YP_003965538.1| ABC-type cobalt transport system, permease component CbiQ and transporter [Paenibacillus polymyxa SC2] gi|309249904|gb|ADO59470.1| ABC-type cobalt transport system, permease component CbiQ and transporter [Paenibacillus polymyxa SC2] Length = 338 Score = 40.2 bits (92), Expect = 0.67, Method: Composition-based stats. Identities = 40/269 (14%), Positives = 87/269 (32%), Gaps = 30/269 (11%) Query: 9 FYFKKGIASEKA---NFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 F K I EK F I ++ F + I + + K+ ++ N+AA AGA Sbjct: 2 FKRIKKIRDEKGFVSVFVIFAVSFILPFFIF--HTIEMTYLYGMKDKFQNFNDAAASAGA 59 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS--AVFYNTEIQNIVN 123 ++ + GD + D+ K +K + E+ + N+ I ++ N Sbjct: 60 MQLEKKVVSKGDL---------KFHEDEVKNVVKRILSENYGLDAHLNPQENSYITDVPN 110 Query: 124 ------SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + + + + V T + F + + + Sbjct: 111 VKVSIVNQESDLPKELLTQEGFKFEIKHPTIIVYTEVKPKGIFFNKFITIKSISALEVSF 170 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 R E+ P + ++ +M+ +N N+ + + T M + Sbjct: 171 KTGGRTELTNNPQPSKDGLI---FTMNKVVNPLHFPKNTKLVPMNWSWTNMPMAAGGDIE 227 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 F +++ +V + Y R++ Sbjct: 228 F-----SVTNTNSNVQLKAASYELRLKGM 251 >gi|291295671|ref|YP_003507069.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470630|gb|ADD28049.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 744 Score = 40.2 bits (92), Expect = 0.67, Method: Composition-based stats. Identities = 50/243 (20%), Positives = 85/243 (34%), Gaps = 52/243 (21%) Query: 175 SFIPALLRIEMGERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + + L +E E P I LV+D+SGSM L Sbjct: 319 TSLADSLPVEPVEEPGGVGIVLVLDVSGSMLED----------------------DKLGL 356 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPT 292 A+ L+ I Y+G++ ++ R P TE+ R+ + S T Sbjct: 357 AVTGSLELIRSARPQD---YIGVVVFSDRPRWLFRPRPMTEQGRKEAESLLLSTQAGGGT 413 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 A +A + L S K +I LTDG + ++ Sbjct: 414 MIRRAYLEALEALEQVPTES------------------KQVIALTDGLAADVTPDLF--- 452 Query: 353 ICDKAKENF--IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 D A+E IK T++I A +G+ L + + +++V + L F +Q + Sbjct: 453 --DAAREASPRIKTNTVAIGADADGRFLRELAQAGDGTYWDVPRPEDLPRFFLEEAQRVF 510 Query: 411 HRK 413 R+ Sbjct: 511 RRE 513 >gi|51476525|emb|CAH18248.1| hypothetical protein [Homo sapiens] Length = 637 Score = 40.2 bits (92), Expect = 0.67, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 55/178 (30%), Gaps = 36/178 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + AL+ LD + V Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREALIKILDDLSPRDQFNLIV 314 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + R + K R + L T+ AM A Q+L S + Sbjct: 315 FSTEATQ-WRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDSSNQ-- 368 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIVTISI 369 + + S+ II LTDG+ ++N +I +A + + Sbjct: 369 ------EERLPEGSVSL---IILLTDGDPTVGETNPRSIQNNVREAVSGGYSLFCLGF 417 >gi|326529585|dbj|BAK04739.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 742 Score = 40.2 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 29/234 (12%), Positives = 67/234 (28%), Gaps = 41/234 (17%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + V+D+SGSM TK+A LK A+ + ++ Sbjct: 282 RAPVDLITVLDVSGSMAG--------------------TKLALLKRAMGFVIQNLGSSD- 320 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAM-------- 298 + +I +++ + T +Q + T+ + Sbjct: 321 -----RLSVIAFSSSARRLFPLRRMTESGRKQSLLAVNSLTSNGGTNIAEGLRKGSKVIE 375 Query: 299 -KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 +QA + S S + + + + + T G + + + Sbjct: 376 ERQAKNPVCSIILLSDGQDTYTVSPSTG--AHKPYTVSPTAGAQKASAEYCALLPSTNGS 433 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV 410 ++ + + A + L +S + ++ F I L+ Sbjct: 434 QQVPVHVF--GFGADHDAVSLHSISQTSGGTFSFIETEATIQDAFAQCIGGLLS 485 >gi|313233190|emb|CBY24305.1| unnamed protein product [Oikopleura dioica] Length = 1902 Score = 40.2 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 57/302 (18%), Positives = 100/302 (33%), Gaps = 27/302 (8%) Query: 36 LIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAK 95 +I L W + +N + + I AG + + +L R IS R Sbjct: 632 MITIFCEKLGWAHLRNLL-ATFQNQIEAGVHRELLDLVRCPF----ISARRARQFYKLGY 686 Query: 96 RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + +KN +K +S F E I + + + +++S M V+ ++ Sbjct: 687 KRLKNIVKADISEIVEAFKQMESF-ISSKPQTGENELPKEKVESRG------MVVLQGFN 739 Query: 156 YRLQFIEHL-LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 L FIE + Q+ I+ L E E F + D S++ S D Sbjct: 740 KPLLFIEAAQMLQKEIFAIIGTETILDEPESDEHSRFQTQNSFDNMSSINATAKSIACDA 799 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 +P+C K +TK + +A F DS LL + L V P Sbjct: 800 TISPVCSRKSQTKADTIIDA--AFSDSKSLLDASHPTIQSHLPEIDDLVTSQFVPDPNDA 857 Query: 275 KVRQYVTRDMDSLILK----------PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + + + + L TD + + Q L S K+ + + Sbjct: 858 SMMESLNDSLAPLYSGLPRSEHSSLSSTDESEDNES--QELFSASKKGRDEQSSLKKCQS 915 Query: 325 PS 326 + Sbjct: 916 ET 917 >gi|198430849|ref|XP_002120173.1| PREDICTED: similar to cGMP-dependent protein kinase 1, alpha isozyme (CGK 1 alpha) (cGKI-alpha) [Ciona intestinalis] Length = 1896 Score = 40.2 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 54/160 (33%), Gaps = 42/160 (26%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL--SH 247 I I V+D SGS+ +++ K AL ++ + Sbjct: 838 IVDIIFVLDESGSVS-----------------------LSSYKQALRWVVELLTSFREDV 874 Query: 248 VKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 K +V++G++ + + I ++ ++ + + T+ PA+ + + Sbjct: 875 DKGNVHVGVVAFHSWAGTRIALGAFEFSNLQARIIALSNGRNYGGTNIAPAIDETLREFN 934 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + QK +I +TDG ++ + Sbjct: 935 RN----------------GRTGIQKQMILMTDGYSSYPNA 958 >gi|52425826|ref|YP_088963.1| hypothetical protein MS1771 [Mannheimia succiniciproducens MBEL55E] gi|52307878|gb|AAU38378.1| unknown [Mannheimia succiniciproducens MBEL55E] Length = 541 Score = 40.2 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 28/272 (10%), Positives = 80/272 (29%), Gaps = 35/272 (12%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + K+ +E +++I + + L+ + K A LA Sbjct: 6 YFIKLKQFFQNEDGAYAVIMGILSFFLIGLVALTVDGSGMLLDKARFSQGIEQAGLA--- 62 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF-------YNTEIQ 119 +++ + + ++ ++ + + + + Y Sbjct: 63 -LMAENNDFRTTNQKHADVLRQTVTKEELEGFSDTFSAQKYKRNQELVSGLVRHYYYPST 121 Query: 120 NIVNSSRISMTHMAN-NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY-----NQKI 173 ++ +IS + N L N ++ S + + L + Sbjct: 122 YFKDNLKISDKYDYQCNNLQGPNGEQLKSIACEISGKFE-RPSWLYLGKNNGLSFAETTT 180 Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 ++ ++ + E + LV DLSGSM+ +++ K+ L+ Sbjct: 181 INANKIYIQKNLDEIIPIDLMLVADLSGSMNSSVSGTKYGTA-----------KIDILRE 229 Query: 234 ALLLFLDSIDLLSH------VKEDVYMGLIGY 259 + + ++ + + +G + Sbjct: 230 VVSAIAKELLEQNNTEEGKVISQYNRIGFTSF 261 Score = 39.5 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 35/163 (21%), Positives = 58/163 (35%), Gaps = 31/163 (19%) Query: 266 NIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 N W G + V+R L T S+ + +L + Q K+ Sbjct: 367 NSTYIWAGKNESADLVSRFNRVPALGATLSSSGLLIGANLLMNTNPDEN-----AQPSKL 421 Query: 325 PSLPFQKFIIFLTDGE---NNNFKSNVNTIKI-----CDKAKE------------NFIKI 364 Q+ I+ L+DGE NN S T + C+K K +I Sbjct: 422 G-ANTQRIILVLSDGEDQINNASSSLNITSTLINQGMCEKIKSKLNSLQDKTYLEQPTRI 480 Query: 365 VTISINASPNG--QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 ++ P+G + + CV +Y+Y N + L+ F+ I Sbjct: 481 GFVAFGYGPSGTQKAAWEKCV--GKYYYVANNKEELLESFRKI 521 >gi|326435505|gb|EGD81075.1| hypothetical protein PTSG_11020 [Salpingoeca sp. ATCC 50818] Length = 552 Score = 40.2 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 42/261 (16%), Positives = 82/261 (31%), Gaps = 43/261 (16%) Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA-LLRIEMGERPIFLIELVVDLSGSMHC 205 ++ +++ +F+ + QR+ V + A + E R I V+D+SGSM Sbjct: 20 SVAARLTFESTSEFVAYSRGQRHKINSVFHVQAPEYQGEEAARGAVDIVAVIDVSGSMSG 79 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 +K+ K L + ++ H MGL+ Y + V Sbjct: 80 --------------------SKLDLAKATLEFLIKNLSQTDH------MGLVVYHSDVSV 113 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + R T + ++ Q T+ F QG + Sbjct: 114 AFPLT-----------RMDAEGKRTATAALSTLRA--QRCTNLSGGLFKGIEMMQGRERS 160 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDK--AKENFIKIVTISINASPNGQRLLKTCV 383 + ++ +TDG N N I + + T ++ + L Sbjct: 161 AASVSS-VLLMTDGIANEGVRGPNLITATRQLMGDNPSYSLYTFGYGSNHEEELLKDLSE 219 Query: 384 SSPEYHYNVVNADSLIHVFQN 404 +Y + N D++ F + Sbjct: 220 VGNGMYYYIENNDTIPESFGD 240 >gi|94499792|ref|ZP_01306328.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65] gi|94427993|gb|EAT12967.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65] Length = 731 Score = 40.2 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 78/235 (33%), Gaps = 62/235 (26%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 L++D+SGSM Q K +T + +K L F + ++ V + Sbjct: 355 LLLDISGSM-----------------QGKFQTLIEGVKKGLKRF--------NPQDRVRV 389 Query: 255 GLIGYTTRVEKNIEPSWGT-----EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + + + G + + + + + L T ++ A L +D+ Sbjct: 390 --VLFNDY---ASNLTGGFLPATQKNIAEIIRKLDLVLPNGGTHLMDGVRFALSGLDADR 444 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + I +TDG N K D K+ I++ T I Sbjct: 445 TSA--------------------IWLVTDGVTN--VGETKQRKFVDLLKQKDIRVFT-FI 481 Query: 370 NASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVH---RKYSVILKG 420 + + LLK + NV N+D +I + + + H R V KG Sbjct: 482 MGNGANRPLLKAITKASNGFAINVSNSDDIIGQLEKAASKVTHEALRDIKVRFKG 536 >gi|118356063|ref|XP_001011290.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89293057|gb|EAR91045.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 520 Score = 40.2 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 38/239 (15%), Positives = 85/239 (35%), Gaps = 51/239 (21%) Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 ++N + + + S + S Y + S +L + +++++ NQ Sbjct: 10 YFNKKDNSFQDKSLAEQISGLHTDTGSDQTKADYFLKDNLSLSIKLLNNKIVMDKKNNQI 69 Query: 173 IVSFIPALLRIEMGERPI--------FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 I + + ++ + V+D SGSM Sbjct: 70 IPGVVSVEAKDFDADQVKKDKVRYQPLDLIFVIDTSGSMQGK------------------ 111 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K+ +K ++L L I + L+G+ ++ + +E + T+ ++ + + + Sbjct: 112 --KIELVKKSILQVLHIIQGDD------RISLVGFNSQAKVLLELTQLTKNSKKKIQKTV 163 Query: 285 DSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 D L T M++A+ I+ K+R+ N I L+DG++N Sbjct: 164 DELQAGGGTQIGFGMQKAFDII---KERTNSKNLAS-------------IFLLSDGQDN 206 >gi|257886572|ref|ZP_05666225.1| predicted protein [Enterococcus faecium 1,141,733] gi|257822626|gb|EEV49558.1| predicted protein [Enterococcus faecium 1,141,733] Length = 689 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 42/319 (13%), Positives = 98/319 (30%), Gaps = 53/319 (16%) Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY-NQKIVSFIPALLRIEMGERPIFL 192 N + T +YDY + + Y N + +F+ ++ G + +F Sbjct: 39 QNNAEGEFLTNGVKEGDTYNYDYGNAPEANFDEENYINYENSAFVKKSVKKAEGIQGLFD 98 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L + + ++ S+ + +K + L+ DS++ + + Sbjct: 99 VTLAI-KGNQLKKPIDLVMVIDYSSSMTGEKLSNALKGLQEFGEELDDSLESGN-----I 152 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK---------------------- 290 +G++ Y V + ++ +Y R+ Sbjct: 153 RIGIVAYNRFVYSTDDFLTDINQL-EYFLRNTAESHTGTFMQKGLLEGQSLLEEKSRPEA 211 Query: 291 -----------PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 S + A S + + Q V +K+ T G Sbjct: 212 EKMLVHIGDDSANRSYLPKENAQVFHNSGEIVDYNGYHTDQYVTEFQTDSEKY---QTSG 268 Query: 340 ENNNFKSNVNTIKICDKA--------KENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + + + + + + A K IK +++ S G+ + + SSP + + Sbjct: 269 SSTDPNAVSVSSSLINDATLGTIISIKNAGIKCYSVATAPSSRGEYIGRNLASSPNNYLS 328 Query: 392 V-VNADSLIHVFQNISQLM 409 + N L + + I+ M Sbjct: 329 IDENLTGLGNALKEIANGM 347 >gi|158260465|dbj|BAF82410.1| unnamed protein product [Homo sapiens] Length = 937 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 672 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYM 728 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 729 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 775 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 776 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 831 Query: 407 QLMVH 411 + + Sbjct: 832 KGICE 836 >gi|149200157|ref|ZP_01877181.1| hypothetical protein LNTAR_03324 [Lentisphaera araneosa HTCC2155] gi|149136798|gb|EDM25227.1| hypothetical protein LNTAR_03324 [Lentisphaera araneosa HTCC2155] Length = 348 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 47/257 (18%), Positives = 86/257 (33%), Gaps = 53/257 (20%) Query: 179 ALLRIEMGERPIFL----IELV--VDLSGSMHCAMNSDPEDVNSA----PICQDKKRTKM 228 AL R GE + +++V +D+SGSM + VN I + ++ Sbjct: 69 ALARPRTGEENSYSYKDSVDIVFSLDISGSMSSYDQPEDLAVNRRVIAEAINNKELHPRL 128 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K ++ F+D +GL+ + P+ E ++ + + Sbjct: 129 HYAKKSIADFIDKRKSD-------RLGLVVFGAEAYSVCPPTNDHEYLQNRLKEISTEYL 181 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 Y T+ + K P +K II +TDG + SN+ Sbjct: 182 GD-----------YNRQTNITAAISGGLARLRKSKAP----KKIIILVTDGSHTA-NSNL 225 Query: 349 NTIKICDKAKENFIKIVTISI-----------------NASPN--GQRLLKTCVS-SPEY 388 A ++ I TI + NAS + + LLK + Sbjct: 226 TPRMAAKAAAKSDAVIYTIGVGNEVAWNVENFFGSSRLNASNSDFDEELLKEIAEKTGGL 285 Query: 389 HYNVVNADSLIHVFQNI 405 +++V A+ + V + I Sbjct: 286 YFSVREAEQMKDVLKKI 302 >gi|119612172|gb|EAW91766.1| matrilin 2, isoform CRA_b [Homo sapiens] gi|119612174|gb|EAW91768.1| matrilin 2, isoform CRA_b [Homo sapiens] Length = 922 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 672 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYM 728 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 729 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 775 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 776 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 831 Query: 407 QLMVH 411 + + Sbjct: 832 KGICE 836 >gi|119612170|gb|EAW91764.1| matrilin 2, isoform CRA_a [Homo sapiens] gi|119612171|gb|EAW91765.1| matrilin 2, isoform CRA_a [Homo sapiens] Length = 941 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 672 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYM 728 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 729 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 775 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 776 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 831 Query: 407 QLMVH 411 + + Sbjct: 832 KGICE 836 >gi|62548862|ref|NP_085072.2| matrilin-2 isoform b precursor [Homo sapiens] Length = 937 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 672 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYM 728 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 729 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 775 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 776 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 831 Query: 407 QLMVH 411 + + Sbjct: 832 KGICE 836 >gi|62548860|ref|NP_002371.3| matrilin-2 isoform a precursor [Homo sapiens] Length = 956 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 672 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYM 728 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 729 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 775 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 776 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 831 Query: 407 QLMVH 411 + + Sbjct: 832 KGICE 836 >gi|37182908|gb|AAQ89254.1| MATN2 [Homo sapiens] Length = 915 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 631 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYM 687 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 688 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 734 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 735 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 790 Query: 407 QLMVH 411 + + Sbjct: 791 KGICE 795 >gi|32425428|gb|AAH16394.1| MATN2 protein [Homo sapiens] Length = 715 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 431 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYM 487 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 488 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 534 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 535 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 590 Query: 407 QLMVH 411 + + Sbjct: 591 KGICE 595 >gi|14714613|gb|AAH10444.1| Matrilin 2 [Homo sapiens] gi|261858984|dbj|BAI46014.1| matrilin 2 [synthetic construct] Length = 937 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 672 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYM 728 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 729 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 775 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 776 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 831 Query: 407 QLMVH 411 + + Sbjct: 832 KGICE 836 >gi|11125762|gb|AAC51260.2| matrilin-2 precursor [Homo sapiens] Length = 956 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 672 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYM 728 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 729 GKGSMTGLALKHMFERSFTQGE----------GARPFSTRVPRAAIVFTDG---RAQDDV 775 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 776 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 831 Query: 407 QLMVH 411 + + Sbjct: 832 KGICE 836 >gi|62298084|sp|O00339|MATN2_HUMAN RecName: Full=Matrilin-2; Flags: Precursor Length = 956 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 672 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYM 728 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 729 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 775 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 776 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 831 Query: 407 QLMVH 411 + + Sbjct: 832 KGICE 836 >gi|52545626|emb|CAB70853.2| hypothetical protein [Homo sapiens] Length = 672 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 388 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYM 444 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 445 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 491 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 492 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 547 Query: 407 QLMVH 411 + + Sbjct: 548 KGICE 552 >gi|21739491|emb|CAD38787.1| hypothetical protein [Homo sapiens] Length = 1016 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 732 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYM 788 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 789 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 835 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 836 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 891 Query: 407 QLMVH 411 + + Sbjct: 892 KGICE 896 >gi|14042702|dbj|BAB55358.1| unnamed protein product [Homo sapiens] Length = 537 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 253 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYM 309 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 310 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 356 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 357 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 412 Query: 407 QLMVH 411 + + Sbjct: 413 KGICE 417 >gi|11360063|pir||T46488 hypothetical protein DKFZp434J065.1 - human (fragment) Length = 741 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 457 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYM 513 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 514 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 560 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 561 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 616 Query: 407 QLMVH 411 + + Sbjct: 617 KGICE 621 >gi|312870351|ref|ZP_07730477.1| putative acetyl-CoA carboxylase, biotin carboxylase subunit [Lactobacillus oris PB013-T2-3] gi|311094138|gb|EFQ52456.1| putative acetyl-CoA carboxylase, biotin carboxylase subunit [Lactobacillus oris PB013-T2-3] Length = 463 Score = 39.9 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 36/271 (13%), Positives = 89/271 (32%), Gaps = 28/271 (10%) Query: 43 VLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALID---DAKRFIK 99 V D K + A + A L+ ++ + E++ + + D + F + Sbjct: 171 VTDAQEMKAAFHEAQSEARLS----FDNDQMYIEKIMENVKHIEMQIFRDRFGHSVYFPE 226 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + V + + I + R + +A ++ + ++ + D+ Sbjct: 227 RDCSLQI-NKQKVLEESPCKLITPTERKELGQIALRAVNGIDYLNTGTLEFLMDQDHHFY 285 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS------GSMHCAMNSDPED 213 F+E + + + + ++ R L D S ++ C +N++ + Sbjct: 286 FMEMNTRIQVEHTVTEMVTGIDIVKAQVRVAAGAPLPFDQSDIQLQGAAIECRINAEDAE 345 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI---GYTTRVEKNIEPS 270 P K L + ID +Y G Y + + K I Sbjct: 346 HGFIPSTGQVKY-----LYLPVGNLGMRIDTA------LYQGATVTPFYDSMIAKVIALG 394 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 E+ + + R ++ +++ ++ +QA Sbjct: 395 HNREEAIEKIKRLLNEMVIVGINTNQTFQQA 425 >gi|296196207|ref|XP_002745750.1| PREDICTED: anthrax toxin receptor 2-like [Callithrix jacchus] Length = 663 Score = 39.9 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 67/240 (27%), Gaps = 52/240 (21%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + R F + V+D SGS+ V Sbjct: 209 QEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAE---------------------- 246 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +S + + I ++++ + + +K+ + + + T +K A Sbjct: 247 -RFVSPE---MRLSFIVFSSQASIILPLTGDRDKISKGLEDLKHVNPVGETYIHEGLKLA 302 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKE 359 Q K L II LTDG + KI ++ Sbjct: 303 ----------------NEQIQKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKI---SRS 343 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + Q L+ S E + V + I ++ + + IL+ Sbjct: 344 LGASVYCVGVL--DFEQAQLERIADSKEQVFPVKGGFQ---ALKGIINSILAQSCTEILE 398 >gi|326922323|ref|XP_003207399.1| PREDICTED: collagen alpha-2(VI) chain-like [Meleagris gallopavo] Length = 1022 Score = 39.9 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 62/207 (29%), Gaps = 54/207 (26%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL--S 246 I V+D S S+ T KN ++ + + + Sbjct: 612 GALDIMFVIDSSESIG--------------------YTNFTLEKNFVVNVVSRLGSIAKD 651 Query: 247 HVKED-VYMGLIGYTTR-VEKNIEPS----WGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 E +G++ Y+ + I+ ++ V R I T + A++ Sbjct: 652 PKSETGARVGVVQYSHEGTFEAIKLDDERINSLSSFKEAVKRL--EWIAGGTWTPSALQF 709 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AY L + +R Q F + +TDG + + N +C Sbjct: 710 AYNKLIKESRREKA---------------QVFAVVITDGRYDPRDDDKNLGALC----GR 750 Query: 361 FIKIVTIS----INASPNGQRLLK-TC 382 + + TI + + L+ C Sbjct: 751 DVLVNTIGIGDIFDQPEQSETLVSIAC 777 >gi|269126610|ref|YP_003299980.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268311568|gb|ACY97942.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 315 Score = 39.9 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 39/147 (26%), Gaps = 17/147 (11%) Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 V S + A + + + + P I+ L Sbjct: 136 SANVVATPSGDRAAAISSIDTLTLAKRTAIGEAVFTSLQAIRSFDAQAGQDPPPAHIVLL 195 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR--------------LLKTC 382 +DG+N +S I D A+ + + TI+ L Sbjct: 196 SDGDNTTGRSVPEAI---DAARAADVPVSTIAFGTPYGTVEIDGETTPVEVNKVTLAGLA 252 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLM 409 + Y + D L V+ NI + Sbjct: 253 QGTNGKAYEAADNDQLSQVYANIGTSL 279 >gi|221042220|dbj|BAH12787.1| unnamed protein product [Homo sapiens] Length = 648 Score = 39.9 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 52/160 (32%), Gaps = 35/160 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + AL+ LD + V Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREALIKILDDLSPRDQFNLIV 314 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + R + K R + L T+ AM A Q+L S + Sbjct: 315 FSTEATQ-WRPSLVPASAENVNKARSFAAGIQA---LGGTNINDAMLMAVQLLDSSNQ-- 368 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + S+ II LTDG+ ++N +I+ Sbjct: 369 ------EERLPEGSVSL---IILLTDGDPTVGETNPRSIQ 399 >gi|211616|gb|AAA48705.1| type VI collagen, alpha-2 subunit [Gallus gallus] Length = 720 Score = 39.9 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 62/207 (29%), Gaps = 54/207 (26%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL--S 246 I V+D S S+ T KN ++ + + + Sbjct: 536 GALDIMFVIDSSESIG--------------------YTNFTLEKNFVVNVVSRLGSIAKD 575 Query: 247 HVKED-VYMGLIGYTTR-VEKNIEPS----WGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 E +G++ Y+ + I+ ++ V R I T + A++ Sbjct: 576 PKSETGARVGVVQYSHEGTFEAIKLDDERINSLSSFKEAVKRL--EWIAGGTWTPSALQF 633 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AY L + +R Q F + +TDG + + N +C Sbjct: 634 AYNKLIKESRREKA---------------QVFAVVITDGRYDPRDDDKNLGALC----GR 674 Query: 361 FIKIVTIS----INASPNGQRLLK-TC 382 + + TI + + L+ C Sbjct: 675 DVLVNTIGIGDIFDQPEQSETLVSIAC 701 >gi|62881|emb|CAA39981.1| type VI collagen subunit alpha2 [Gallus gallus] Length = 918 Score = 39.9 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 62/207 (29%), Gaps = 54/207 (26%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL--S 246 I V+D S S+ T KN ++ + + + Sbjct: 612 GALDIMFVIDSSESIG--------------------YTNFTLEKNFVVNVVSRLGSIAKD 651 Query: 247 HVKED-VYMGLIGYTTR-VEKNIEPS----WGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 E +G++ Y+ + I+ ++ V R I T + A++ Sbjct: 652 PKSETGARVGVVQYSHEGTFEAIKLDDERINSLSSFKEAVKRL--EWIAGGTWTPSALQF 709 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AY L + +R Q F + +TDG + + N +C Sbjct: 710 AYNKLIKESRREKA---------------QVFAVVITDGRYDPRDDDKNLGALC----GR 750 Query: 361 FIKIVTIS----INASPNGQRLLK-TC 382 + + TI + + L+ C Sbjct: 751 DVLVNTIGIGDIFDQPEQSETLVSIAC 777 >gi|45384382|ref|NP_990679.1| collagen alpha-2(VI) chain precursor [Gallus gallus] gi|115352|sp|P15988|CO6A2_CHICK RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor gi|62877|emb|CAA33144.1| type VI collagen alpha-2 subunit preprotein [Gallus gallus] gi|62882|emb|CAA39982.1| type VI collagen subunit alpha2 [Gallus gallus] Length = 1022 Score = 39.9 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 62/207 (29%), Gaps = 54/207 (26%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL--S 246 I V+D S S+ T KN ++ + + + Sbjct: 612 GALDIMFVIDSSESIG--------------------YTNFTLEKNFVVNVVSRLGSIAKD 651 Query: 247 HVKED-VYMGLIGYTTR-VEKNIEPS----WGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 E +G++ Y+ + I+ ++ V R I T + A++ Sbjct: 652 PKSETGARVGVVQYSHEGTFEAIKLDDERINSLSSFKEAVKRL--EWIAGGTWTPSALQF 709 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AY L + +R Q F + +TDG + + N +C Sbjct: 710 AYNKLIKESRREKA---------------QVFAVVITDGRYDPRDDDKNLGALC----GR 750 Query: 361 FIKIVTIS----INASPNGQRLLK-TC 382 + + TI + + L+ C Sbjct: 751 DVLVNTIGIGDIFDQPEQSETLVSIAC 777 >gi|327271908|ref|XP_003220729.1| PREDICTED: collagen alpha-1(XX) chain-like [Anolis carolinensis] Length = 1480 Score = 39.9 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 45/139 (32%), Gaps = 24/139 (17%) Query: 250 EDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + +GL Y++ + W T R V + SL K ++ + Sbjct: 282 DKIRVGLTQYSS----DPRTEWDLNTYATRDEVLEALRSLRYKGGNTFTGLAL------- 330 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + K II LTDG++ + + + K I+I + Sbjct: 331 ----THVLEHNLKADTGARSEAPKLIILLTDGKSQDDANPPAQV-----LKNMGIQIFAV 381 Query: 368 SINASPNGQRLLKTCVSSP 386 + + L+ S P Sbjct: 382 GVK--NADETELRQVASDP 398 >gi|167525755|ref|XP_001747212.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774507|gb|EDQ88136.1| predicted protein [Monosiga brevicollis MX1] Length = 471 Score = 39.9 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 33/246 (13%), Positives = 75/246 (30%), Gaps = 49/246 (19%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + R + +S E ERP + V+D+SGSM Sbjct: 32 IGARESSAYISCRLTAPDFEPVERPAIDLVAVIDVSGSMAGQ------------------ 73 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K+ +++ L + ++ L+ + + V+ + T ++ D+ Sbjct: 74 --KLKMVQSTLEFLMRNLKDTD------RFALVTFDSDVKTVFDLRPMTTAHKEACLADV 125 Query: 285 DSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 L T+ + + + +++ + I+ +TDG N Sbjct: 126 QKLRAGSCTNLSGGLFRGVELMQQRGATKGAVSS---------------ILLMTDGIANE 170 Query: 344 FKSNVNTIKICDKAK-----ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + + + C + I T N L + + +Y + + D + Sbjct: 171 GVRDKDDM--CRALRGLMGPAPDYTIYTFGYGKDHNENMLRQLSETGNGMYYFIESNDII 228 Query: 399 IHVFQN 404 F + Sbjct: 229 PESFGD 234 >gi|153840568|ref|ZP_01993235.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|149745769|gb|EDM56899.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] Length = 187 Score = 39.9 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 18/105 (17%), Positives = 42/105 (40%), Gaps = 6/105 (5%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++K ++ ++ ++ L + F I + K +++A + A LAGA +V++ + Sbjct: 7 RTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGA--VVADKTED 64 Query: 76 GDRFE----SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 D+ E + + + F + + S F N Sbjct: 65 VDQAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNA 109 >gi|149721558|ref|XP_001490961.1| PREDICTED: matrilin 2 isoform 1 [Equus caballus] Length = 956 Score = 39.9 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 63/164 (38%), Gaps = 21/164 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTP 296 ID L+ + +GL+ Y+T+V G+ K + M + K + + Sbjct: 678 VTGIIDSLAISPKAARVGLLQYSTQVRTEFTLRNFGSAKDMKKAVAHM-KYMGKGSMTGL 736 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K ++ + + G + S + I TDG + +V+ + K Sbjct: 737 ALKHMFERSFTQVE----------GARPLSARVPRVAIVFTDG---RAQDDVS--EWASK 781 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 AK N I + + + + + L+ S P ++ + + ++ Sbjct: 782 AKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTM 823 >gi|149721562|ref|XP_001490991.1| PREDICTED: matrilin 2 isoform 2 [Equus caballus] Length = 915 Score = 39.9 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 63/164 (38%), Gaps = 21/164 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTP 296 ID L+ + +GL+ Y+T+V G+ K + M + K + + Sbjct: 637 VTGIIDSLAISPKAARVGLLQYSTQVRTEFTLRNFGSAKDMKKAVAHM-KYMGKGSMTGL 695 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K ++ + + G + S + I TDG + +V+ + K Sbjct: 696 ALKHMFERSFTQVE----------GARPLSARVPRVAIVFTDG---RAQDDVS--EWASK 740 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 AK N I + + + + + L+ S P ++ + + ++ Sbjct: 741 AKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTM 782 >gi|118388916|ref|XP_001027552.1| Ubiquitin-conjugating enzyme family protein [Tetrahymena thermophila] gi|89309322|gb|EAS07310.1| Ubiquitin-conjugating enzyme family protein [Tetrahymena thermophila SB210] Length = 601 Score = 39.9 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 35/166 (21%), Positives = 58/166 (34%), Gaps = 32/166 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E P I ++ D+SGSM +DP +++ A+ NA L Sbjct: 132 EDPAEAIVVIYDVSGSMKSGFFNDPL------------ISRIGAV-NAFFSAFADKTLAY 178 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V L + R+EK + + + V D+ T A+ + L Sbjct: 179 EYNHIVQ--LYWFDDRIEKKCDFTKDMNHFIKLVD---DANPRGSTRLYDALMEGINSLL 233 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 KK K P++ + +I +TDGE+N K + Sbjct: 234 LIKK------------KYPNIILR--LIAMTDGEDNQSKYKPEEVA 265 >gi|83717579|ref|YP_440458.1| hypothetical protein BTH_II2270 [Burkholderia thailandensis E264] gi|167579118|ref|ZP_02371992.1| hypothetical protein BthaT_13315 [Burkholderia thailandensis TXDOH] gi|257141105|ref|ZP_05589367.1| hypothetical protein BthaA_18159 [Burkholderia thailandensis E264] gi|83651404|gb|ABC35468.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 418 Score = 39.9 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 10/71 (14%), Positives = 31/71 (43%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 G ++ SI+ AL + + +G + + + ++ ++++ ++ LA A + ++ Sbjct: 15 GRRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADSCALAAARDLTGAIN 74 Query: 74 RLGDRFESISN 84 I+ Sbjct: 75 LSVPEAAGITA 85 >gi|282896313|ref|ZP_06304335.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281198809|gb|EFA73688.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 336 Score = 39.9 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 35/241 (14%), Positives = 82/241 (34%), Gaps = 49/241 (20%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + +Q++++ + + + R + L++D SGSM K Sbjct: 18 KSSSQRLIAISVSAIGETIDSRVPLNLCLILDHSGSM--------------------KGQ 57 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT--EKVRQYVTRDM 284 + +K A L +D + + ++ + R E + + ++Q + R Sbjct: 58 PVENVKRAAWLLVDKLRDQD------RLSIVVFNHRAEVLLSNQNVVDRDHIKQQINRLS 111 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T ++ + L ++ + F LTDGEN + Sbjct: 112 AN---GGTSIDEGLRLGIEELAKGRRDTISQAF-----------------LLTDGENEHG 151 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +N +K A + + + T+ + N L K + ++ + D + F + Sbjct: 152 DNN-RCLKFAQLAADYNLTVNTLGFGNNWNQHILEKISDAGLGSLSHIEHPDQAVDKFNS 210 Query: 405 I 405 + Sbjct: 211 L 211 >gi|296453244|ref|YP_003660387.1| von Willebrand factor type A (vWA) domain-containing protein [Bifidobacterium longum subsp. longum JDM301] gi|296182675|gb|ADG99556.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. longum JDM301] Length = 565 Score = 39.9 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 70/210 (33%), Gaps = 38/210 (18%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VVD SGSM + +A + D+ + + DV + Sbjct: 390 VVDYSGSMSGEGKNGVVKGLNAALDPDQAKKSY----------------IEPASGDVNI- 432 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + T + ++ + GT + + TD + A L S+ + S +T Sbjct: 433 LIPFETEAHRPVKAT-GTS-TSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYT 490 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 I+ +TDG +N+ + K++ + I +I + Sbjct: 491 TA---------------IVLMTDGRSNSDHQDEFESAY--KSRGRDLPIFSIMFGDADPS 533 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 Q LK+ + + L VF+ + Sbjct: 534 Q--LKSLATLSNAKVFDGRSGDLAAVFRQV 561 >gi|171741586|ref|ZP_02917393.1| hypothetical protein BIFDEN_00672 [Bifidobacterium dentium ATCC 27678] gi|171277200|gb|EDT44861.1| hypothetical protein BIFDEN_00672 [Bifidobacterium dentium ATCC 27678] Length = 1256 Score = 39.9 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 36/272 (13%), Positives = 76/272 (27%), Gaps = 50/272 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 V A + LV+D+ ++ K Sbjct: 568 NTYTVNVDVTGAANSSTITTTQSVDFTLVLDV-----------SSSMSDEMDSDQGSIKK 616 Query: 228 MAALKNALLLFLDSIDLLS--HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY----VT 281 M ALK+A+ FL ++ E + +GL+ + + + TE Y + Sbjct: 617 MTALKSAVNNFLGEAAEINEQSGSELIRVGLVKFAGKESSKVGNETYTEGRFVYNYSQIV 676 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + + + A++ + + ++ +IF TDG Sbjct: 677 SPLTADMSDLKNKVSALRH-----NGATRADLGFKHASTVMSGARTDAKRVVIFFTDGTP 731 Query: 342 NNFKS-----NVNTIKICDKAKENFIKIVTISI-------NASPNGQRLLKTCVSS---- 385 + + K++ + +I + + + + VSS Sbjct: 732 TKVSDFDKDVANSAVTYAKSLKDSGATVYSIGVFDGANPSSIEEDQKNQFMNAVSSNYPH 791 Query: 386 ------------PEYHYNVVNADSLIHVFQNI 405 Y+ V N L +F+ I Sbjct: 792 ATAYDKLGTGSNAGYYKVVSNVSDLKSIFEKI 823 >gi|326673138|ref|XP_001334803.4| PREDICTED: collagen alpha-4(VI) chain-like [Danio rerio] Length = 1356 Score = 39.9 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 63/173 (36%), Gaps = 24/173 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL--ILKPTDSTPAMKQAYQILT 306 + V +GL+ Y+ T + ++ + + +L T + + Sbjct: 66 PDKVRIGLVQYSDTPRTEFSL--NTYQNKEEILDYIRNLRYKTGGTHTGQGL-------- 115 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + +F + + I +TDG+ + ++ + ++ IKI Sbjct: 116 ---EFILKQHFIEEAGSRAQQNVPQIAIVITDGD----SQDEVDLQA-QELRQRGIKIFA 167 Query: 367 ISINASPNGQRLLKTCVSSP-EYH-YNVVNADSLIHVFQNISQLMVHRKYSVI 417 I I RLL+ + P + + Y+V + +L + Q++ + + VI Sbjct: 168 IGIK--DADVRLLRQIANEPYDQYVYSVSDFAALQGISQSVVRELCTSVKDVI 218 >gi|260841562|ref|XP_002613981.1| hypothetical protein BRAFLDRAFT_118457 [Branchiostoma floridae] gi|229299371|gb|EEN69990.1| hypothetical protein BRAFLDRAFT_118457 [Branchiostoma floridae] Length = 2122 Score = 39.9 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 31/235 (13%), Positives = 77/235 (32%), Gaps = 40/235 (17%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +++ + I ++D SGS+ N + ++ + L Sbjct: 36 QVKKYQDSRADIVFLLDNSGSVG-RYNFEEVEIAF------------------VENLLSQ 76 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQ 300 + + + ++ + +I+ + ++ + T++ A + Sbjct: 77 LTISPQAS---RVAVVSFDDVARTHIDYIKYPKNKCSFLRELKTVKYIGEWTNTEDAFRL 133 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A ++L P ++ +I LTDG + +K + K Sbjct: 134 AQELLRPPSAFKNER------------PVKQVVILLTDG---RPTRGGDPVKRANNLKSV 178 Query: 361 -FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 +I +I I + N Q+ L+ C + ++ Y N + + I ++Y Sbjct: 179 YNAEIFSIGIGGNLNKQQ-LEDCATDAQHLYLSPNFVDFKDLAKRIRGDPYAKRY 232 >gi|145594605|ref|YP_001158902.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303942|gb|ABP54524.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 436 Score = 39.9 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 45/246 (18%), Positives = 87/246 (35%), Gaps = 38/246 (15%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 + +PAL E P +ELV+D+SGSM +D + R++++ Sbjct: 27 SVMTGPVPALADWETPVEPP-KVELVLDVSGSMRA---TDIDG-----------RSRISV 71 Query: 231 LKNALLLFLDSIDLLSHVKEDVY-MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 + A +D++ + + +G E+ G + +Q V + Sbjct: 72 AQQAFNEVVDALP--DETELGIRVLGATYPGDDKEQ------GCQDTQQIV-------PV 116 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 P D A K A L + + + I+ +TDGE+ + Sbjct: 117 GPVDRVQA-KAAVATLRPTGYTPVGLALRSAAEDLGTGSTARRIVLITDGEDTCAPPDPC 175 Query: 350 TIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A+ + + T+ + A Q+LL ++ + +AD L I Q Sbjct: 176 EVARELAAQGTKLVVDTLGL-APDEKVRQQLLCIAGATGGTYTAAQSADELT---GRIKQ 231 Query: 408 LMVHRK 413 L+ + Sbjct: 232 LVDRAR 237 >gi|307324435|ref|ZP_07603643.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890166|gb|EFN21144.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 543 Score = 39.9 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 73/219 (33%), Gaps = 45/219 (20%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS--IDLLSHVKEDVY 253 V+D SGSM ++A LK AL S ++ Sbjct: 357 VLDTSGSM--------------------NGDRLAQLKRALGQLAGSDVSPTGDRFRDREE 396 Query: 254 MGLIGYTTRVE---KNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDK 309 + L+ + +RV+ + P V + D ++L T +++ AY L Sbjct: 397 VTLMPFGSRVKGVRTHTVPGERPAPVLAAIRADAEALTADGDTAIFSSLQAAYDHL---- 452 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK--ENFIKIVTI 367 ++ + F I+ +TDGEN + + + + E + I Sbjct: 453 --------AQRRSALGDDRFTS-IVLMTDGENTTGATASDFDAYYRRLRGPERTAPVFPI 503 Query: 368 SINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 S Q L++ + + ++ SL F+ I Sbjct: 504 VFGDSDRSQ--LQSIATLTGGRLFDATK-GSLDGAFEEI 539 >gi|259501746|ref|ZP_05744648.1| acetyl-CoA carboxylase [Lactobacillus antri DSM 16041] gi|259170290|gb|EEW54785.1| acetyl-CoA carboxylase [Lactobacillus antri DSM 16041] Length = 463 Score = 39.9 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 36/271 (13%), Positives = 88/271 (32%), Gaps = 28/271 (10%) Query: 43 VLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALID---DAKRFIK 99 V D K A + A L+ ++ + E++ + + D + F + Sbjct: 171 VTDAQEMKAVFHEAQSEARLS----FDNDQMYIEKIMENVKHIEMQIFRDRFGHSVYFPE 226 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + V + + I + R + +A ++ + ++ + D+ Sbjct: 227 RDCSLQI-NKQKVLEESPCKLITPTERKELGQIALRAVNGIDYLNTGTLEFLMDQDHHFY 285 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS------GSMHCAMNSDPED 213 F+E + + + + ++ R L D S ++ C +N++ + Sbjct: 286 FMEMNTRIQVEHTVTEMVTGIDIVKAQVRVAAGAPLPFDQSDIQLQGAAIECRINAEDAE 345 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI---GYTTRVEKNIEPS 270 P K L + ID +Y G Y + + K I Sbjct: 346 HGFIPSTGQVKY-----LYLPVGNLGMRIDTA------LYQGATVTPFYDSMIAKVIALG 394 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 E+ + + R ++ +++ ++ +QA Sbjct: 395 HNREEAIEKIKRLLNEMVIVGINTNQTFQQA 425 >gi|149911407|ref|ZP_01900025.1| hypothetical protein PE36_11192 [Moritella sp. PE36] gi|149805515|gb|EDM65520.1| hypothetical protein PE36_11192 [Moritella sp. PE36] Length = 450 Score = 39.9 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 44/350 (12%), Positives = 105/350 (30%), Gaps = 29/350 (8%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 + N I+F +++ + + + + K +++ ++A L A+++ Sbjct: 13 QNGNVLIVFTIALFALIGMASLALDGGHLLLNKGKLQNLVDSAALHAATEL--------- 63 Query: 78 RFESISNHAKRALIDDA--KRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 E ++ RA + N E S N + + + + + + Sbjct: 64 -DEGATHEQARAAVVALIQLNIAHNDHHELASAIDFSIVNNGLDQMTAQLNVEFSQLP-D 121 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 NN + V S F+ + + + I + + + + Sbjct: 122 PFIQDNNESAKYVKVSLSQLELDNFLADVFSFNKQVSATALSGPSSDISNCYQDLVPMVV 181 Query: 196 VVD-----LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 D ++G+ + ++ + L + D+ + + Sbjct: 182 CADTANAGVNGANEYGLKEYSLNLMKIGSNSNSPIGPGNFQLLRLGSNTGAADIRAAMAG 241 Query: 251 DVYMGLIGYT---TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 D +G + +T + EP V Q + M P +ST ++ + Sbjct: 242 DTDLGTVCFTPGISNQSVPTEPGNTVGPVAQGLNTRMGEWH-GPVNSTDHLRDTNICEGA 300 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + G S F + D + + S+ NT CD Sbjct: 301 TIELDANGDALEAGAAAKSYTFNDY-----DSDGAQYLSDANTS--CDAI 343 >gi|118384937|ref|XP_001025607.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89307374|gb|EAS05362.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 670 Score = 39.9 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 71/210 (33%), Gaps = 25/210 (11%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VVD+SGSM + + + + +K+++ + ++++ Y+ Sbjct: 38 VVDVSGSMSSEAKIINQSSQKSDENY--SLSILDVVKHSIKMIVNTLGSED------YLS 89 Query: 256 LIGYTTRVEKNIE-PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 ++ ++ + + + + T+ + A IL ++K + Sbjct: 90 IVTFSDSANVLFDLLPMNDSNKTMAIEKIENLSTEGGTELWKGLNSALNILLNNKTPNTN 149 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 + I LTDG+ + + N +K + I T ++S N Sbjct: 150 QS----------------IFLLTDGQPTDSGIDTNLVKFKQAYPKLNCTINTFGFSSSSN 193 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + K + + +A + F N Sbjct: 194 SELMNKIAMEYNGMFSFIPDASFIATAFAN 223 >gi|332233322|ref|XP_003265851.1| PREDICTED: anthrax toxin receptor 2 [Nomascus leucogenys] Length = 488 Score = 39.9 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 66/240 (27%), Gaps = 52/240 (21%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + R F + V+D SGS+ V Sbjct: 34 QEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAE---------------------- 71 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +S + + I ++++ + + K+ + + + T +K A Sbjct: 72 -RFVSPE---MRLSFIVFSSQATIILPLTGDRGKISKGLEDLKHVSPVGETYIHEGLKLA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKE 359 Q K L II LTDG + KI ++ Sbjct: 128 ----------------NEQIQKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKI---SRS 168 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + Q L+ S E + V + I ++ + + IL+ Sbjct: 169 LGASVYCVGVL--DFEQAQLERIADSKEQVFPVKGGFQ---ALKGIINSILAQSCTEILE 223 >gi|170734866|ref|YP_001773980.1| hypothetical protein Bcenmc03_6370 [Burkholderia cenocepacia MC0-3] gi|169820904|gb|ACA95485.1| conserved hypothetical protein [Burkholderia cenocepacia MC0-3] Length = 423 Score = 39.9 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 16/134 (11%), Positives = 55/134 (41%), Gaps = 6/134 (4%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+ +R+ ++ + ++ +II L++ + +G + + + ++ ++++ +A Sbjct: 1 MNRAARYPKVTRRSLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADAC 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L+ A + S +S + I+ + F K ++ S + + F ++ Sbjct: 61 ALSAARDLTSAISLSVAEADGIAAGHLNFVF-----FQKKSVQMS-TNANVTFSDSLTNP 114 Query: 121 IVNSSRISMTHMAN 134 + + ++ Sbjct: 115 FLTKNAVTTPANIK 128 >gi|194211147|ref|XP_001917810.1| PREDICTED: chloride channel, calcium activated, family member 4 [Equus caballus] Length = 909 Score = 39.9 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 74/216 (34%), Gaps = 53/216 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM + ++M A FL + + + ++ Sbjct: 309 LVLDKSGSMA----------------GSNRLSRMN---QAAKHFL--LQTIENGS---WV 344 Query: 255 GLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + + R + + + + T +K A++++T + Sbjct: 345 GMVHFDSIALVISNLTQIISSNERNKLFESLPTEAVGGTSICAGIKSAFEVITDMYSQID 404 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 + I+ LTDGE+N T C D+ K++ I I++ Sbjct: 405 GSE----------------IVLLTDGEDN-------TAGSCVDEVKQSGAIIHFIAL-GP 440 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADS---LIHVFQNI 405 Q +++ + H + + LI F + Sbjct: 441 SADQAVIEMSTITGGKHKYASDEAANNGLIDAFAAL 476 >gi|170076505|ref|YP_001733144.1| hypothetical protein SYNPCC7002_G0035 [Synechococcus sp. PCC 7002] gi|169887367|gb|ACB01075.1| conserved hypothetical protein (von Willebrand factor type A domain) [Synechococcus sp. PCC 7002] Length = 420 Score = 39.9 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 38/249 (15%), Positives = 84/249 (33%), Gaps = 54/249 (21%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 ++ P L+ ++ RP + V+D SGSMH K+ Sbjct: 25 VLIKIEPPLVELDNNARPPLNLGFVLDKSGSMHGN--------------------KLDYA 64 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K A+ ++ + + L + T+VE I + R ++++ L Sbjct: 65 KQAIAYAIEQLLPSD------RLSLTLFDTQVETKIP-----STLATDKQRLLETIKLIR 113 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + S+ A+ + + G + + + I L+DG N ++N + I Sbjct: 114 SGSSTALH-----------DGWVQGGIQVGQYLNNDHLNRVI-LLSDGLANVGETNPDVI 161 Query: 352 KI-CDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQ------ 403 + I + + + LL+ S +++ + + L +F+ Sbjct: 162 ASDVHGLMKTGISTSALGV-GRDYDEDLLEAIARSGDGNFFHIASPEDLPQIFETELQGL 220 Query: 404 --NISQLMV 410 I + + Sbjct: 221 ATTIGRSVT 229 >gi|226947182|ref|YP_002802255.1| ppkA-like protein [Azotobacter vinelandii DJ] gi|226722109|gb|ACO81280.1| ppkA-related protein [Azotobacter vinelandii DJ] Length = 656 Score = 39.9 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 59/183 (32%), Gaps = 23/183 (12%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM--D 285 + + A+ I+ + E V GL+ Y + +++ +YV++ Sbjct: 242 IDRTREAIRKIYQRIEQ-EKLLEQVRFGLVAYRSNIKEVPAL--------EYVSKLYVDP 292 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ---KFIIFLTD---- 338 + + D M S + + ++I+ ++D Sbjct: 293 AKVQGGEDFLAKMAALKPATVSSSRFDEDAYAGVMRALDGIDWTRFGARYIVLVSDAGAL 352 Query: 339 -GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 G + + ++ ++ +AK + I ++ + K S+ + ++ N Sbjct: 353 DGTDPLSATGLDAAQVRLEAKHRGVAIYSLHLKTPSGA----KNHASAEAQYRDLSNNPI 408 Query: 398 LIH 400 L Sbjct: 409 LNK 411 >gi|255587116|ref|XP_002534143.1| protein binding protein, putative [Ricinus communis] gi|223525789|gb|EEF28236.1| protein binding protein, putative [Ricinus communis] Length = 728 Score = 39.9 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 65/171 (38%), Gaps = 43/171 (25%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 +L ++ R + V+D+SGSM TK+A LK A+ Sbjct: 258 SLPQLSQSPRAPVDLVTVLDISGSMAG--------------------TKLALLKRAMGFV 297 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPA 297 + ++ + +I +++ + ++ RQ + ++SL+ T+ Sbjct: 298 IQNLGSND------RLSVIAFSSTARRLFPLRRMSDTGRQQALQAVNSLVAHGGTNIAEG 351 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 +++ +++ ++++ + II L+DG++ S+ Sbjct: 352 LRKGAKVMEDRREKNPVAS----------------IILLSDGQDTYTVSSS 386 >gi|320450000|ref|YP_004202096.1| hypothetical protein TSC_c09220 [Thermus scotoductus SA-01] gi|320150169|gb|ADW21547.1| conserved hypothetical protein [Thermus scotoductus SA-01] Length = 691 Score = 39.9 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 81/235 (34%), Gaps = 52/235 (22%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 +P L ++ R + LV+D+SGSM L A+ Sbjct: 278 LPEDLPLKPLGRKGAALVLVMDVSGSMEG-----------------------EKLSLAVA 314 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDST 295 L+ + + Y+G++ +++ P TE+ ++ + S+ T Sbjct: 315 GALELVRSAAEED---YLGVVLFSSTHRVLFPPRPMTEQGKKEAESLLLSVRAGGGTVLG 371 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A ++A ++L +K I+ L+DG ++ + I Sbjct: 372 SAFREAVRLLQGVPVE------------------RKGILVLSDGLISDPQD-----PILA 408 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 A+ + +++ +++ A + LK +Y A L +F Q + Sbjct: 409 LAEASGLEVSAMALGA-DADRAFLKVLAERGGGRYYQAATAQELPRLFLKEGQEV 462 >gi|301767170|ref|XP_002919035.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Ailuropoda melanoleuca] Length = 891 Score = 39.9 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 56/162 (34%), Gaps = 37/162 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSMH KM K+ALL LD + ++ Sbjct: 284 SVVFVIDVSGSMHGR--------------------KMEQTKDALLKILDDMKEEDYLNII 323 Query: 252 VYMGLIG-YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++ G + + + + ++ R +V D T+ + +A +L ++ Sbjct: 324 LFSGDVTIWRDSLVQATPE--NIQEARTFVKNIHDQ---GMTNINDGLMRAINMLNKARE 378 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 +I LTDG+ N +S I+ Sbjct: 379 EHRVPERSTS-----------IVIMLTDGDANVGESRPEKIQ 409 >gi|150397140|ref|YP_001327607.1| putative signal peptide protein [Sinorhizobium medicae WSM419] gi|150028655|gb|ABR60772.1| conserved hypothetical signal peptide protein [Sinorhizobium medicae WSM419] Length = 126 Score = 39.9 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 28/57 (49%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 F + ++ + + NF+++ A++ + + I +++ ++SA +AA L Sbjct: 5 TFCLHMRRLMRDREGNFAVLGAIAFIPIIGAAALAIDFAGAYFEAEKIQSALDAAAL 61 >gi|116695554|ref|YP_841130.1| hypothetical protein H16_B1615 [Ralstonia eutropha H16] gi|113530053|emb|CAJ96400.1| conserved hypothetical protein [Ralstonia eutropha H16] Length = 352 Score = 39.9 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 35/242 (14%), Positives = 79/242 (32%), Gaps = 55/242 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +V+DLSGSM QD + +++ A + A + LD++ Sbjct: 97 TVVVVIDLSGSM---------------RAQDIQPSRIRAAQQAAKVLLDTL------PAG 135 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT--SDK 309 V G++ PS + + R T + A L + Sbjct: 136 VSAGVVAMAGTAAVAQAPSRSKDAAATAIDRLK---PQGGTALGNGLLIALTTLLPQTAG 192 Query: 310 KRSFFTNFFRQGVKIPSLPFQ-------KF----IIFLTDGENNNFKSNVNTIKICDKAK 358 N ++ P + I+ +DGE+N + + A Sbjct: 193 DAERLMNGDTTPLQKPDASHSGEAVTPGSYPSGAIVLFSDGESNTGPAATQAAQ---LAA 249 Query: 359 ENFIKIVTISINASPN--------------GQRLLKTCV-SSPEYHYNVVNADSLIHVFQ 403 + +++ T+ + + +++LK ++ ++ + +A L V++ Sbjct: 250 AHGVRVYTVGVGTTDGVVLSVDGWSARVRLDEKVLKEVANATGAEYFPLADAAQLKRVYR 309 Query: 404 NI 405 + Sbjct: 310 AL 311 >gi|301113718|ref|XP_002998629.1| alpha kinase [Phytophthora infestans T30-4] gi|262111930|gb|EEY69982.1| alpha kinase [Phytophthora infestans T30-4] Length = 748 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 17/131 (12%), Positives = 44/131 (33%), Gaps = 28/131 (21%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + ++++ + + +V+D +GSM + +K+A++ Sbjct: 113 VVKEIQLKYAKANALDLVIVMDCTGSMGPW---------------------IDEVKSAIV 151 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE-----PSWGTEKVRQYVTRDMD-SLILK 290 +D++ V +G + Y + + VR+++ R Sbjct: 152 SIIDNVKKDHPSAN-VRVGFVAYRDFCDGVKRLQVYHLTSDVAAVRKFIFRLAAFGGGDG 210 Query: 291 PTDSTPAMKQA 301 P D ++ A Sbjct: 211 PEDIPGGLEAA 221 >gi|149919601|ref|ZP_01908080.1| hypothetical protein PPSIR1_07008 [Plesiocystis pacifica SIR-1] gi|149819544|gb|EDM78972.1| hypothetical protein PPSIR1_07008 [Plesiocystis pacifica SIR-1] Length = 349 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 36/233 (15%), Positives = 72/233 (30%), Gaps = 57/233 (24%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS-IDLLSHVKED 251 I L+VD SGSM ++ + LD + +++ ++ D Sbjct: 79 ITLLVDRSGSMDEDFGG------------------ISRWQAVGDTLLDPDVGVVAPLQGD 120 Query: 252 VYMGLIGYTTRVE-----KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + GL Y + ++ + + + T + A+ Sbjct: 121 IRFGLSLYDNPGDMCPRVESTPLA--LNALADMTALYQSAAPEGDTPTGSAL-------- 170 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN------NNFKSNVNTIKICDKAKEN 360 + + Q P +K I+ TDGE N + + A Sbjct: 171 -----TSVADVVAQD----PDPGEKVIVLATDGEPDTCAQPNPDEGQPEAVAAAQAAYAQ 221 Query: 361 FIK--IVTISINASPNGQRLLKTCVS------SPEYHYNVVNADSLIHVFQNI 405 ++ IV++ S + + + + S +Y ++ SLI F I Sbjct: 222 GVRTVIVSVGSGISADHLQDMANAGAGVQPGGSDAVYYQALDQASLIDAFSEI 274 >gi|73695956|gb|AAZ80784.1| matrilin [Biomphalaria glabrata] Length = 418 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 63/186 (33%), Gaps = 27/186 (14%) Query: 232 KNALLLFLDSID--LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 K A+ + + +S V + +I Y + I + T + V + + Sbjct: 65 KTAIKFLQEFLSQYEISSDPNGVRVSIISYGKGIYPEIGFNLTTYDTKDEVIEAIGKIPH 124 Query: 290 KP---TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 K TD+ A++ ++ ++ K I +TDG N + Sbjct: 125 KAGLRTDTGRAIQYMHEAQLANGVVRPG--------------VTKVSIVITDG---NSQE 167 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T + ++A+++ I + I + LL V N + L +I Sbjct: 168 WKLTKEAAEEARKDNIVMFAIGVGTDIRNSELL-NIAGDQSRVTKVDNYNQL----SSIK 222 Query: 407 QLMVHR 412 + + H+ Sbjct: 223 ESLAHQ 228 >gi|332830871|ref|XP_003311907.1| PREDICTED: matrilin-2 isoform 2 [Pan troglodytes] Length = 915 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 631 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKYM 687 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 688 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 734 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 735 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 790 Query: 407 QLMVH 411 + + Sbjct: 791 KGICE 795 >gi|332830869|ref|XP_528309.3| PREDICTED: matrilin-2 isoform 3 [Pan troglodytes] Length = 937 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 672 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKYM 728 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 729 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 775 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 776 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 831 Query: 407 QLMVH 411 + + Sbjct: 832 KGICE 836 >gi|332830867|ref|XP_003311906.1| PREDICTED: matrilin-2 isoform 1 [Pan troglodytes] Length = 956 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 672 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKYM 728 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 729 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 775 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 776 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 831 Query: 407 QLMVH 411 + + Sbjct: 832 KGICE 836 >gi|332238409|ref|XP_003268390.1| PREDICTED: matrilin-2 isoform 3 [Nomascus leucogenys] Length = 915 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 631 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKYM 687 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 688 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 734 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 735 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 790 Query: 407 QLMVH 411 + + Sbjct: 791 KGICE 795 >gi|332238407|ref|XP_003268389.1| PREDICTED: matrilin-2 isoform 2 [Nomascus leucogenys] Length = 937 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 672 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKYM 728 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 729 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 775 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 776 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 831 Query: 407 QLMVH 411 + + Sbjct: 832 KGICE 836 >gi|332238405|ref|XP_003268388.1| PREDICTED: matrilin-2 isoform 1 [Nomascus leucogenys] Length = 956 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 672 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKYM 728 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 729 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 775 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 776 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKLK 831 Query: 407 QLMVH 411 + + Sbjct: 832 KGICE 836 >gi|297683362|ref|XP_002819353.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-2-like [Pongo abelii] Length = 935 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 670 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKYM 726 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + + G + S + I TDG + +V Sbjct: 727 GKGSMTGLALKHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 773 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 774 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTTKHLFYAEDFSTMDEISEKLK 829 Query: 407 QLMVH 411 + + Sbjct: 830 KGICE 834 >gi|291166457|gb|EFE28503.1| hypothetical protein HMPREF0389_00418 [Filifactor alocis ATCC 35896] Length = 637 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 37/158 (23%), Positives = 60/158 (37%), Gaps = 15/158 (9%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT----KMAALKNALLLFLDSIDLLSHV 248 I LV+D SGSMH D + I K T K+ + A + L D V Sbjct: 251 ISLVLDNSGSMHTRDLKDSHGNKESRINILKVETGKLLKLLSTNKAADVELVPFDNNVLV 310 Query: 249 KEDVYMGLI--GYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQIL 305 + D G I + + ++ E G+ + MDSL T++ ++ A+ + Sbjct: 311 RSDRKGGYIKPTFYSASKEYREKIIGSNVYEGKLKESMDSLGAYSGTNTGEGLRYAFYSI 370 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + P F+ ++I L DGE+N Sbjct: 371 DEKNNDLL--------RERPEEHFRDYLIILVDGESNA 400 >gi|332266371|ref|XP_003282183.1| PREDICTED: collagen alpha-4(VI) chain-like, partial [Nomascus leucogenys] Length = 1020 Score = 39.9 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 17/101 (16%), Positives = 37/101 (36%), Gaps = 15/101 (14%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAY 302 L + V +GL Y + + ++ V + +L T++ A++ Sbjct: 265 LDISSDRVRVGLAQYNDNIYPAFQL--NQHPLKSMVLEQIQNLPYRPGGTNTGSALEF-- 320 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + ++ T K + +I +TDGE+N+ Sbjct: 321 ------IRTNYLTEESGSRAKD---RVPQIVILVTDGESND 352 >gi|311273688|ref|XP_003133992.1| PREDICTED: integrin alpha-2 [Sus scrofa] Length = 1186 Score = 39.9 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 60/190 (31%), Gaps = 30/190 (15%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +A+ FL+ + L +GLI Y + + V + Sbjct: 194 DAVKNFLEKFVQGLDIGPTKTQVGLIQYANNPRVVFNL-NTFKTKAEMVEATSHTTQYGG 252 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T++ A++ A S K ++ +TDGE+++ Sbjct: 253 DLTNTFKAIQYARDSAYSAAA-------------GGRPGATKVMVVVTDGESHDGSMLKA 299 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNADSLI 399 I C+ + I I++ N L +K S P Y +NV + L+ Sbjct: 300 VIDQCN---NDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEADLL 356 Query: 400 HVFQNISQLM 409 + + + Sbjct: 357 EKAGTLGEQI 366 >gi|301767168|ref|XP_002919014.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Ailuropoda melanoleuca] Length = 849 Score = 39.9 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 51/165 (30%), Gaps = 45/165 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM KM + A + +D LS + Sbjct: 277 VIFVIDKSGSMSGR--------------------KMQQTREA---LIKILDDLSPKDQF- 312 Query: 253 YMGLIGYTTRVEKNIEP-----SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++ + + + R Y TD A+ A Q+L S Sbjct: 313 --NLISFSGDAAQWKPLLVPASAENVNQARSYAAGIQA---HGGTDINEAVLMAVQLLNS 367 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 K++ II LTDG+ ++N I+ Sbjct: 368 AKQKELMPEGTVS-----------LIILLTDGDPTMGETNPARIQ 401 >gi|297673847|ref|XP_002814961.1| PREDICTED: anthrax toxin receptor 2 isoform 1 [Pongo abelii] gi|261858584|dbj|BAI45814.1| anthrax toxin receptor 2 [synthetic construct] Length = 488 Score = 39.9 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 66/240 (27%), Gaps = 52/240 (21%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + R F + V+D SGS+ V Sbjct: 34 QEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAE---------------------- 71 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +S + + I ++++ + + K+ + + + T +K A Sbjct: 72 -RFVSPE---MRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKE 359 Q K L II LTDG + KI ++ Sbjct: 128 ----------------NEQIQKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKI---SRS 168 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + Q L+ S E + V + I ++ + + IL+ Sbjct: 169 LGASVYCVGVL--DFEQAQLERIADSKEQVFPVKGGFQ---ALKGIINSILAQSCTEILE 223 >gi|224809466|ref|NP_001139266.1| anthrax toxin receptor 2 isoform 2 [Homo sapiens] gi|306526289|sp|P58335|ANTR2_HUMAN RecName: Full=Anthrax toxin receptor 2; AltName: Full=Capillary morphogenesis gene 2 protein; Short=CMG-2; Flags: Precursor Length = 489 Score = 39.9 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 66/240 (27%), Gaps = 52/240 (21%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + R F + V+D SGS+ V Sbjct: 34 QEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAE---------------------- 71 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +S + + I ++++ + + K+ + + + T +K A Sbjct: 72 -RFVSPE---MRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKE 359 Q K L II LTDG + KI ++ Sbjct: 128 ----------------NEQIQKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKI---SRS 168 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + Q L+ S E + V + I ++ + + IL+ Sbjct: 169 LGASVYCVGVL--DFEQAQLERIADSKEQVFPVKGGFQ---ALKGIINSILAQSCTEILE 223 >gi|119626254|gb|EAX05849.1| anthrax toxin receptor 2, isoform CRA_b [Homo sapiens] Length = 322 Score = 39.9 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 66/240 (27%), Gaps = 52/240 (21%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + R F + V+D SGS+ V Sbjct: 34 QEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAE---------------------- 71 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +S + + I ++++ + + K+ + + + T +K A Sbjct: 72 -RFVSPE---MRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKE 359 Q K L II LTDG + KI ++ Sbjct: 128 ----------------NEQIQKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKI---SRS 168 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + Q L+ S E + V + I ++ + + IL+ Sbjct: 169 LGASVYCVGVL--DFEQAQLERIADSKEQVFPVKGGFQ---ALKGIINSILAQSCTEILE 223 >gi|62089412|dbj|BAD93150.1| anthrax toxin receptor 2 variant [Homo sapiens] Length = 502 Score = 39.9 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 66/240 (27%), Gaps = 52/240 (21%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + R F + V+D SGS+ V Sbjct: 48 QEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAE---------------------- 85 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +S + + I ++++ + + K+ + + + T +K A Sbjct: 86 -RFVSPE---MRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLA 141 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKE 359 Q K L II LTDG + KI ++ Sbjct: 142 ----------------NEQIQKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKI---SRS 182 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + Q L+ S E + V + I ++ + + IL+ Sbjct: 183 LGASVYCVGVL--DFEQAQLERIADSKEQVFPVKGGFQ---ALKGIINSILAQSCTEILE 237 >gi|30013741|gb|AAP04016.1| capillary morphogenesis protein 2 [Homo sapiens] Length = 489 Score = 39.9 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 66/240 (27%), Gaps = 52/240 (21%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + R F + V+D SGS+ V Sbjct: 34 QEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAE---------------------- 71 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +S + + I ++++ + + K+ + + + T +K A Sbjct: 72 -RFVSPE---MRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKE 359 Q K L II LTDG + KI ++ Sbjct: 128 ----------------NEQIQKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKI---SRS 168 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + Q L+ S E + V + I ++ + + IL+ Sbjct: 169 LGASVYCVGVL--DFEQAQLERIADSKEQVFPVKGGFQ---ALKGIINSILAQSCTEILE 223 >gi|50513243|ref|NP_477520.2| anthrax toxin receptor 2 isoform 1 [Homo sapiens] gi|21750159|dbj|BAC03731.1| unnamed protein product [Homo sapiens] Length = 488 Score = 39.9 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 66/240 (27%), Gaps = 52/240 (21%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + R F + V+D SGS+ V Sbjct: 34 QEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAE---------------------- 71 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +S + + I ++++ + + K+ + + + T +K A Sbjct: 72 -RFVSPE---MRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKE 359 Q K L II LTDG + KI ++ Sbjct: 128 ----------------NEQIQKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKI---SRS 168 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + Q L+ S E + V + I ++ + + IL+ Sbjct: 169 LGASVYCVGVL--DFEQAQLERIADSKEQVFPVKGGFQ---ALKGIINSILAQSCTEILE 223 >gi|88857994|ref|ZP_01132636.1| hypothetical protein PTD2_11429 [Pseudoalteromonas tunicata D2] gi|88819611|gb|EAR29424.1| hypothetical protein PTD2_11429 [Pseudoalteromonas tunicata D2] Length = 974 Score = 39.9 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 47/311 (15%), Positives = 94/311 (30%), Gaps = 66/311 (21%) Query: 151 MTSYDYRLQFIEHL-LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 M ++ L+ + L L+ + S I + + + +V D SGSM ++ + Sbjct: 14 MKYFNLSLKALTLLCLSVQVPAVSASDIELYVNHNVQTTEKPRVMIVFDTSGSMESSVAT 73 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 N +++ + A+ ++ D+ GL+ + I Sbjct: 74 GEYCYNRKGNQVYCADSRIKVAQAAMKALVEQ-------NTDIEFGLMRFNGGSGGYILA 126 Query: 270 SWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILT----SDKKRSFFTNFFRQGVKI 324 G ++ + ++SL T + ++Y LT + K + + Sbjct: 127 --GLGSSKEAILTKIESLRAGGSTPLAETLYESYLYLTGGGVNYAKNIADRDKDIEDNSS 184 Query: 325 PSLPFQKF----------------IIFLTDGE-------NNNFKSNVNTIKICDKAKENF 361 PF+ +I +TDG+ N N KS N+ ++ Sbjct: 185 YDSPFKPKKDDSGLDILRCDNSINMIIMTDGDPTEDGGQNGNIKSLYNSKYGAYPKSKSG 244 Query: 362 -------------------------IKIVTISINA--SPNGQRLLKTCVSS-PEYHYNVV 393 + TI + S +G LLK S + Sbjct: 245 SYLNSLAKYMLNVDLFPTTSGVTDIARTFTIGFGSGMSDDGLDLLKQTASDGGGEYLLAS 304 Query: 394 NADSLIHVFQN 404 A+ L + Sbjct: 305 TAEQLTEALKK 315 >gi|120553803|ref|YP_958154.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like [Marinobacter aquaeolei VT8] gi|120323652|gb|ABM17967.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Marinobacter aquaeolei VT8] Length = 1056 Score = 39.9 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 36/262 (13%), Positives = 74/262 (28%), Gaps = 46/262 (17%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL-KNALLLFLDSIDLLSHVKEDVYMGL 256 D+SG ++ + ++ + + K + + D L+ V +GL Sbjct: 147 DVSG---VNWSNITAHTYVSGNYRNWLSSSTETVRKTRMEIMQDVAKRLADTVTGVNIGL 203 Query: 257 IGYTTRVEKNIE-PSWGTEKVRQYVTRDMDSL----ILKPTDSTPAMKQAYQILTSDKKR 311 + + V+ + + T + + A + K Sbjct: 204 MAFNQSQNGEGGRVLNNVSNVKDNASAFKAKVDGLYPSGQTPLSETLFGAMRYFQGGK-- 261 Query: 312 SFFTNFFRQGVKIPSLPFQKFI---------IFLTDGENNNFKSNVNTI-----KICDK- 356 F G S ++ I I LTDG + ++ + I K C Sbjct: 262 PFLDRNPVSGTVDGSNNYKSPIELECQANNVILLTDGAPTSDTNHNSFIGSAIGKTCSGN 321 Query: 357 ------------------AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + + IK T+ + L + +Y NA L Sbjct: 322 CLDEIAGYMATNDMSAAFSGDQTIKTYTVGFSIDDP--LLGAAATAGGGEYYVANNAQQL 379 Query: 399 IHVFQNISQLMVHRKYSVILKG 420 F +I + ++ + + G Sbjct: 380 ADAFDDILRSVMDTSSTFVAPG 401 >gi|167534461|ref|XP_001748906.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772586|gb|EDQ86236.1| predicted protein [Monosiga brevicollis MX1] Length = 2847 Score = 39.9 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 58/161 (36%), Gaps = 24/161 (14%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK-NIEPSWGTEKVRQYVTRDMDSLILK 290 ++ + + ++ DV +G+ Y++ + S + ++ + + Sbjct: 840 RSFVRDLVSNLMSGD---NDVRVGVAEYSSTYTQIVFPFSSSQSAIDSSLSSMIQT--AG 894 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + ++ +A + S + S + +I +TDGE ++ Sbjct: 895 ATATGTSLGEAADDIGSTARSSAA----------------RVLILMTDGETSDGDEQNID 938 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + D + + I I + S + LL+ S ++ +N Sbjct: 939 PSV-DALRALGVSITAIGVGNSASESELLQI-AGSSDHVFN 977 >gi|197295155|ref|YP_002153696.1| hypothetical protein BCAS0306 [Burkholderia cenocepacia J2315] gi|195944634|emb|CAR57238.1| putative membrane protein [Burkholderia cenocepacia J2315] Length = 423 Score = 39.9 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 14/84 (16%), Positives = 40/84 (47%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+ +R ++G+ ++ +II LS+ + +G + + + ++ ++++ +A Sbjct: 1 MNSAARHPKVTRRGLHRQRGAVAIIVGLSLAVMIGFVGLALDLGKLYVTRSELQNSADAC 60 Query: 61 ILAGASKMVSNLSRLGDRFESISN 84 L+ A + S +S + I+ Sbjct: 61 ALSAARDLTSAISLSVAEADGIAA 84 >gi|134291855|ref|YP_001115624.1| hypothetical protein Bcep1808_6472 [Burkholderia vietnamiensis G4] gi|134135044|gb|ABO59369.1| conserved hypothetical protein [Burkholderia vietnamiensis G4] Length = 423 Score = 39.9 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 14/123 (11%), Positives = 47/123 (38%), Gaps = 6/123 (4%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ + ++ +I+ L++ + +G + + + ++ ++++ +A L+ A + S Sbjct: 12 RRSLHRQRGAVAIVVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSA 71 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 +S + I A F + + L+ + F + + + +S Sbjct: 72 ISLSVAEADGI------AAGHVNFAFFQKSAVQMLTDSNVTFSDALTNPFLTKTAVSTPA 125 Query: 132 MAN 134 Sbjct: 126 NVK 128 >gi|126733209|ref|ZP_01748956.1| hypothetical protein RCCS2_03619 [Roseobacter sp. CCS2] gi|126716075|gb|EBA12939.1| hypothetical protein RCCS2_03619 [Roseobacter sp. CCS2] Length = 632 Score = 39.9 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 56/359 (15%), Positives = 114/359 (31%), Gaps = 68/359 (18%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + ++ E ++ L ++S L++ G + + + ++ ++S + A+LA A+ Sbjct: 57 FIRQFRNDEDGGLIVLTLLLLISMLVVGGMAVDFMRFESERTKLQSVADRAVLAAANL-- 114 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + +R D F G+ T Sbjct: 115 ---------------NQEREAADVITDFFTA------EGFGGSIVGTPS----------- 142 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + S + +DV T Y RL I+ L + I +E Sbjct: 143 ---IQKNRNGSTIRLESIVDVDTFY-LRLVGIDTLSAPAN----ATAIEGTGNVE----- 189 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNS----------APICQDKKRTKMAALKNALLLFL 239 + LV+D+SGSM M D +S P+ + + +M L F+ Sbjct: 190 ---VSLVLDISGSMGSRMTGDAYLYDSDGEIRLDPDGNPLTERRTEDRMFFLIQEANKFI 246 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + L ++ V + L+ Y+ V + + D + S ++ Sbjct: 247 GDL-LKDEYRDRVSINLVAYSQHVRLGDDLYTALNTTPDSIDEDDNLGS-----SYGSIT 300 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSL--PFQKFIIFLTDGENNNFKSNVNTIKICDK 356 Y + + + +G +L PF I G+N ++ + D+ Sbjct: 301 DGYTAPFTYTWVNAEGDEVEEGTPDATLVGPFDSPIDLTWAGDNKVIRNPSRCVTFLDE 359 >gi|15418999|gb|AAK77222.1| capillary morphogenesis protein-2 [Homo sapiens] gi|119626253|gb|EAX05848.1| anthrax toxin receptor 2, isoform CRA_a [Homo sapiens] Length = 386 Score = 39.9 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 30/214 (14%), Positives = 57/214 (26%), Gaps = 49/214 (22%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + R F + V+D SGS+ V Sbjct: 34 QEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAE---------------------- 71 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +S + + I ++++ + + K+ + + + T +K A Sbjct: 72 -RFVSPE---MRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLA 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKE 359 Q K L II LTDG + KI ++ Sbjct: 128 ----------------NEQIQKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKI---SRS 168 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 + + + Q L+ S E + V Sbjct: 169 LGASVYCVGVL--DFEQAQLERIADSKEQVFPVK 200 >gi|158319036|ref|YP_001511544.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158114441|gb|ABW16638.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 608 Score = 39.9 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 79/239 (33%), Gaps = 40/239 (16%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + + AL G V+D SGSM N + + A K+ Sbjct: 389 ATVLTALQGSFAGVHQRGNTLAVLDTSGSM----NEEVPGSAGRSRLSVA----LDAAKS 440 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PT 292 A+ LF + DL GL ++TR+ + W + V L T Sbjct: 441 AIPLFAEDSDL----------GLWQFSTRLRGDQ--DW-----EELVPLGPMGERLGAGT 483 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFR------QGVKIPSLPFQKFIIFLTDGENNNFKS 346 S M A + + Q ++ LTDG+N++ S Sbjct: 484 RSQAVMD-AVNRIEPRGDTGLYDTALAAFRYMNQHYVPGRPNQ---VVLLTDGKNSDPGS 539 Query: 347 ----NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + I + + + ++++TI A + L + ++ Y ++ +++ V Sbjct: 540 IALDELVRILRREYSPQRPVQVITIGYGADTDLAALSRISAATGAETYPALDPNTIFEV 598 >gi|77919348|ref|YP_357163.1| hypothetical protein Pcar_1750 [Pelobacter carbinolicus DSM 2380] gi|77545431|gb|ABA88993.1| conserved hypothetical protein [Pelobacter carbinolicus DSM 2380] Length = 421 Score = 39.9 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 35/236 (14%), Positives = 75/236 (31%), Gaps = 29/236 (12%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA---------ILAG 64 + ++ A+ ++ FL + I V + +N +++A +A + +G Sbjct: 8 LKKDQNGAVIVLVAILLILFLGIAALAIDVYHVYVVRNELQNAADAGALAGARELYLESG 67 Query: 65 ASKMVS-NLSRLGDRFESIS-------NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 AS + N+ E+IS N+ A D +R H + ++ T Sbjct: 68 ASVNPNANVIANNTAIENISEDVPVEVNYNAAANTGDVQR---GHWSFAARQFTPNGSLT 124 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY---NQKI 173 I V + + + + V F + + Sbjct: 125 AID--VGNYTTEDLDNP-DPNINGGLINAVKVVVRRQDRPASSFFAQIFGFENFGITAEA 181 Query: 174 VSFIPALLRIEMGE--RP-IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 +++I I E +P E ++D +G+ +C M ++ T Sbjct: 182 IAYIGFSGTINPAELDQPIAICEESILDDNGNYNCNMGRMLNSGSNLNTSNTGGWT 237 >gi|198422516|ref|XP_002123195.1| PREDICTED: similar to EGF-like domain-containing protein [Ciona intestinalis] Length = 2053 Score = 39.9 bits (91), Expect = 0.86, Method: Composition-based stats. Identities = 27/231 (11%), Positives = 69/231 (29%), Gaps = 48/231 (20%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 S+ + E + + ++D S S+ + +KN Sbjct: 584 SYSCSCPDSNPCENQLLDLFFILDSSSSV--------------------RAANFEKMKNF 623 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK--PT 292 + +D +++ ++ V +G++ Y + + + + ++ T Sbjct: 624 VDRMVDPLNV---GQDRVRVGVMTYNRK--TFKRIDFNEAANNTDFSEKLAAIQYSGRGT 678 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ A + P++P +I +TDG + Sbjct: 679 KTAQAINFA-----------ASNCLHESRGRRPNVPLS--VILMTDG---RSQDWRQLPS 722 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-----YNVVNADSL 398 + +I I N + LL P + ++ +++ L Sbjct: 723 AAATMHQKANMFFSIGITNRVNERELLTIANDDPTKYTILPSFDSLDSADL 773 >gi|222478564|ref|YP_002564801.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] gi|222451466|gb|ACM55731.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] Length = 571 Score = 39.9 bits (91), Expect = 0.86, Method: Composition-based stats. Identities = 41/260 (15%), Positives = 79/260 (30%), Gaps = 35/260 (13%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK--KRTKMAALKNALLLFLDSIDL 244 + I L +D SGSM + + + TK+ ++ F+D + Sbjct: 220 DAGPIDIMLTLDFSGSMLYDQYGGVVSTDPIQVDGQSYGETTKIDLVELGTRQFIDYLQA 279 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGT-------------------EKVRQYVTRDMD 285 DV +G++ + P G +K+ V+ Sbjct: 280 ---QNADVQVGVVYFDGEGSGENTPRTGILEPLTTNLSAVDTALSNLRQKLANVVSDAAP 336 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 S + A I T + + +K I L+DGE+ N Sbjct: 337 STPFDNDGNPDPYSNADGIATGTYISEGLDDAQTELASNGRASAEKRNIVLSDGESFNGD 396 Query: 346 SNVN-------TIKICDKAKENFIKIVTISINASPNGQRLLKTCV----SSPEYHYNVVN 394 N N + + TI++N S + + + P + ++ + Sbjct: 397 GNTNYAPPASAAANARAASPAPATDVYTINVNGSASTLQAMAGPAGGSGGDPVFFNDIND 456 Query: 395 ADSLIHVFQNISQLMVHRKY 414 ++ VF N++ V K Sbjct: 457 PLNIPTVFGNLAAQTVAEKV 476 >gi|227518343|ref|ZP_03948392.1| pilus subunit protein [Enterococcus faecalis TX0104] gi|227074216|gb|EEI12179.1| pilus subunit protein [Enterococcus faecalis TX0104] Length = 1103 Score = 39.9 bits (91), Expect = 0.87, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDSYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|311252833|ref|XP_003125290.1| PREDICTED: vitrin-like isoform 2 [Sus scrofa] Length = 634 Score = 39.9 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 50/182 (27%), Gaps = 25/182 (13%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + L + D +G + YT T K Sbjct: 468 RTVLQFVANLSKEFDISDTDTRVGAVQYTYEQRLEFGFDQYTTKPDVLNAIKRVGYWSGG 527 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ A + L K + +K +I +TDG +V Sbjct: 528 TSTGAAINYALEQLFKKSKPNK----------------RKLMILITDG---RSYDDVRIP 568 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 + A + I + Q L+ + P ++ + V D+L I Q + Sbjct: 569 AM--VAHHKGVITYAIGVA--WAAQEELEIIATHPARDHAFFVDEFDNLYKSVPKIIQNI 624 Query: 410 VH 411 Sbjct: 625 CT 626 >gi|281338026|gb|EFB13610.1| hypothetical protein PANDA_007565 [Ailuropoda melanoleuca] Length = 854 Score = 39.9 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 56/162 (34%), Gaps = 37/162 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSMH KM K+ALL LD + ++ Sbjct: 249 SVVFVIDVSGSMHGR--------------------KMEQTKDALLKILDDMKEEDYLNII 288 Query: 252 VYMGLIG-YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++ G + + + + ++ R +V D T+ + +A +L ++ Sbjct: 289 LFSGDVTIWRDSLVQATPE--NIQEARTFVKNIHDQ---GMTNINDGLMRAINMLNKARE 343 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 +I LTDG+ N +S I+ Sbjct: 344 EHRVPERSTS-----------IVIMLTDGDANVGESRPEKIQ 374 >gi|73695954|gb|AAZ80783.1| matrilin [Biomphalaria glabrata] Length = 418 Score = 39.9 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 63/186 (33%), Gaps = 27/186 (14%) Query: 232 KNALLLFLDSID--LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 K A+ + + +S V + +I Y + I + T + V + + Sbjct: 65 KTAIKFLQEFLSQYEISSDPNGVRVSIISYGKGIYPEIGFNLTTYDTKDEVIEAIGRIPH 124 Query: 290 KP---TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 K TD+ A++ ++ ++ K I +TDG N + Sbjct: 125 KAGLRTDTGRAIQYMHEAQLANGVVRPG--------------VTKVSIVITDG---NSQE 167 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T + ++A+++ I + I + LL V N + L +I Sbjct: 168 WKLTKEAAEEARKDNIVMFAIGVGTDIRNSELL-NIAGDQSRVTKVDNYNQL----SSIK 222 Query: 407 QLMVHR 412 + + H+ Sbjct: 223 ESLAHQ 228 >gi|218961689|ref|YP_001741464.1| hypothetical protein; putative membrane protein [Candidatus Cloacamonas acidaminovorans] gi|167730346|emb|CAO81258.1| hypothetical protein; putative membrane protein [Candidatus Cloacamonas acidaminovorans] Length = 331 Score = 39.9 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 55/185 (29%), Gaps = 46/185 (24%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I +D+S SM D +++ + FL+ + Sbjct: 89 MDIIFALDVSKSMDA---------------TDMMPSRLLRAILQIGSFLEQVKTD----- 128 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDK 309 +G+I + + E VR + + + + TD A++ A Sbjct: 129 --RIGIIAFAGTATLQCPLTDDYEAVRIVLNGLNSNTVEIPGTDIGSALRLAENAF---- 182 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 P K ++ ++DGE+ + + K I++ T+ + Sbjct: 183 ---------------PEGSKSKTLVLISDGEDLQHSALREARIL----KTKGIRVYTMGV 223 Query: 370 NASPN 374 + Sbjct: 224 GSPEG 228 >gi|55729600|emb|CAH91529.1| hypothetical protein [Pongo abelii] Length = 955 Score = 39.9 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + +DS+ +S +GL+ Y+T+V + Sbjct: 672 EVVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKYM 728 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A++ ++ + + G + S + I TDG + +V Sbjct: 729 GKGSMTGLALRHMFERSFTQGE----------GARPLSTRVPRAAIVFTDG---RAQDDV 775 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 776 S--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTTKHLFYAEDFSTMDEISEKLK 831 Query: 407 QLMVH 411 + + Sbjct: 832 KGICE 836 >gi|257884610|ref|ZP_05664263.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,501] gi|257820448|gb|EEV47596.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,501] Length = 1107 Score = 39.9 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 53/152 (34%), Gaps = 41/152 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM ++ +K + F+D++ S + + Sbjct: 269 LDLVLVVDWSGSM-------------------NDNNRIGEVKIGVDRFVDTLAD-SGITD 308 Query: 251 DVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + MG +GY++ + + V+ V S T + A++ A +L+ Sbjct: 309 KINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGNMLSV 368 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 369 PNGH------------------KKVIVLLTDG 382 >gi|239622779|ref|ZP_04665810.1| von Willebrand factor type A [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239514776|gb|EEQ54643.1| von Willebrand factor type A [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 401 Score = 39.9 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 69/208 (33%), Gaps = 38/208 (18%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VVD SGSM + +A + D+ + + DV + Sbjct: 226 VVDYSGSMSGEGKNGVVKGLNAALDPDQAKKSY----------------IEPASGDVNI- 268 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + T + ++ + GT + + TD + A L S+ + S +T Sbjct: 269 LIPFETEAHRPVKAT-GTS-TSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYT 326 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 I+ +TDG +N+ + K++ + I +I + Sbjct: 327 TA---------------IVLMTDGRSNSDHQDEFESAY--KSRGRDLPIFSIMFGDADPS 369 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 Q LK+ + + L VF+ Sbjct: 370 Q--LKSLATLSNAKVFDGRSGDLAAVFR 395 >gi|156347845|ref|XP_001621774.1| hypothetical protein NEMVEDRAFT_v1g221583 [Nematostella vectensis] gi|156208029|gb|EDO29674.1| predicted protein [Nematostella vectensis] Length = 357 Score = 39.9 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 17/168 (10%), Positives = 48/168 (28%), Gaps = 28/168 (16%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ + + T + ++ + + I T A+ + Sbjct: 180 ATYTHVAAVYFNTLANFGFNLKYDINVIKTAIDNLPN--IGGGTHIGKALTYTLDNVFKV 237 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---FIKIV 365 P + ++ LTDG+ + T+ A N +++ Sbjct: 238 A---------------PRQNVKNVLVVLTDGK----SHDSVTLP--AAAVRNYGPGVEVF 276 Query: 366 TISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 + + A + L S P ++ ++V + + + + Sbjct: 277 AVGVGAGDSFVAQLNVIASDPDEDHVFHVEHFSQIESTTGAVEDEICK 324 >gi|160874259|ref|YP_001553575.1| von Willebrand factor type A [Shewanella baltica OS195] gi|160859781|gb|ABX48315.1| von Willebrand factor type A [Shewanella baltica OS195] Length = 642 Score = 39.9 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 76/232 (32%), Gaps = 54/232 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + ++D+SGSM K+ L+ AL L + V Sbjct: 236 GASNLVFLLDVSGSMA-------------------SVDKLPLLQTALKLLTAQLSAQDKV 276 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTS 307 VY G G ++ Q + ++ L T+ + QAYQ Sbjct: 277 SIVVYAGAAG--VVLDGAS------GNDTQTLNYALEQLSAGGSTNGGQGITQAYQ---- 324 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVT 366 K+ F N + +I TDG+ N ++ + I + +K K++ I + T Sbjct: 325 LAKKHFIPNGINR------------VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTT 372 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNA---------DSLIHVFQNISQLM 409 + +L++ +Y ++ D L I++ + Sbjct: 373 LGFGLGNYNDQLMEQLADKGNGNYAYIDTLNEARKVLVDELSSTLFTIAKDV 424 >gi|111020122|ref|YP_703094.1| hypothetical protein RHA1_ro03133 [Rhodococcus jostii RHA1] gi|110819652|gb|ABG94936.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 39.9 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 69/239 (28%), Gaps = 51/239 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L +D+S SM D +++AA + F + Sbjct: 86 RNRATVILAIDVSLSM---------------RATDVPPSRLAAAQAGAKTF------ADN 124 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + +GL + + + T + A+ A Q + Sbjct: 125 LTPGINLGLEAF---AGTASMLVSPITDHTATDNALDHLQLAERTATGEAIFTALQAI-- 179 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS--NVNTIKICDK-AKENFIKI 364 + + S P I+ +DG+ + + AKE + I Sbjct: 180 --------DTLAGVLGGGSTPPPARIVLESDGKQTVPADLNDPRGAFTAARLAKEQGVPI 231 Query: 365 VTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 TIS + + L + S + +AD L +QN+ Q + Sbjct: 232 STISFGTTHGAIDLNGSHIPVPVDDESLRRIAELSGGSFFTATSADELQASYQNLQQQI 290 >gi|227822378|ref|YP_002826350.1| hypothetical protein NGR_c18330 [Sinorhizobium fredii NGR234] gi|227341379|gb|ACP25597.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 602 Score = 39.9 bits (91), Expect = 0.90, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 60/199 (30%), Gaps = 45/199 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E P + ++D+SGSM + K+ LKNA L +D + Sbjct: 231 EAPRANLVFLIDVSGSMD-------------------EPDKLPLLKNAFRLLVDRLRPDD 271 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQIL 305 V Y G K + + +D+L T + AYQ L Sbjct: 272 TVSIVTYAG--------NAGTVLEPTAVKDKTKILSAIDTLQPGGSTAGAAGIDAAYQ-L 322 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENFIKI 364 N I+ TDG+ N S+ ++ + + + I + Sbjct: 323 AEKAFVRDGVNR---------------ILLATDGDFNVGPSSDEELKRMVETKRRSGIFL 367 Query: 365 VTISINASPNGQRLLKTCV 383 + L++T Sbjct: 368 SVLGFGRGNYNDALMQTIA 386 >gi|108760371|ref|YP_628476.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108464251|gb|ABF89436.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 422 Score = 39.5 bits (90), Expect = 0.91, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 41/124 (33%), Gaps = 28/124 (22%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 E G+R + LV+D SGSM K+A + A + + Sbjct: 39 AETGQRVPVSLALVLDRSGSM--------------------NGQKLADARRAATELVQRL 78 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQA 301 + I Y T V + TE+ R+ + + L T+ + A+ A Sbjct: 79 KPED------RLAFIDYGTDV-RVQPSRRMTEEAREELLTLISGLQDDGSTNISGALDAA 131 Query: 302 YQIL 305 L Sbjct: 132 ANAL 135 >gi|172065275|ref|YP_001815987.1| hypothetical protein BamMC406_5998 [Burkholderia ambifaria MC40-6] gi|171997517|gb|ACB68434.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 423 Score = 39.5 bits (90), Expect = 0.92, Method: Composition-based stats. Identities = 14/123 (11%), Positives = 49/123 (39%), Gaps = 6/123 (4%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++G+ ++ +II L++ + +G + + + ++ ++++ +A L+ A + S Sbjct: 12 RRGLHRQQGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSA 71 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 +S + I A F + + + + + F ++ + + ++ Sbjct: 72 ISLQVAEADGI------AAGHANFAFFQQNAVQMQTDSNVTFSDSLTNPFLTKTAVATPA 125 Query: 132 MAN 134 Sbjct: 126 NVK 128 >gi|159897645|ref|YP_001543892.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159890684|gb|ABX03764.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 562 Score = 39.5 bits (90), Expect = 0.92, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 83/239 (34%), Gaps = 50/239 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + A+ ++ + + L++D SGSM ++ ++ K Sbjct: 366 SAEVLQAITQLWQQHKKQVDVALIIDTSGSM-------------------RQENRLREAK 406 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP- 291 AL F+D +V+ + ++T + + S R + +D L+ Sbjct: 407 TALGDFIDIFADQDNVQVTI------FSTNATELSDLS-PIGPKRADLHTRIDGLVADGE 459 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + Y + + + + ++ LTDGE+ ++ + Sbjct: 460 TRLYSTIGEVYTDIQQQTE----------------VQRIRALVVLTDGEDTASSLSLEQL 503 Query: 352 KICDKAKEN----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 ++ +++ IKI TI+ S Q +L+ + ++ V+ I+ Sbjct: 504 N--EQIRQDESGTSIKIFTIAY-GSDANQEVLQRIAEITGAKSYTGDPATIRQVYHEIA 559 >gi|149437045|ref|XP_001515975.1| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 [Ornithorhynchus anatinus] Length = 949 Score = 39.5 bits (90), Expect = 0.93, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 70/226 (30%), Gaps = 48/226 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM A K+ K AL L + + Sbjct: 306 VVFVLDSSASMVGA--------------------KLKQTKEALFTILHDLRPEDNFN--- 342 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++G+++R++ K+ +R T+ A++ ++L Sbjct: 343 ---IVGFSSRIKVWKDQLVPVTPNSIRDGKVYIHHMSPSGGTNINGALQTGIRLLNDFVA 399 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 + I+FLTDG + T KI + KE + + TI Sbjct: 400 HNDI-----------DARSVSLIVFLTDGR--PTVGEIQTPKILNNTKEAARDRVCLFTI 446 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I + + L L+ C + + A L + I + Sbjct: 447 GIGDDVDFKLLEKLSLENCGMTRRFQVEADAAAQLKGFYDEIGTPL 492 >gi|290970585|ref|XP_002668184.1| predicted protein [Naegleria gruberi] gi|284081421|gb|EFC35440.1| predicted protein [Naegleria gruberi] Length = 217 Score = 39.5 bits (90), Expect = 0.94, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 50/156 (32%), Gaps = 27/156 (17%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + L++D SGSM S + T+ + A +SI + Sbjct: 74 LQLVLIIDHSGSMGSFDEDATGQNRSKGLVDSNHWTRYDNVIQAAKYLSESILQYDKDGK 133 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 LI + + V + I + + + V + T+ A++ A++ +D + Sbjct: 134 V---PLIFFDSNVREVI-----VDSIPRLVAAFEKNQPNSSTNLLGALELAFKNHVNDHE 185 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 FI+F TDG N Sbjct: 186 NVL------------------FIVF-TDGSPNGESY 202 >gi|311253580|ref|XP_003125597.1| PREDICTED: matrilin-2-like [Sus scrofa] Length = 423 Score = 39.5 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 67/173 (38%), Gaps = 24/173 (13%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSL 287 +K + +DS+ +S +GL+ Y+T+V G+ K + M Sbjct: 132 EIVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLRNFGSAKDMKKAVASM-KY 187 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + + G + S + I TDG + + Sbjct: 188 MGKGSMTGLALKHMFERSFTQIE----------GARPLSARVPRVAIVFTDG---RAQDD 234 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 V+ + KAK N I + + + + + L+ S P ++ + + ++ Sbjct: 235 VS--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTDKHLFYAEDFSTM 283 >gi|311253578|ref|XP_001926459.2| PREDICTED: matrilin-2 [Sus scrofa] Length = 707 Score = 39.5 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 67/173 (38%), Gaps = 24/173 (13%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSL 287 +K + +DS+ +S +GL+ Y+T+V G+ K + M Sbjct: 416 EIVKQFVTGIIDSL-TISPKA--ARVGLLQYSTQVRTEFTLRNFGSAKDMKKAVASM-KY 471 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + + G + S + I TDG + + Sbjct: 472 MGKGSMTGLALKHMFERSFTQIE----------GARPLSARVPRVAIVFTDG---RAQDD 518 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 V+ + KAK N I + + + + + L+ S P ++ + + ++ Sbjct: 519 VS--EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTDKHLFYAEDFSTM 567 >gi|296208409|ref|XP_002751081.1| PREDICTED: calcium-activated chloride channel regulator 4 [Callithrix jacchus] Length = 931 Score = 39.5 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 74/218 (33%), Gaps = 55/218 (25%) Query: 195 LVVDLSGSM--HCAMNSDPEDVNSAPICQDKKRTKMAALK-NALLLFLDSIDLLSHVKED 251 LV+D SGSM H +N + + + + + ++ N+ L+ + + E Sbjct: 309 LVLDTSGSMLSHNRLNRMNQAAKQFLMQTVENGSWVGMVRFNSTATILNKLIQIISSNE- 367 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 R + + + T +K A+Q++ + Sbjct: 368 -------------------------RNTLLEKLPTRAQGGTSICSGIKSAFQVIGELYSQ 402 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 + I+ LTDGE+N S C D+ K++ + I++ Sbjct: 403 LDGSE----------------IVLLTDGEDNTASS-------CIDEVKQSGAIVHFIALG 439 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADS--LIHVFQNIS 406 + + + ++ + Y A++ LI F ++ Sbjct: 440 KDADKAVIEMSNITGGSHLYASDEAENNGLIDAFGALT 477 >gi|163754424|ref|ZP_02161546.1| aerotolerance-related membrane protein [Kordia algicida OT-1] gi|161325365|gb|EDP96692.1| aerotolerance-related membrane protein [Kordia algicida OT-1] Length = 344 Score = 39.5 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 69/195 (35%), Gaps = 45/195 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++E +R I VD+S SM D +++ K + +++ Sbjct: 79 KLETVKREGVDIVFAVDVSKSMLAE---------------DISPSRLDKSKRIVSEIINN 123 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQ 300 + +G+I Y R + + + ++ + L + T A++ Sbjct: 124 LGSD-------RIGIIAYAGRAVPQLPITTDFSAAKMFLNNLNTNMLSSQGTAIDDAIRL 176 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A D +++ + ++ ++DGE++ + ++ ++A + Sbjct: 177 AKTY-YDDVEQTN-----------------RVLVIISDGEDHTGGAG----QLAEEATKE 214 Query: 361 FIKIVTISINASPNG 375 IK TI + + G Sbjct: 215 GIKTYTIGVGTTKGG 229 >gi|156383825|ref|XP_001633033.1| predicted protein [Nematostella vectensis] gi|156220097|gb|EDO40970.1| predicted protein [Nematostella vectensis] Length = 204 Score = 39.5 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 50/168 (29%), Gaps = 22/168 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ + + T + ++ + + I T A+ + Sbjct: 57 ATYTHVAAVYFNTLANFGFNLKYDINVIKTAIDNLPN--IGGGTHIGKALTYTLDNVFKV 114 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 P + ++ LTDG+ ++ + + + +++ + Sbjct: 115 A---------------PRQNVKNVLVVLTDGK-SHDSVTLPAAAVRNYG--PGVEVFAVG 156 Query: 369 INASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRKY 414 + A + L S P ++ ++V + + + + KY Sbjct: 157 VGAGDSFVAQLNVIASDPDEDHVFHVEHFSQIESTTGAVEDEICKGKY 204 >gi|126314401|ref|XP_001377042.1| PREDICTED: similar to Procollagen, type VI, alpha 2, partial [Monodelphis domestica] Length = 762 Score = 39.5 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 61/190 (32%), Gaps = 45/190 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL--S 246 I V+D S S+ T + KN ++ ++ + + Sbjct: 353 GALDIVFVIDSSESIG--------------------YTNFSLEKNFVINVVNRLGSITKD 392 Query: 247 HVKED-VYMGLIGYTTR-VEKNIEPSWG----TEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + +G++ Y+ + I+ G + ++ V + I T + A+ Sbjct: 393 PNSDTGTRIGVVQYSHDGTFEAIKLDDGRIGSLAQFKEEVKKL--EWIAGGTWTPSALNY 450 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 Y L +R + F + +TDG ++ + + +CDK + Sbjct: 451 TYNELIKGSRRKKT---------------RVFAVVITDGRHDPRDNEQSLKALCDKVENV 495 Query: 361 FIKIVTISIN 370 + + I Sbjct: 496 VVTAIGIGDM 505 >gi|150389538|ref|YP_001319587.1| von Willebrand factor, type A [Alkaliphilus metalliredigens QYMF] gi|149949400|gb|ABR47928.1| von Willebrand factor, type A [Alkaliphilus metalliredigens QYMF] Length = 551 Score = 39.5 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 97/276 (35%), Gaps = 53/276 (19%) Query: 135 NRLDSSN-NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM----GERP 189 N+LD+S I + + + + + Y ++I++ ++ + ++ Sbjct: 113 NQLDTSQFPKISLYFSALNAQGIPILNLTRDSFEIYEERIINSDTKPVKSDTFLANLQQV 172 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + L++D SGSM M K+A FL+ +D + Sbjct: 173 PLSVSLILDNSGSMSGN--------------------PMTQAKSAAKQFLNYVDFSNGD- 211 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + +I + + V I +D ++ A + S+ Sbjct: 212 ---QVEIIEFNSDVYIRIPY---------------------GSDIK-SLNTAIDTMESNS 246 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + + + V+ S K I+ TDGE N +V+ + + ++ I I I + Sbjct: 247 QTALYDALYTGLVRAYSQSGPKCILAFTDGEENASIRSVSEVT--ELSRATSIPIFIIGV 304 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + L + + ++ A L +++ + Sbjct: 305 GSLIDEESLKEIAEQTGGEYFYSPTAVELEQIYKTV 340 >gi|326932831|ref|XP_003212516.1| PREDICTED: cartilage matrix protein-like [Meleagris gallopavo] Length = 493 Score = 39.5 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 58/172 (33%), Gaps = 22/172 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 I+ L +G+I Y + V+ K + T + A Sbjct: 62 LSRVIEGLDVGPNSTRVGVINYASAVKNEFSLKTHQTKAGLLQAVRRIEPLSTGTMTGLA 121 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 ++ A SD + G ++ S K I +TDG + +V+ +A Sbjct: 122 IQFAISRAFSDAE----------GARLRSSNINKVAIVVTDGRPQDGVQDVS-----ARA 166 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN---ADSLIHVFQN 404 ++ I+I I + L+ S P ++ V + + L H FQ Sbjct: 167 RQAGIEIFAIGVG--RVDMHTLRQIASEPLDDHVDYVESYSVIEKLTHKFQE 216 >gi|289442929|ref|ZP_06432673.1| LOW QUALITY PROTEIN: membrane protein [Mycobacterium tuberculosis T46] gi|289415848|gb|EFD13088.1| LOW QUALITY PROTEIN: membrane protein [Mycobacterium tuberculosis T46] Length = 246 Score = 39.5 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 72/233 (30%), Gaps = 51/233 (21%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 ++V+D+S SM D + ++M A + A F D + + Sbjct: 11 DVVIDVSQSM---------------RATDVEPSRMVAAQEAAKQFADELTPG------IN 49 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +GLI Y + P+ E + + + + T + A+ A Q + + Sbjct: 50 LGLIAYAGTATVLVSPTTNREATKNALDKLQFADR---TATGEAIFTALQAIATVGAV-- 104 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICD-KAKENFIKIVTISIN 370 + P I+ +DG+ N AK+ + I TIS Sbjct: 105 --------IGGGDTPPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFG 156 Query: 371 AS-------------PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 P +K S YN L V+ ++ Q + Sbjct: 157 TPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELRAVYSSLQQQI 209 >gi|229825750|ref|ZP_04451819.1| hypothetical protein GCWU000182_01113 [Abiotrophia defectiva ATCC 49176] gi|229790313|gb|EEP26427.1| hypothetical protein GCWU000182_01113 [Abiotrophia defectiva ATCC 49176] Length = 1659 Score = 39.5 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 46/161 (28%), Gaps = 43/161 (26%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 I I LV+D SGSM + K A F +D L Sbjct: 82 QNNKILDIVLVMDTSGSMEGK--------------------SLENAKKAANNF---VDKL 118 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +V +G++ + + E + ++ + T + +++A +L Sbjct: 119 LPQNNNVNIGIVSFAEKGEIKSGLTRNVTTLKNAIKGLKAD---GGTYTQQGLEKAATVL 175 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 +K ++ + DGE Sbjct: 176 NGAP-----------------AEHKKVMVVIGDGEPTYANG 199 >gi|154687789|ref|YP_001422950.1| YwmC [Bacillus amyloliquefaciens FZB42] gi|154353640|gb|ABS75719.1| YwmC [Bacillus amyloliquefaciens FZB42] Length = 228 Score = 39.5 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 77/249 (30%), Gaps = 45/249 (18%) Query: 173 IVSFIPALLRIEMGERPIFL---IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + + PA E+ + +++D SGSM ++ +K Sbjct: 15 LTAASPAFAAENANEKTLSFKNNTAILLDASGSMAKRIDGV---------------SKYN 59 Query: 230 ALKNALLLFLDSIDLLSHVKEDVY--MGLIGYTTRVEKNIEPS--WGTEKV--RQYVTRD 283 K ++ F D I S V+ V+ G + +V+ +G ++ + ++ Sbjct: 60 MAKEEIVRFADQIKSKSQVRMTVFGSEGNNKNSGKVQSCESIRGVYGFQRFDRQSFLNSL 119 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T A++ A T K K + LTDGE Sbjct: 120 NGIGPTGWTPIAKALEDAKASFTGLHKLG-----------------SKSVFLLTDGEETC 162 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVF 402 V T K + ++ IK+ I + L + +Y + + +F Sbjct: 163 GGDPVKTAK---ELRKQHIKVNVIGFDFKEGFNGQLHEIAKAGGGKYYEAHSQKDMNRIF 219 Query: 403 QNISQLMVH 411 + + Sbjct: 220 TMAASSLAE 228 >gi|256852741|ref|ZP_05558111.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T8] gi|256711200|gb|EEU26238.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T8] gi|315030743|gb|EFT42675.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4000] Length = 1105 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 29/151 (19%), Positives = 54/151 (35%), Gaps = 39/151 (25%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-DLLSHVK 249 + LVVD SGSM + ++ ++ + F+D++ D Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLADSGITNN 308 Query: 250 EDVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 309 ININMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATP 368 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 369 NGH------------------KKVIVLLTDG 381 >gi|86147465|ref|ZP_01065777.1| hypothetical protein MED222_21464 [Vibrio sp. MED222] gi|85834758|gb|EAQ52904.1| hypothetical protein MED222_21464 [Vibrio sp. MED222] Length = 421 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 17/104 (16%), Positives = 36/104 (34%), Gaps = 3/104 (2%) Query: 18 EKANFSIIFALSVMSFLLLIG-FLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 ++ +IF + L + + + K +++A ++A LA A+ + ++ + Sbjct: 11 KQGGLVVIFVTIALLVFLAVSALAVDINHMLVNKTRLQNAVDSAALAAATILDNSKDQAA 70 Query: 77 DRFESIS--NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 E S N A + F + S F T Sbjct: 71 VSAEVTSTLNAMAGASGNHEFNFSTAVVSVEYSNDPQSFAGTTT 114 >gi|169338033|ref|ZP_02621346.2| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] gi|169295279|gb|EDS77412.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] Length = 1242 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 27/219 (12%), Positives = 64/219 (29%), Gaps = 50/219 (22%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + + L++A + + + K + +GL+ + T + + +V Sbjct: 164 KCYRQSSYNEKNRLQHAKESAIKFVQKFENDKN-ISIGLVSFDTTANSQKDITSKLNEVE 222 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + T+ +K A Q+L K + K++I ++ Sbjct: 223 DSINSLKVA-DNGATNIEAGLKSAQQLL-----------------KKGNKDADKYVILMS 264 Query: 338 DGENNNFKSNVN-------------------------------TIKICDKAKENFIKIVT 366 DG F +I + K++ I Sbjct: 265 DGFPTAFDYAGEKVEKNFNYHEIQDNTFINFGYYDYSGYAMKHSINQANSLKKDGINSFI 324 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + N ++L ++ + N D+L + + Sbjct: 325 IGFSEGANSEKLNNIAKAAGGEYEEAKNTDTLNGAYDKL 363 Score = 36.4 bits (82), Expect = 8.0, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 62/165 (37%), Gaps = 29/165 (17%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 SG+ N +K R+++ ++K F+D E+ + ++ Y Sbjct: 697 SGNFIINGKKYYVKDNKVYEFNEKDRSRLDSVKKVANDFVDKFK----NDENTEIAIVRY 752 Query: 260 TTRVE-----KNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSF 313 +++ N G + + + + ++SL T+ ++++Y IL K S Sbjct: 753 SSKANIVLDGSNKIFLNGKDN--EIIKKRINSLKADGGTNIGDGIRKSYSILDKCDKDS- 809 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 +K++I +TDG + NTIK + K Sbjct: 810 ----------------EKYMILMTDGVPTAYTCYANTIKASNNCK 838 >gi|8569519|pdb|1QC5|B Chain B, I Domain From Integrin Alpha1-Beta1 Length = 192 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 69/190 (36%), Gaps = 30/190 (15%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +++ FL D ++ + + +G++ Y V + + + Sbjct: 20 DSVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL-NKYSSTEEVLVAAKKIVQRGG 78 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + + A + ++ + +K ++ +TDGE N Sbjct: 79 RQTMTALGIDTARKEAFTEA-------------RGARRGVKKVMVIVTDGE---SHDNHR 122 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLI 399 K+ ++ I+ +I+I S N L +K+ S P ++ +NV + +L+ Sbjct: 123 LKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDEIALV 182 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 183 TIVKTLGERI 192 >gi|320007358|gb|ADW02208.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 428 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 74/213 (34%), Gaps = 42/213 (19%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 +ELV+D+SGSM ++++A K A D+L V E+V Sbjct: 41 VELVLDVSGSMRTRDMGGE--------------SRISAAKQAFN------DVLDAVPEEV 80 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G+ + G + +Q + P D T A K A L Sbjct: 81 QLGIRTLGADYPGDDRKV-GCKDTKQ-------LYPVGPLDRTEA-KTAVATLAPTGWTP 131 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE---NFI--KIVTI 367 + + I+ +TDGE+ + C+ A++ I I T+ Sbjct: 132 IGPALLGAADDLDGGDSTRRIVLITDGEDTCGPLDP-----CEVARDIAARGIHLVIDTL 186 Query: 368 SINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 + ++ L TC++ + V N D L Sbjct: 187 GLVPDAKIRQQL-TCIAEATGGTYTAVQNTDEL 218 >gi|226310168|ref|YP_002770062.1| hypothetical protein BBR47_05810 [Brevibacillus brevis NBRC 100599] gi|226093116|dbj|BAH41558.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 437 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 85/235 (36%), Gaps = 58/235 (24%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 F +E+++D SGSM + +TKM K A+ F +++ E Sbjct: 136 FNVEIILDASGSMAGKIG---------------DKTKMQLAKEAIQEFAEAL------PE 174 Query: 251 DVYMGLIGYTTR---VEKNIEPSWGTEK----VRQYVTRDMDSL-----ILKPTDSTPAM 298 D + L Y + +++ + S G+ + ++ Y + ++ T ++ Sbjct: 175 DARISLRVYGHKGSNADEHKQLSCGSSEMVYPLQAYDAKRLEQALDMFEPTGWTSIAHSL 234 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + A + L + I ++DG + N + + + Sbjct: 235 RLAQEDLAGFEADKNTN----------------VIYLVSDGIETC---DGNPVAVAKELS 275 Query: 359 ENFIK--IVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL---IHVFQNISQ 407 ++ I + I + + GQ+ LK +S + NV N + + + I+Q Sbjct: 276 QSKIMPLLNVIGFDVNAEGQKQLKEIAHASEGLYANVTNREQFKQELERAKEIAQ 330 >gi|170015985|ref|NP_001116166.1| complement component 2 [Xenopus laevis] gi|169642443|gb|AAI60743.1| LOC734198 protein [Xenopus laevis] Length = 662 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 46/276 (16%), Positives = 91/276 (32%), Gaps = 50/276 (18%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 Y D + QF L + + ++I + I + ++D S S+ Sbjct: 119 YAYSFDLPEDVQEQFKASLSGILNIMERPASFGRTIKIT--KDGILNVYFLLDASQSVG- 175 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 + K +D + L + G+I Y T + Sbjct: 176 -------------------QANFDIYKACSEYLVDELALFDM---TIQFGIISYATVPKV 213 Query: 266 NIEP----SWGTEKVRQYVT---RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 I S + V + + D T++ A+++ Y +++S K+ ++ Sbjct: 214 IIPIYDEESDNNDHVLTLIRNGLKYSDHKDKTGTNTKAALEEIYSMMSSQKE-TYKNESV 272 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK------AKENFIKIVTISINAS 372 II LTDG+ N +TIK + +E+++ + T I Sbjct: 273 WN-------SIHHIIILLTDGKANLGGRPAHTIKRIEDFLDIKHKREDYLDVYTFGI-GP 324 Query: 373 PNGQRLLKTCVSSPE---YHYNVVNADSLIHVFQNI 405 L S + + + + +A+ + VFQ I Sbjct: 325 EVDMADLSEMASKKDGETHVFRMESANEMKTVFQKI 360 >gi|82619290|gb|ABB85337.1| complement C2 [Xenopus laevis] Length = 753 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 46/276 (16%), Positives = 91/276 (32%), Gaps = 50/276 (18%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 Y D + QF L + + ++I + I + ++D S S+ Sbjct: 210 YAYSFDLPEDVQEQFKASLSGILNIMERPASFGRTIKIT--KDGILNVYFLLDASQSVG- 266 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 + K +D + L + G+I Y T + Sbjct: 267 -------------------QANFDIYKACSEYLVDELALFDM---TIQFGIISYATVPKV 304 Query: 266 NIEP----SWGTEKVRQYVT---RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 I S + V + + D T++ A+++ Y +++S K+ ++ Sbjct: 305 IIPIYDEESDNNDHVLTLIRNGLKYSDHKDKTGTNTKAALEEIYSMMSSQKE-TYKNESV 363 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK------AKENFIKIVTISINAS 372 II LTDG+ N +TIK + +E+++ + T I Sbjct: 364 WN-------SIHHIIILLTDGKANLGGRPAHTIKRIEDFLDIKHKREDYLDVYTFGI-GP 415 Query: 373 PNGQRLLKTCVSSPE---YHYNVVNADSLIHVFQNI 405 L S + + + + +A+ + VFQ I Sbjct: 416 EVDMADLSEMASKKDGETHVFRMESANEMKTVFQKI 451 >gi|99078203|ref|YP_611461.1| von Willebrand factor, type A [Ruegeria sp. TM1040] gi|99035341|gb|ABF62199.1| von Willebrand factor type A [Ruegeria sp. TM1040] Length = 477 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 71/219 (32%), Gaps = 48/219 (21%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ K+ ++ + L ++ +++ Sbjct: 28 LVLDASGSMWGQIDGV---------------AKITIAQDVMQHLLKTL----PENQEL-- 66 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 GL+ Y R + + D++ LI S A+ QA ++ K Sbjct: 67 GLMAYGHRRKGD--------------CNDIEQLIAPAAGSRQAISQAVTQISPKGKTPLS 112 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD-----KAKENFIKIVTISI 369 + + S + +I ++DGE + C +A+ + I Sbjct: 113 AAVMQAADALRSSEEKATVILISDGE------ETCGLDPCAVGAELEARGVDFTLHAIGF 166 Query: 370 -NASPNGQRLLKTCVSSPEYHY-NVVNADSLIHVFQNIS 406 A + L+ + Y + +A L ++ Sbjct: 167 GIADDAARAQLQCLAENTGGFYRDASSASELTAALAQVA 205 >gi|289606823|emb|CBI60997.1| unnamed protein product [Sordaria macrospora] Length = 599 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 19/135 (14%), Positives = 35/135 (25%), Gaps = 38/135 (28%) Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT------SDKKRSFFTNFFRQGVKI 324 Y+ + T M + L+ S + KI Sbjct: 107 NNRSGFVSYLNGLVA---RGGTYHDIGMIWGARFLSTTGLFKSATPETNDVTDPDNPAKI 163 Query: 325 PSLPFQKFIIFLTDGENNNFKSN-----------------------------VNTIKICD 355 +K++IF+TDG+ + ++ C+ Sbjct: 164 RGFSVKKYMIFMTDGDMSPTWNDYSAYGIEYLDGRVNGSPTTDNAALLARHLQRFRMACN 223 Query: 356 KAKENFIKIVTISIN 370 AK I I I+ + Sbjct: 224 AAKAKGIDIWVIAFS 238 >gi|227552253|ref|ZP_03982302.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257895164|ref|ZP_05674817.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|293378024|ref|ZP_06624201.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] gi|227178583|gb|EEI59555.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257831729|gb|EEV58150.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|292643342|gb|EFF61475.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] Length = 498 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 52/152 (34%), Gaps = 41/152 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM ++ +K + F+D++ S + + Sbjct: 312 LDLVLVVDWSGSM-------------------NDNNRIGEVKIGVDRFVDTLAD-SGITD 351 Query: 251 DVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + MG +GY+ + + V+ V S T + A++ A +L+ Sbjct: 352 KINMGYVGYSIEGYSYSNGAVQMGSFDSVKNQVKSITPSWTNGGTFTQKALRDAGNMLSV 411 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 412 PNGH------------------KKVIVLLTDG 425 >gi|198435216|ref|XP_002126368.1| PREDICTED: similar to integrin alpha Hr1 precursor-like [Ciona intestinalis] Length = 1274 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 61/411 (14%), Positives = 135/411 (32%), Gaps = 68/411 (16%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA-GASKM-----VSNLSRL 75 I ++ V+ +I + + ++S ++A++ GAS + + +++ + Sbjct: 8 MYICASMIVILCASMISAQNIETREFHSLSPIKS--DSAVIHFGASVVIKTSPMKSIAYV 65 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 G + N + K + + +S + I +SS ++ N Sbjct: 66 GAPADGNKNGSVYKCSFSGKSYGNTSCSKDISFDNNAAIGMSIG--ADSSFSNLYICGNQ 123 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 S T V Y + + F + I I Sbjct: 124 HTTSCPTTPLQKRMVGACYKKPMT----------SSTATMFKTPCVPGCPKIILIADIMF 173 Query: 196 VVDLSGSMH-----CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 V+D S S+ A+N + V+ D L+ + F + + Sbjct: 174 VLDDSSSVDDTAFRSALNWIIQVVSYFSSYIDS-----GDLRVGVYGFSN-----DDHRS 223 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDK 309 + +GL W + +++ + ++ T + A+K+ ++ ++ Sbjct: 224 GIRIGLR------------KWTSATLKKQIGELLNVKSTGAGTYISHAIKETVKVFEAN- 270 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 K II LTDG +++ D A++ I +V++ + Sbjct: 271 ---------------GRKGISKEIILLTDGGASDWW---LLKGEADTARDKGIVLVSVGV 312 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 S N +LL + + + ++L V N+ + K +V L+G Sbjct: 313 GTSVNNDQLL-AIAGNKSRVFQATDYNTLDEVVNNVVSTIDAIKITVNLEG 362 >gi|74193948|dbj|BAE36901.1| unnamed protein product [Mus musculus] Length = 1111 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 60/176 (34%), Gaps = 12/176 (6%) Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + S + ++ + R+K +AL + + +S V + + Sbjct: 297 LGILYESCNQPQDAIKCYLNAARSKSCNNTSALTSRIKFLQSVSDTWNSVQTA--SHHSV 354 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG- 321 +K + +K++ + T A Q + + Q Sbjct: 355 QQKVYTQCFTAQKLQSFGKDQQPPFQTGSTRYLQAASTNDQNQNGNHTLPQNSKGDAQNH 414 Query: 322 -VKIPSLPFQKFIIFLTDGENNNFKS--------NVNTIKICDKAKENFIKIVTIS 368 ++IP+ QK I F + +++ KS + +T IC AK++ I IS Sbjct: 415 FLRIPTSEEQKIINFTKESKDSRSKSLTSKTSRKDRDTSNICVNAKKHSNHIYQIS 470 >gi|297201072|ref|ZP_06918469.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] gi|197712143|gb|EDY56177.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] Length = 516 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 31/210 (14%), Positives = 57/210 (27%), Gaps = 45/210 (21%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D S SM P +D + AL LD + Sbjct: 337 LVLDRSASM-------------RPYYKDGSAQALGEQTLALAAHLDPESKVH-------- 375 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 ++ ++T ++ E G + L S A + + Sbjct: 376 -VVFFSTELDGTGEL--GLTDHENKIDELHAGLGRMGRTSY---HAAVEEVL-------- 421 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT-ISINASP 373 G P P ++F TDG + T + + AK + + ++ Sbjct: 422 ---AHHGKNAPDTPA--LVVFQTDGAPDA--KTPATQALTEAAKTHPNVFFSFVAFGDPE 474 Query: 374 N-GQRLLKTCV-SSPEYHYNVVNADSLIHV 401 N L+ ++ + L Sbjct: 475 NKAFDYLRKLKLANTSHFLAGETPKELTDA 504 >gi|77457690|ref|YP_347195.1| VCBS [Pseudomonas fluorescens Pf0-1] gi|77381693|gb|ABA73206.1| putative secreted protein, hemolysin [Pseudomonas fluorescens Pf0-1] Length = 2887 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 77/224 (34%), Gaps = 32/224 (14%) Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 I +L G + Y N + ++T ++++ S + Sbjct: 1989 SGAITYTLVGSATGTYGQIQLNADGTYTYTLTSAPKTTPNANDGPN------TLSESFTY 2042 Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL------VVDLSGSMHCAMNSDPE 212 + + L N + +V+ + + + ER + +E+ V+D+SGSM + P Sbjct: 2043 KATDALGNSTTSTIVVNIVDDVPKAVASERSVAAVEIDSNILIVLDISGSMA-DASGVPG 2101 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 +++ K A+ LD D L DV + L+ +++ + W Sbjct: 2102 L------------SRLELAKQAISALLDKYDDL----GDVKVQLVTFSSN-ATDRTSVWV 2144 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 + + + + T+ A+ Y + K + N Sbjct: 2145 DVATAKTLLAGLSA--GGGTNYDAAVATMYNAFNTSGKLTGAQN 2186 >gi|311265878|ref|XP_003130868.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Sus scrofa] Length = 944 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 45/223 (20%), Positives = 77/223 (34%), Gaps = 50/223 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM A K+ K+AL L + H Sbjct: 296 VVFVLDSSASMVGA--------------------KLRQTKDALFTILHDLRPQDHFN--- 332 Query: 253 YMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +IG++ R++ K+ S VR TD A++ +L Sbjct: 333 ---IIGFSNRIKVWKDHLVSVTPNNVRDGKVYIHHMSPSGGTDINGALQTGIALLHDS-- 387 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTI 367 R ++ S+ I+FLTDG+ +T KI + +E + I T+ Sbjct: 388 ------VARHDLEDRSVSL---IVFLTDGK--PTVGETHTPKILNNTREAARGRVCIFTV 436 Query: 368 SINASPNGQRL----LKTCVSSPEYHYNVVNA-DSLIHVFQNI 405 I + + L L+ C + + + +A LI + I Sbjct: 437 GIGDDVDFRLLEKLSLENCGFT-RHVHEDEDAGAQLIGFYDEI 478 >gi|253996156|ref|YP_003048220.1| von Willebrand factor type A [Methylotenera mobilis JLW8] gi|253982835|gb|ACT47693.1| von Willebrand factor type A [Methylotenera mobilis JLW8] Length = 2114 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 38/256 (14%), Positives = 79/256 (30%), Gaps = 25/256 (9%) Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 + N +A+ A+ S LS + + L++DA I Sbjct: 1218 NGNPSAVADSATVYESGLSTGTQAGVLATTASGNLLVNDAGVSTTTTISSI-----NGVT 1272 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 T V S+ ++ AN + V + + + Q N + Sbjct: 1273 PTSGTITVTSATGTLVVNANTGAYTYTLNAATTEGVNDKPTFNYVLTDSVTGQSTNANLT 1332 Query: 175 ------SFIPALLRIEMGERPI---FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + + + + + + +V+D SGSM D + S D Sbjct: 1333 VNIVDDAPVGGNITQTLQAASAALTYNVVIVLDRSGSMA----QDANGLWSNQSGYDAST 1388 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +M K A+ + D L +V + + +++ ++ W + V V + Sbjct: 1389 NRMEIAKEAIAQLIARYDGLGNVN----VKFVTFSSDAVES---EWYIDNVTGAVRYVDN 1441 Query: 286 SLILKPTDSTPAMKQA 301 T + A+ + Sbjct: 1442 VQAGGGTQYSTALNET 1457 >gi|146306837|ref|YP_001187302.1| hemolysin-type calcium-binding region [Pseudomonas mendocina ymp] gi|145575038|gb|ABP84570.1| Hemolysin-type calcium-binding region [Pseudomonas mendocina ymp] Length = 3184 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 39/262 (14%), Positives = 83/262 (31%), Gaps = 40/262 (15%) Query: 98 IKNHIKESLS---GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 + +S + + +I S+ + +N D + + +++ + + Sbjct: 2651 PSGQLTQSFTYVVSDGDGDRASANLDICIRGDRSVLVVGSNADDEQGSNVLHHVP--SQF 2708 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 D IE N +V + + + I L++D SGSM DP+ Sbjct: 2709 DDGKGAIEGTFG---NDVLVGDLGGAADPVVKPAENYNIALILDRSGSMA----DDPDGS 2761 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 +++A LK+A+ F+ + + + + + T Sbjct: 2762 GGYG-------SRLALLKDAVNAFIGKLGTHTGQINIALISFSSSASLLLSGTLAQIQTA 2814 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 M T+ AM+QA + + + N Sbjct: 2815 LAAPN-NVLMALTASGATNYEAAMQQA-NAWFGGVEVNGYNNLAY--------------- 2857 Query: 335 FLTDGE----NNNFKSNVNTIK 352 FLTDG+ N + ++ +T+ Sbjct: 2858 FLTDGDPTTYNGDNSNSGSTVN 2879 >gi|90019771|ref|YP_525598.1| inter-alpha-trypsin inhibitor domain-containing protein [Saccharophagus degradans 2-40] gi|89949371|gb|ABD79386.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 763 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 77/216 (35%), Gaps = 41/216 (18%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + ++ +P + + + I VVD SGSM + T + Sbjct: 365 DYLLLMLLPPQGQQQHTQSLSRDIVFVVDTSGSM--------------------QGTSIQ 404 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSL 287 K +L L ++ +I + T ++ S V+ V+ + Sbjct: 405 QAKRSLQFALRGLNPSDTFN------IIEFDTSFSRFRSRPVSATASNVQAAVSWVNNLN 458 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ A+++A+ L S + ++ ++F+TDG N ++ Sbjct: 459 ADNGTEMYAALEEAFDQLASINPNGTENSKSSNNLQQ--------VVFITDGAVGNEQAL 510 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 ++ I + N ++ T++I ++PN ++ Sbjct: 511 LSLIHR----RLNNARLFTVAIGSAPNSY-FMRKAA 541 >gi|313203639|ref|YP_004042296.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442955|gb|ADQ79311.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 346 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 64/190 (33%), Gaps = 41/190 (21%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 L+F L+ +++ L+ E +R + + +D+S SM Sbjct: 57 LKFYFQLVVVFLLTIVLAQPQFGLKQEKEKRKGIEVMIALDVSNSMLA------------ 104 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 QD +++ K L +D ++ D +GL+ + + + + Sbjct: 105 ---QDVLPSRLENAKQILSKLVDDMN-------DDKVGLVVFAGDAYTQLPITVDYVSAK 154 Query: 278 QYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +++ L + T A+ A + + + K II + Sbjct: 155 MFLSNISPELVPRQGTAIGSALDLAIKSFGAKSEAG------------------KAIILI 196 Query: 337 TDGENNNFKS 346 TDGEN+ + Sbjct: 197 TDGENHEDDA 206 >gi|257127006|ref|YP_003165120.1| von Willebrand factor type A [Leptotrichia buccalis C-1013-b] gi|257050945|gb|ACV40129.1| von Willebrand factor type A [Leptotrichia buccalis C-1013-b] Length = 509 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 48/338 (14%), Positives = 109/338 (32%), Gaps = 43/338 (12%) Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +L + +E I + A +F + K + +N+ N + Sbjct: 51 DLPDIVYTYEGIVDAAPGIY-----QFPDTYEKNIVKKSDIW-----RENVQNELKKIKP 100 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDY-RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + D +F + YDY ++ ++ + + + + ++ E Sbjct: 101 ALGEDASDEEIERLFKKFLYIAGYDYEPIETLDRFSYVIFKDDMANPFT---KQKIEENM 157 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 +E+++D SGSM + +T M K ++ + + Sbjct: 158 NVNLEIILDASGSMKQKI---------------SDKTMMEIAKESIEKVVSEM----PAN 198 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK-QAYQILTSD 308 V GL + + + + + ++ K S ++ + + Sbjct: 199 TKV--GLRVFGHKGDNTASKKQESCSANELISPIETLDKDKLKSSLAPIQPTGWTSIAKS 256 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + +G K ++ +II TDG + V T K + I + I Sbjct: 257 IENGTNDLKALKGEKTLNIL---YII--TDGIETCDGNPVETAKKFKN-ENTDIVLGIIG 310 Query: 369 INASPNGQRLLKTCVSSPEYHY-NVVNADSLIHVFQNI 405 N + ++LK ++ +Y + +A L Q I Sbjct: 311 FNVDAHQNKVLKEIANAANGYYSSANDAAKLTEELQRI 348 >gi|225575062|ref|ZP_03783672.1| hypothetical protein RUMHYD_03151 [Blautia hydrogenotrophica DSM 10507] gi|225037732|gb|EEG47978.1| hypothetical protein RUMHYD_03151 [Blautia hydrogenotrophica DSM 10507] Length = 393 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 72/234 (30%), Gaps = 60/234 (25%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE----------RPIFLIELVVD 198 + +R F+ R I + + G+ + L++D Sbjct: 57 SFLAEGIHRRNFVIFRRGNRAITLIAAALTGFAVGAGGQFLYMLPPMENHSAVDMVLLLD 116 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 SGSM + K A L+ +D E + Sbjct: 117 GSGSM---------------------QGKKEPCVQATEALLEQMD------EQSRAQAVA 149 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 + + V N E E+ R+ + + ++ I+ T+ + A L K+ Sbjct: 150 FASCVLGNTELLPLDEEGRETLIKFVEGTDIIGGTEFGQPLTFALNSLEEKKETGRIQA- 208 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +I L+DGE T++ ++ KE + + TI ++A Sbjct: 209 ---------------VILLSDGEGPFP----ETLE--EEYKEKDVVLYTIRMDA 241 >gi|322688246|ref|YP_004207980.1| hypothetical protein BLIF_0055 [Bifidobacterium longum subsp. infantis 157F] gi|320459582|dbj|BAJ70202.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis 157F] Length = 362 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 70/210 (33%), Gaps = 38/210 (18%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VVD SGSM + +A + D+ + + DV + Sbjct: 187 VVDYSGSMSGEGKNGVVKGLNAALDPDQAKKSY----------------IEPASGDVNI- 229 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + T + ++ + GT + + TD + A L S+ + S +T Sbjct: 230 LIPFETEAHRPVKAT-GTS-TSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYT 287 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 I+ +TDG +N+ + K++ + I +I + Sbjct: 288 TA---------------IVLMTDGRSNSDHQDEFESAY--KSRGRDLPIFSIMFGDADPS 330 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 Q LK+ + + L VF+ + Sbjct: 331 Q--LKSLATLSNAKVFDGRSGDLAAVFRQV 358 >gi|327262910|ref|XP_003216266.1| PREDICTED: integrin alpha-2-like [Anolis carolinensis] Length = 1302 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 66/187 (35%), Gaps = 24/187 (12%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +A+ FL + L +GLI Y + + T K V ++ Sbjct: 310 DAVKAFLKKFVQGLDIGPTKTQVGLIQYGNNPREVFNLNTYTNK-EDAVQAMSETYQNGG 368 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T++ A++ A + S + K ++ +TDGE+++ Sbjct: 369 EYTNTFRAIEYARRYAFSKE-------------SGGRPSASKVMVVVTDGESHDGSKLQE 415 Query: 350 TIKIC--DKAKENFIKIVTISINASPNGQRLLK-----TCVSSPEYHYNVVNADSLIHVF 402 I C D I ++ I + + L+K + + +Y +NV + +L+ Sbjct: 416 VIAKCNEDNITRFGIAVLGYLIRNELDTKNLIKEIKGIASLPTSKYFFNVSSEAALLEEA 475 Query: 403 QNISQLM 409 + + + Sbjct: 476 GTLGERL 482 >gi|224090449|ref|XP_002195035.1| PREDICTED: integrin, alpha 1 [Taeniopygia guttata] Length = 1184 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 73/188 (38%), Gaps = 28/188 (14%) Query: 234 ALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKP 291 ++ FL+S+ + + +G++ Y V T + V + Sbjct: 191 SVTDFLNSLLRNMDIGPQQTQVGIVQYGQTVVHEFYL--NTYSTTEDVMAAASRIRQRGG 248 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A I T+ ++ + R+G QK ++ +TDGE N Sbjct: 249 TQTM----TALGIDTAREEAFTEAHGARRG-------VQKVMVIVTDGE---SHDNYRLQ 294 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHV 401 ++ D ++ I+ I+I S + L +K+ S P ++ +NV + +L+ + Sbjct: 295 EVIDDCEDENIQRFAIAILGSYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELALLTI 354 Query: 402 FQNISQLM 409 + + + + Sbjct: 355 VEALGERI 362 >gi|331019621|gb|EGH99677.1| serine/threonine-protein kinase PpkA [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 663 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 63/192 (32%), Gaps = 21/192 (10%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + A+ I + ++ V GL+ + + + + + Y + Sbjct: 253 IERTREAIAKVYAQIAKENLGRQ-VKFGLVAFRSSTQAVPGLEY---VTKMYADPNTVKD 308 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD-----GENN 342 +KQA S + S+ S ++++ +TD G++ Sbjct: 309 GADFLAKAAGLKQAKVSSKSFNEDSYAGVMQAIDKVDWSPFGARYVVLITDAGALDGDDK 368 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISI----------NASPNGQRLLKTCVSSPEYHYNV 392 + +N ++ +A + I T+ + A Q L ++ +Y V Sbjct: 369 LSGTGLNAEQVRIEASNPGVAIYTLHLKTAAGAKDHAKAEAQYQALSTYTGTNTSLYYPV 428 Query: 393 VNADSLIHVFQN 404 +A L F + Sbjct: 429 -DAGDLN-AFGS 438 >gi|260890922|ref|ZP_05902185.1| D-amino acid dehydrogenase large subunit [Leptotrichia hofstadii F0254] gi|260859475|gb|EEX73975.1| D-amino acid dehydrogenase large subunit [Leptotrichia hofstadii F0254] Length = 516 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 43/290 (14%), Positives = 98/290 (33%), Gaps = 33/290 (11%) Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY-RLQFIEHLLNQRYNQKIVSFI 177 +N+ N + + + D +F + YDY ++ ++ + +V+ Sbjct: 95 ENVQNELKKIKPALGEDASDEEIERLFKKFLYIAGYDYEPIETLDRFSYVIFKDDMVNPF 154 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + ++ E +E+++D SGSM + +T M K ++ Sbjct: 155 T---KQKIEENMNVNLEIILDASGSMKQKIG---------------DKTMMEIAKESIEK 196 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + + GL + + + + + ++ K S Sbjct: 197 VVSEM------PANTKAGLRVFGHKGDNTASKKQESCSANELISPIETLNKDKLKSSLAP 250 Query: 298 MK-QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 ++ + + + +G K ++ +II TDG + V T K Sbjct: 251 IQPTGWTSIAKSIENGTNDLKALKGEKTLNIL---YII--TDGIETCDGNPVETAKKFKN 305 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHY-NVVNADSLIHVFQNI 405 + I + I N + ++LK ++ +Y +V +A L Q I Sbjct: 306 -ENTDIVLGIIGFNVDAHQNKVLKEIANAANGYYSSVNDAAKLTEELQRI 354 >gi|152991131|ref|YP_001356853.1| hypothetical protein NIS_1388 [Nitratiruptor sp. SB155-2] gi|151422992|dbj|BAF70496.1| hypothetical protein [Nitratiruptor sp. SB155-2] Length = 928 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 33/156 (21%), Positives = 52/156 (33%), Gaps = 32/156 (20%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P F I L++D+SGSM T+++ A L L V Sbjct: 560 PQFNIMLILDVSGSMGWDSGDGT--------------TRLSKEVEAAQKLLQEYSKLGDV 605 Query: 249 KEDVYMGLIGYT-TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V + L + + I +W V + + + TD A+ A Q L Sbjct: 606 A--VKLVLFSSDVSNQAQYIPQNW--MSVDKAIGMLDNLYADGTTDYVNAIDGAMQ-LFD 660 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 K +FF N + F++DGE + Sbjct: 661 QKDGTFFDNGANRVY------------FMSDGEPSY 684 >gi|28870056|ref|NP_792675.1| ppkA-like protein [Pseudomonas syringae pv. tomato str. DC3000] gi|28853302|gb|AAO56370.1| ppkA-related protein [Pseudomonas syringae pv. tomato str. DC3000] Length = 653 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 63/192 (32%), Gaps = 21/192 (10%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + A+ I + ++ V GL+ + + + + + Y + Sbjct: 243 IERTREAIAKVYAQIAKENLGRQ-VKFGLVAFRSSTQAVPGLEY---VTKMYADPNTVKD 298 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD-----GENN 342 +KQA S + S+ S ++++ +TD G++ Sbjct: 299 GADFLAKAAGLKQAKVSSKSFNEDSYAGVMQAIDKVDWSPFGARYVVLITDAGALDGDDK 358 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISI----------NASPNGQRLLKTCVSSPEYHYNV 392 + +N ++ +A + I T+ + A Q L ++ +Y V Sbjct: 359 LSGTGLNAEQVRIEASNPGVAIYTLHLKTAAGAKDHAKAEAQYQALSTYTGTNTSLYYPV 418 Query: 393 VNADSLIHVFQN 404 +A L F + Sbjct: 419 -DAGDLN-AFGS 428 >gi|332809376|ref|XP_003308229.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 1 [Pan troglodytes] Length = 682 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 74/238 (31%), Gaps = 64/238 (26%) Query: 195 LVVDLSGSMHCA--MNSDPEDVNSAPICQDKKRTKMAAL----KNALLLFLDSIDLLSHV 248 LV+D SGSM +N + + + + + + ++ L I Sbjct: 72 LVLDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATVVNKLIQIKSSD-E 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + + GL Y T +K A+Q++ Sbjct: 131 RNTLMAGLPTY----------------------------PRGGTSICSGIKYAFQVIGEL 162 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTI 367 + + ++ LTDGE+N S C D+ K++ + I Sbjct: 163 HSQLDGSE----------------VLLLTDGEDNTASS-------CIDEVKQSGAIVHFI 199 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNA--DSLIHVFQNIS---QLMVHRKYSVILKG 420 ++ + + + ++ + Y A + LI F ++ + + + KG Sbjct: 200 ALGRDADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKG 257 >gi|315150339|gb|EFT94355.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0012] Length = 1103 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 53/150 (35%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLADSSITNN 308 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 309 -INMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|227547429|ref|ZP_03977478.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. infantis ATCC 55813] gi|227212076|gb|EEI79972.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. infantis ATCC 55813] Length = 362 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 70/210 (33%), Gaps = 38/210 (18%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VVD SGSM + +A + D+ + + DV + Sbjct: 187 VVDYSGSMSGEGKNGVVKGLNAALDPDQAKKSY----------------IEPASGDVNI- 229 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + T + ++ + GT + + TD + A L S+ + S +T Sbjct: 230 LIPFETEAHRPVKAT-GTS-TSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYT 287 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 I+ +TDG +N+ + K++ + I +I + Sbjct: 288 TA---------------IVLMTDGRSNSDHQDEFESAY--KSRGRDLPIFSIMFGDADPS 330 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 Q LK+ + + L VF+ + Sbjct: 331 Q--LKSLATLSNAKVFDGRSGDLAAVFRQV 358 >gi|327467286|gb|EGF12786.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK330] Length = 470 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 63/173 (36%), Gaps = 31/173 (17%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I V D SGSM M +V K +++ L+ + + + + +V ++ Sbjct: 200 ISFVFDSSGSMERDMKGRNTNVV--------KERRISILREKAIEMVKELKEIGNVSVNL 251 Query: 253 YMGLIGYTTRVEKNIE-PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 G Y V+K+ GTE++ + + T+ ++ L Sbjct: 252 -SGFSHYGFYVQKDFSQLDKGTEQIEKSINSLP---TRGVTNPGDGLRYGMVSLQ----- 302 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 +Q V++ K+++ LTDG N + + + +K +I Sbjct: 303 -------QQHVQL------KYVVLLTDGIPNAYIVDPSANYPGNKVWIRGGEI 342 >gi|256965511|ref|ZP_05569682.1| von Willebrand factor [Enterococcus faecalis HIP11704] gi|256956007|gb|EEU72639.1| von Willebrand factor [Enterococcus faecalis HIP11704] Length = 1154 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 319 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 358 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 359 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 418 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 419 GH------------------KKVIVLLTDG 430 >gi|198435896|ref|XP_002123489.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1595 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 73/223 (32%), Gaps = 50/223 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R I LVVD SGS VN + K+ L + S +L Sbjct: 413 RGKIDIVLVVDQSGS-----------VNQCNFQKVKRW---------LRDIVRSFNLGVT 452 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTE-------KVRQYVTRDMD--SLILKPTDSTPAM 298 ++ G++ Y+ + + G +Q +T+ + + T + A Sbjct: 453 EQDV---GVVVYSKKATTSTVVDLGFSDYDSDGHTKKQEMTKILKKLAYEGGTTYTGYAF 509 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 K A ++LT +K +K II LTDG + + + D ++ Sbjct: 510 KLANEMLTGNK---------------SRPDAKKMIILLTDGATTAA-NTLQLKEELDVSR 553 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + I+ + + Q L + + V L V Sbjct: 554 AANVMILAVGVG--KFNQTELIQIAGDRKNFFAVTKFSELEKV 594 >gi|308070278|ref|YP_003871883.1| hypothetical protein PPE_03528 [Paenibacillus polymyxa E681] gi|305859557|gb|ADM71345.1| Conserved hypothetical protein [Paenibacillus polymyxa E681] Length = 695 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 41/266 (15%), Positives = 84/266 (31%), Gaps = 60/266 (22%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE-----LVVDLSGSMHCAMNSDPE 212 + F+ +L + + I + + IE V+D S SM ++DPE Sbjct: 10 IVFMTFVLFLQMG--LNGVICTVNQANAASLGAVPIEGYDAVFVLDTSYSMR---DTDPE 64 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SW 271 +++ ++ + +G + Y V + S Sbjct: 65 GISAE-----------------VINMFMDLSDADR----TRVGFVAYNHNVVASKPLTSI 103 Query: 272 GTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + + +D+ +L TD +++ + G P Sbjct: 104 AVAAQKSQIQQDIRTLNRSGYTDLGLGLRR-------------GSELLAAGASQGRQP-- 148 Query: 331 KFIIFLTDGEN---------NNFKSNVNTIKICDKAKENFIKIVTISIN--ASPNGQRLL 379 F+I L+DGE + SN + + A+ + TI +N + N Q L Sbjct: 149 -FMILLSDGETDFGASSGSRSKGDSNNDVSSVIKSAQTKGYPVYTIGLNHDGTVNRQELE 207 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + +A+ L + I Sbjct: 208 RIASQTGGASFITSSAEDLPEILNRI 233 >gi|257415703|ref|ZP_05592697.1| von Willebrand factor [Enterococcus faecalis AR01/DG] gi|257157531|gb|EEU87491.1| von Willebrand factor [Enterococcus faecalis ARO1/DG] Length = 1154 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 319 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 358 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 359 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 418 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 419 GH------------------KKVIVLLTDG 430 >gi|237689767|gb|ACQ59162.1| factor B/C2 [Larimichthys crocea] Length = 755 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 47/329 (14%), Positives = 97/329 (29%), Gaps = 55/329 (16%) Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 + V Y+ + S + N + +Y T + Sbjct: 151 ASRAGNMFGIDDKVTYSCNSNLFLVGSSERVCQE-NGQWTGMEPECYYKHTYDTPVEISQ 209 Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 F + N ++ + P ++ + + I + VD+S S+ Sbjct: 210 AFGSAIKNTLTKEEPLDGSPEGRKLRISKEGTLNIYIAVDVSASIE-------------D 256 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKV 276 + + L + F +S E V +++ +E+ + W + V Sbjct: 257 KYISNTTSVIKKLIQKISSF-----TVSPNYEIV-----FFSSEIEEVVNILDFWEKKDV 306 Query: 277 RQYVTRDMDSL-----ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 R+ + +D+ TD ++ + S K N F+ + Sbjct: 307 RKDILEKLDTFGINDRSSTGTDLNLVFFTFWERM-SFIKTRVGANAFKDHRHV------- 358 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKA-------------KENFIKIVTISINASPNGQRL 378 II TDG N S T+ +E ++ + +I A L Sbjct: 359 -IILFTDGAYNMGGSPAPTLARIKDMVYMNQTGEKEVNSREEYLDVYVFAIGAETFDDDL 417 Query: 379 LKTCV--SSPEYHYNVVNADSLIHVFQNI 405 V + ++ + V D + F +I Sbjct: 418 KSFTVGKAEEKHFFKVKGIDKMYDAFDDI 446 >gi|271966806|ref|YP_003341002.1| von Willebrand factor, type A [Streptosporangium roseum DSM 43021] gi|270509981|gb|ACZ88259.1| von Willebrand factor, type A [Streptosporangium roseum DSM 43021] Length = 315 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 47/280 (16%), Positives = 89/280 (31%), Gaps = 60/280 (21%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + + LL + +R+ R I + +D+S SM A Sbjct: 52 WRRHVAPALFLLMMSLLIIGAARPADSVRVP---RDRATIIIALDISLSMEAA------- 101 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 D + ++ A K A F++ + E +G++ + + P+ Sbjct: 102 --------DVQPNRITAAKEAAQKFVEDL------PERFNVGVVAFARSASVVVSPTTDH 147 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + V + T A+ + L +F +Q V P I Sbjct: 148 QAVSA---SLGNLTTRAGTAIGEAV---FNSL------DAVRSFDQQAVTDPPPAA---I 192 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS--------------INASPNGQRLL 379 + L+DG+N + +S + D A + I TI+ +N N L Sbjct: 193 VLLSDGDNTSGRSVAEAV---DAAMSARVPISTIAYGTQEGTVSIDGRDVNVPVNKATLQ 249 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLM----VHRKYS 415 + Y + L V++ I + VH++ S Sbjct: 250 TLSEGTSGRAYEAESGSQLREVYEQIGTSLGYRTVHQEIS 289 >gi|315172113|gb|EFU16130.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1342] Length = 1103 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|257897779|ref|ZP_05677432.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] gi|257835691|gb|EEV60765.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] Length = 819 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 52/152 (34%), Gaps = 41/152 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM ++ +K + F+D++ S + + Sbjct: 312 LDLVLVVDWSGSM-------------------NDNNRIGEVKIGVDRFVDTLAD-SGITD 351 Query: 251 DVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + MG +GY+ + + V+ V S T + A++ A +L+ Sbjct: 352 KINMGYVGYSIEGYSYSNGAVQMGSFDSVKNQVKSITPSWTNGGTFTQKALRDAGNMLSV 411 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 412 PNGH------------------KKVIVLLTDG 425 >gi|257086444|ref|ZP_05580805.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis D6] gi|256994474|gb|EEU81776.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis D6] Length = 1154 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 319 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 358 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 359 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 418 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 419 GH------------------KKVIVLLTDG 430 >gi|257085650|ref|ZP_05580011.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis Fly1] gi|256993680|gb|EEU80982.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis Fly1] Length = 1154 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 319 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 358 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 359 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 418 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 419 GH------------------KKVIVLLTDG 430 >gi|121605804|ref|YP_983133.1| type 4 fimbrial biogenesis protein PilY1 [Polaromonas naphthalenivorans CJ2] gi|120594773|gb|ABM38212.1| type 4 fimbrial biogenesis protein PilY1 [Polaromonas naphthalenivorans CJ2] Length = 1148 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 57/180 (31%), Gaps = 26/180 (14%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + G P + + VD SGSM+ +N + + + ++ LKN+L + Sbjct: 70 VSGGNEPAPNVIISVDDSGSMNWDVNGNSTNTTNNK--------RITLLKNSLKSVFGNP 121 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV---------------RQYVTRDMDSL 287 V I +V N + G + R ++SL Sbjct: 122 TANPPTNGVVPDNRIRLAWQVMHNNGDASGASSLTPGNTNSMKAFSGTHRTNFNTFINSL 181 Query: 288 ILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS--LPFQKFIIFLTDGENNNF 344 T S KQA + S + + + + IF+TDG N+ Sbjct: 182 SASGDTPSHKMAKQALDYMKSPAGTNSPWADMPGTAQTTQYLSCRRSYHIFMTDGGWNSD 241 >gi|302796876|ref|XP_002980199.1| hypothetical protein SELMODRAFT_444452 [Selaginella moellendorffii] gi|300151815|gb|EFJ18459.1| hypothetical protein SELMODRAFT_444452 [Selaginella moellendorffii] Length = 550 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 28/231 (12%), Positives = 75/231 (32%), Gaps = 43/231 (18%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 R + V+D+SGSM + K+ +K A+ + ++ Sbjct: 75 AAADARAPVDLVTVLDVSGSM--------------------RGQKLELVKTAMEFVIRNL 114 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMKQA 301 + ++ ++ + ++ T ++ L T+ P +K Sbjct: 115 RQQD------RLAIVSFSDEPKVHLGLKRMTHDGRAAALSAVEKLRSLGGTEIRPGLKAG 168 Query: 302 YQILTSDKKRSFFTNFF-RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + +L+ K R+ ++ ++ F++ + D + ++ C Sbjct: 169 FDLLSRRKNRNPVSSIMLLSDGMDNAITFKRCKVLPVD----------SYLEDC----SE 214 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV 410 + + T + + + +L ++ V ++ H F I L+ Sbjct: 215 RVPVHTFGFGSDHDPEAMLSIAEATGGSFCYVQEESTVQHAFAQCIGGLLS 265 >gi|229546230|ref|ZP_04434955.1| pilus subunit protein [Enterococcus faecalis TX1322] gi|229308754|gb|EEN74741.1| pilus subunit protein [Enterococcus faecalis TX1322] Length = 1103 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|218186188|gb|EEC68615.1| hypothetical protein OsI_36984 [Oryza sativa Indica Group] Length = 585 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 59/170 (34%), Gaps = 37/170 (21%) Query: 183 IEMGERPIFLIELV--VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + F I+LV +D+SGSM A + T++ +K A+ + + Sbjct: 36 VAPAKTERFPIDLVAVLDVSGSMTKATSMHG-------------WTRLDLVKGAMKMVTN 82 Query: 241 SIDLLSHVKEDVYMGLIGYTTRV--EKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPA 297 + + ++ + +V T K R ++ L T PA Sbjct: 83 KLGAGD------RLAIVPFNGKVVAAGATRLMEMTTKGRADANAKVNQLKAGGDTKFLPA 136 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 +K A +L S + + FI L+DG++N + Sbjct: 137 LKHASGLLDSRPAGD-------KQYRPG------FIFLLSDGQDNGVLDD 173 >gi|108759903|ref|YP_633800.1| BatB protein [Myxococcus xanthus DK 1622] gi|108463783|gb|ABF88968.1| batB protein [Myxococcus xanthus DK 1622] Length = 343 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 69/189 (36%), Gaps = 43/189 (22%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + E+ +R + + +D S SM +D + +++ K L LD Sbjct: 85 KSELTKRRGIDVVVALDASKSMLA---------------RDIQPSRLERAKLELTTLLDE 129 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR-DMDSLILKPTDSTPAMKQ 300 + +GL+ + + V+ ++ D + + T+ A++ Sbjct: 130 LK-------GDRVGLVVFAGDAFIQSPLTSDYSAVKLFLRAVDPEVMPQGGTNVGAALRL 182 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + Q+L + + S ++ ++ LTDGE+ T + K++ Sbjct: 183 SRQVLENADRGSK----------------ERVVVLLTDGEDLVGDVAEAT----EALKDS 222 Query: 361 FIKIVTISI 369 ++++ + + Sbjct: 223 GVQVLAVGV 231 >gi|116750907|ref|YP_847594.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] gi|116699971|gb|ABK19159.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] Length = 268 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 59/176 (33%), Gaps = 36/176 (20%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +++D SGSM+ S RTK A K AL F +D M Sbjct: 83 VIMDSSGSMNEVRCSG-------------NRTKSEAAKTALAQFARI------TPKDANM 123 Query: 255 GLIGYTTR-VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 GL + + + + G E +++ + T A+ Y+ L +R Sbjct: 124 GLAVFDAYGIAERVPL--GLENRDKFIAAVNATAPGNGTPLHDALLLGYRRLEETARRQA 181 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + ++ +TDG ++ T + +++ + I TI Sbjct: 182 GYGEYH-------------LVVITDG-QAYPQNQDPTPVVAYILRQSPVVIHTIGF 223 >gi|312899566|ref|ZP_07758892.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0470] gi|311293245|gb|EFQ71801.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0470] Length = 1103 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|307271265|ref|ZP_07552544.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0855] gi|306512014|gb|EFM81005.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0855] Length = 1103 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|307291074|ref|ZP_07570959.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0411] gi|306497728|gb|EFM67260.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0411] Length = 1103 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|296225414|ref|XP_002758468.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 [Callithrix jacchus] Length = 904 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 55/176 (31%), Gaps = 48/176 (27%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + A + +D LS + Sbjct: 275 VVFVIDKSGSMSGR--------------------KIQQTREA---LIKILDDLSPRDQF- 310 Query: 253 YMGLIGYTTRVEKNIE-----PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI +++ + + K R + L T+ A+ A Q+L Sbjct: 311 --NLITFSSEATQWSPSLVPASAENVNKARSFAAAIHA---LGGTNINDAVLMAVQLL-- 363 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNTIKICDKAKEN 360 + + + S+ II LTDG+ SN T + +E Sbjct: 364 ------DRSNREERLPTRSVSL---IILLTDGDPTVGEGPASNSKTRCTGENVREA 410 >gi|257082947|ref|ZP_05577308.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis E1Sol] gi|256990977|gb|EEU78279.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis E1Sol] Length = 1148 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 313 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 352 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 353 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 412 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 413 GH------------------KKVIVLLTDG 424 >gi|257078107|ref|ZP_05572468.1| von Willebrand factor [Enterococcus faecalis JH1] gi|256986137|gb|EEU73439.1| von Willebrand factor [Enterococcus faecalis JH1] Length = 1154 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 319 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 358 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 359 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 418 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 419 GH------------------KKVIVLLTDG 430 >gi|256762099|ref|ZP_05502679.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T3] gi|256683350|gb|EEU23045.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T3] Length = 1154 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 319 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 358 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 359 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 418 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 419 GH------------------KKVIVLLTDG 430 >gi|255976231|ref|ZP_05426817.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T2] gi|255969103|gb|EET99725.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T2] Length = 1154 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 319 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 358 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 359 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 418 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 419 GH------------------KKVIVLLTDG 430 >gi|162454087|ref|YP_001616454.1| hypothetical protein sce5811 [Sorangium cellulosum 'So ce 56'] gi|161164669|emb|CAN95974.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 907 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 26/215 (12%), Positives = 66/215 (30%), Gaps = 44/215 (20%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P + LV+D SGSM A + + + A +D + + Sbjct: 500 PHLSVHLVLDTSGSMAGA--------------------PIDSARRAAQALVDRL---APA 536 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + V ++ +R+ + + T+ + Y Sbjct: 537 DDFSLTTFSSDAEVVIEDGPVGPRRAAIRRAIEGLREG---GGTNIGAGLSLGY------ 587 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD-KAKENFIKIVTI 367 R G+ ++ + ++ ++DG + ++ + A + I+ + Sbjct: 588 ------AQASRPGIPEDAV---RVVLLVSDGRATSGLTHSERLAWLALDAFQRGIQTSAL 638 Query: 368 SINASPNGQRLLKTCVSS-PEYHYNVVNADSLIHV 401 + +GQ L+ S +Y + + + + Sbjct: 639 GLGDDFDGQ-LMSAIASDGAGGYYYLRHPEQIAPA 672 >gi|15827966|ref|NP_302229.1| hypothetical protein ML1808 [Mycobacterium leprae TN] gi|221230443|ref|YP_002503859.1| hypothetical protein MLBr_01808 [Mycobacterium leprae Br4923] gi|81536900|sp|Q9CBL9|Y1808_MYCLE RecName: Full=UPF0353 protein ML1808 gi|254800638|sp|B8ZS82|Y1808_MYCLB RecName: Full=UPF0353 protein MLBr01808 gi|13093519|emb|CAC30761.1| possible membrane protein [Mycobacterium leprae] gi|219933550|emb|CAR71903.1| possible membrane protein [Mycobacterium leprae Br4923] Length = 335 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 72/234 (30%), Gaps = 51/234 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+S SM D + +MAA + A F + + Sbjct: 99 VMLVIDVSQSM---------------RATDVEPNRMAAAQEAAKQFAGELTPG------I 137 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI Y + P+ + + + + T + A+ A Q + + Sbjct: 138 NLGLIAYAGTATVLVSPTTNRYATKNALDKLQFADR---TATGEAIFTALQAIATVGAV- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICD-KAKENFIKIVTISI 369 + +P I+ +DG+ N AK+ + I TIS Sbjct: 194 ---------IGGGEMPPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISF 244 Query: 370 N--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + + K S YN L V+ ++ Q + Sbjct: 245 GTVYGFVEINGQRQPVPVDDETMKKVAQLSGGNSYNAATLAELKAVYASLQQQI 298 >gi|315028044|gb|EFT39976.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2137] Length = 1103 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|149176866|ref|ZP_01855476.1| BatB [Planctomyces maris DSM 8797] gi|148844303|gb|EDL58656.1| BatB [Planctomyces maris DSM 8797] Length = 798 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 27/205 (13%), Positives = 60/205 (29%), Gaps = 54/205 (26%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + ++D+S SM D +++ K + +D + Sbjct: 88 QKGIEVMFLLDVSRSMLAE---------------DVSPSRLDRAKQQIKDMVDEM-SGD- 130 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILT 306 +GL+ + +++ + E +Q + S+ + A++ A Sbjct: 131 -----RVGLVVFAGETRQSVPLTSHYEDFKQSLDAVGPHSVRRGGSLLGDAIRSATAGFI 185 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K I+ TDGE+ K + +N I+I T Sbjct: 186 DKTND------------------HKAIVVFTDGEDQESKPVEAAK---EAFTKNGIRIFT 224 Query: 367 ISI----------NASPNGQRLLKT 381 + + GQ ++ Sbjct: 225 VGLGDMDQGARIPETEQGGQAFVEY 249 >gi|152999639|ref|YP_001365320.1| von Willebrand factor type A [Shewanella baltica OS185] gi|151364257|gb|ABS07257.1| von Willebrand factor type A [Shewanella baltica OS185] Length = 642 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 37/231 (16%), Positives = 71/231 (30%), Gaps = 52/231 (22%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + ++D+SGSM K+ L+ AL L + V Sbjct: 236 GASNLVFLLDVSGSMAST-------------------DKLPLLQTALKLLTAQLSAQDKV 276 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 VY G G ++ + T+ + QAYQ Sbjct: 277 SIVVYAGAAG--VVLDGASG-----NDTQTLTYALEQLSAGGSTNGGQGITQAYQ----L 325 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIVTI 367 K+ F N + +I TDG+ N ++ + + + +K K + I + T+ Sbjct: 326 AKKHFIPNGINR------------VILATDGDFNVGVTDFDDLTALIEKEKAHGIGLTTL 373 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNA---------DSLIHVFQNISQLM 409 +L++ +Y ++ D L I++ + Sbjct: 374 GFGLGNYNDQLMEQLADKGNGNYAYIDTLNEARKVLVDELSSTLFTIAKDV 424 >gi|327534722|gb|AEA93556.1| von Willebrand factor type A domain protein [Enterococcus faecalis OG1RF] Length = 1103 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|302536534|ref|ZP_07288876.1| VWA domain-containing protein [Streptomyces sp. C] gi|302445429|gb|EFL17245.1| VWA domain-containing protein [Streptomyces sp. C] Length = 532 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 30/225 (13%), Positives = 64/225 (28%), Gaps = 60/225 (26%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM ++ ++ L++AL + G Sbjct: 349 VLDTSGSM-------------------EEGDRIGRLRSALTDLTGTGSSG--------TG 381 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--------------KPTDSTPAMKQA 301 + +KV +V + T ++K A Sbjct: 382 QRFRDREEVTLLPFGDKVKKVLTHVVEPGNPGPALDAIRGDVKSLRPEGGTAVYASLKAA 441 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 YQ L +F + I+ +TDG++ + + ++ + Sbjct: 442 YQHLGEGNADAFTS-----------------IVLMTDGQSGDKVKDFDSFYAGLPEAQKR 484 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD-SLIHVFQNI 405 + + S + L + ++ + + SL F+ I Sbjct: 485 TPVFAVLFGDSDR-KELTHITELTGGRLFDATDGNSSLAGAFEEI 528 >gi|257889535|ref|ZP_05669188.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|257825895|gb|EEV52521.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] Length = 857 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 53/152 (34%), Gaps = 41/152 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM ++ +K + F+D++ S + + Sbjct: 19 LDLVLVVDWSGSM-------------------NDNNRIGEVKIGVDRFVDTLAD-SGITD 58 Query: 251 DVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + MG +GY++ + + V+ V S T + A++ A +L+ Sbjct: 59 KINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSV 118 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 119 PNGH------------------KKVIVLLTDG 132 >gi|242042273|ref|XP_002468531.1| hypothetical protein SORBIDRAFT_01g047480 [Sorghum bicolor] gi|241922385|gb|EER95529.1| hypothetical protein SORBIDRAFT_01g047480 [Sorghum bicolor] Length = 650 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 59/185 (31%), Gaps = 51/185 (27%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPI--FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 N I + G+ P + V+D+SGSM A Sbjct: 157 NFAVLVHIKAPGMTDAEAAAGDAPRAPLDLVTVLDVSGSMIGA----------------- 199 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP----SWGTEKVRQY 279 K+A LK A+ +D++ + ++ ++ R + G + Sbjct: 200 ---KLALLKQAMGFVIDNLGPHD------RLSVVSFSDRARRVTRLLRMSGDGKAAAKSA 250 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V + T+ ++ A ++L + R+ ++ +I L+DG Sbjct: 251 VESLVA---RGGTNIAEGLRTAARVLEERRHRNTVSS----------------VILLSDG 291 Query: 340 ENNNF 344 ++ Sbjct: 292 QDTYT 296 >gi|329573764|gb|EGG55354.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1467] Length = 1103 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|315168359|gb|EFU12376.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1341] Length = 1103 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|307278884|ref|ZP_07559945.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] gi|306504433|gb|EFM73642.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] Length = 1103 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|294780961|ref|ZP_06746313.1| von Willebrand factor type A domain protein [Enterococcus faecalis PC1.1] gi|307268799|ref|ZP_07550167.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4248] gi|307286933|ref|ZP_07567011.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|294451907|gb|EFG20357.1| von Willebrand factor type A domain protein [Enterococcus faecalis PC1.1] gi|306501991|gb|EFM71279.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|306514927|gb|EFM83474.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4248] gi|315031654|gb|EFT43586.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0017] gi|315034905|gb|EFT46837.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0027] gi|315165584|gb|EFU09601.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1302] Length = 1103 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|223939936|ref|ZP_03631804.1| von Willebrand factor type A [bacterium Ellin514] gi|223891427|gb|EEF57920.1| von Willebrand factor type A [bacterium Ellin514] Length = 346 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 58/179 (32%), Gaps = 37/179 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV DLS SM ++ + A + D ++ + Sbjct: 91 DIMLVFDLSWSMMVLDMGGHDETGTRFG-------------IASAVLEDFVNKRPND--- 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKK 310 +GLI ++ + + + + + R +I + T A A Sbjct: 135 -RIGLIVFSGVPYLASPLTLNHDWLVENLHRLHIGIIRELGTAIGDATAAA--------- 184 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +++ + II LTDG+NN + + + A KI TI + Sbjct: 185 --------TKRLQMSKDSKSRIIILLTDGDNN--QGEIEPVPAAQLAAAIGAKIYTIGL 233 >gi|66045865|ref|YP_235706.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] gi|63256572|gb|AAY37668.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] Length = 663 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 68/193 (35%), Gaps = 23/193 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDS 286 + + A+ I + ++ V GL+ + + + + T+ +D Sbjct: 253 IERTREAISKVYAQIAKENLGRQ-VKFGLVAFRSSTQAVPGLEYVTKMYADPNTVKDGAD 311 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD-----GEN 341 + K D +KQA S + S+ S ++++ +TD G++ Sbjct: 312 FLAKAAD----LKQAKVSSKSFNEDSYAGVMQAIDKVDWSPFGARYVVLITDAGALDGDD 367 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISI----------NASPNGQRLLKTCVSSPEYHYN 391 + +N ++ +A + I T+ + NA Q L ++ +Y Sbjct: 368 KLSGTGLNAEQVRIEASNPGVAIYTLHLKTAAGAKDHANAEAQYQALSTYTGTNTSLYYP 427 Query: 392 VVNADSLIHVFQN 404 V +A L F + Sbjct: 428 V-DAGDL-DAFGS 438 >gi|315148427|gb|EFT92443.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4244] Length = 1103 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|283778201|ref|YP_003368956.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283436654|gb|ADB15096.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 786 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 45/283 (15%), Positives = 85/283 (30%), Gaps = 74/283 (26%) Query: 106 LSGYSAVFYNTEIQNIV------NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + +A + NT I+ + +S I + + +D + V L Sbjct: 199 IPMATARYTNTPIEKVSLEATIESSIAIKSVYSPTHAVDVK-RPDEKHATVKFEASNYLP 257 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE------------------LVVDLSG 201 + L ++ R + + FL+ VVD SG Sbjct: 258 TTDFRLLYDVGDAPLAASVLSYRPDNSDEGFFLMLASPNHSQGEVDLTKKTVIFVVDRSG 317 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 SM K+ + A+ L+++ ++ Y + Sbjct: 318 SMQGK--------------------KIEQAREAMRYVLNNLHEGDTFN------IVAYDS 351 Query: 262 RVEKNIE-PSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFR 319 VE + R+ +D L T+ + A+ A+ +LT + + Sbjct: 352 TVESFKPELQKFDDATRKSALAYVDGLYAGGSTNISGALDSAFAMLTGSDRPN------- 404 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 +I+FLTDG N KI + AK+ + Sbjct: 405 ------------YILFLTDG--LPTAGETNEGKIVELAKQKNV 433 >gi|147834997|emb|CAN61381.1| hypothetical protein VITISV_037547 [Vitis vinifera] Length = 1324 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 59/158 (37%), Gaps = 43/158 (27%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + V+D+SGSM TK+A LK A+ + S+ Sbjct: 274 RAPVDLVTVLDVSGSMAG--------------------TKLALLKRAMGFVIQSLGPCD- 312 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILT 306 + +I +++ + T+ RQ + ++SLI T+ +++ +++ Sbjct: 313 -----RLSVISFSSTARRLFPLRRMTDTGRQQALQAVNSLISNGGTNIAEGLRKGAKVML 367 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 K ++ ++ II L+DG++ Sbjct: 368 DRKWKNPVSS----------------IILLSDGQDTYT 389 >gi|257089500|ref|ZP_05583861.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis CH188] gi|256998312|gb|EEU84832.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis CH188] Length = 1154 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 319 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 358 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 359 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 418 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 419 GH------------------KKVIVLLTDG 430 >gi|154089852|emb|CAO81741.1| collagen type VI alpha 6 [Homo sapiens] Length = 840 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 55/353 (15%), Positives = 108/353 (30%), Gaps = 66/353 (18%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 SN LG E+I A F + ++++ + + N Sbjct: 493 SNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQRGNK-VPCHLVVLTNGMSKDS 551 Query: 130 THMANNRL------------DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 NRL +N T + Y + + L + R + Sbjct: 552 ILEPANRLREEHIRVYAIGIKEANQTQLREIAGEEKRVYYVHDFDALKDIRN-----QVV 606 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + E + I +VD SGS+ + +KM K + Sbjct: 607 QEICTEEACKEMKADIMFLVDSSGSIG-----------------PENFSKM---KTFMKN 646 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + + V +G++ ++ ++ + R M + ++ Sbjct: 647 LVSKSQIGP---DRVQIGVVQFSDINKEEFQL-----------NRFMSQSDIS--NAIDQ 690 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 M Q + SF + +F K +KF+I +TDGE + + Sbjct: 691 MAHIGQTTLTGSALSFVSQYF-SPTKGARPNIRKFLILITDGEAQDIVKEPAVV-----L 744 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 ++ + I ++ + S Q L+ PE + V N + Q I +V Sbjct: 745 RQEGVIIYSVGVFGSNVTQ--LEEISGRPEMVFYVEN----FDILQRIEDDLV 791 >gi|46395320|dbj|BAD16597.1| DEC-1 [Lymnaea stagnalis] Length = 919 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 54/169 (31%), Gaps = 20/169 (11%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 K+DV G + + + VEK + V + + TD+ A++ Sbjct: 467 GKDDVRYGALIFGSNVEKLFDLKKYDSPVDVEQHIMEATYLASSTDTAAALQYILDQRMF 526 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 ++ K II LTDGE+ + T K + +++I Sbjct: 527 ADEQG------------GRPDAVKIIIVLTDGESTYPE---KTRAEATKLQSLGYHMMSI 571 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I N L S+ + + V + + +V R + Sbjct: 572 GIGNEINELE-LNALASNTSNIFKAAS----YQVLDQLHKEVVTRACQI 615 >gi|229819442|ref|YP_002880968.1| von Willebrand factor A [Beutenbergia cavernae DSM 12333] gi|229565355|gb|ACQ79206.1| von Willebrand factor type A [Beutenbergia cavernae DSM 12333] Length = 647 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 76/241 (31%), Gaps = 48/241 (19%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + A+ R + +VDLS SM +T++ ++A+ Sbjct: 440 VVAAVAEAFPEVRKDAQVLFLVDLSASMGYD--------------NTAGQTRLEGAQHAI 485 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 LD +GL G+TT +T + + + D+ Sbjct: 486 TAALDHFTAGD------RVGLAGFTT--------------TDGTITPGLVAPVADIADNR 525 Query: 296 PAMKQAYQILTSDKKR---SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 A+ + L +F +Q + I+ L+DG N ++ TI Sbjct: 526 GALVEGVNSLEPVAHTPLYQAVADFAQQQAAMWDPDRINAIVLLSDGVN--ATGDIETIG 583 Query: 353 ICDKAK-------ENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQN 404 D E + + T+ + L+ S+ ++Y+ + + V + Sbjct: 584 QDDMIHVLHGLHAETPVLVFTLGYS-PDADVETLQAISSATGAHYYDATDPTEVEAVLGD 642 Query: 405 I 405 + Sbjct: 643 L 643 >gi|170700850|ref|ZP_02891839.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] gi|170134258|gb|EDT02597.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] Length = 423 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 20/127 (15%), Positives = 55/127 (43%), Gaps = 3/127 (2%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++G+ ++ +II L++ + +G + + + ++ ++++ ++ L+ A + S Sbjct: 12 RRGLHRQQGAVAIIVGLALAMMIGFVGLALDLGKLYVTRSELQNSADSCALSAARDLTSA 71 Query: 72 LSRLGDRFESISN-HAKRALI--DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 +S + I+ HA A + + +++ S S + T + N N + Sbjct: 72 ISLQVAEADGIAAGHANFAFFQQNAVQMQTDSNVTFSDSLTNPFLTKTAVANPANVKYVK 131 Query: 129 MTHMANN 135 T +N Sbjct: 132 CTAQLSN 138 >gi|292618048|ref|XP_699485.3| PREDICTED: integrin alpha-1-like [Danio rerio] Length = 1201 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 62/170 (36%), Gaps = 28/170 (16%) Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ Y V S ++ V D + A+ + + +K + Sbjct: 202 RVGIVSYGDDVGHVFNLS-QFSNTKELVKNAADIRQRTGHKTMTALG-----IDTARKEA 255 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 F + +K ++ +TDGE+++ + + I C +E+ I+ +++ Sbjct: 256 FTVERGARPG------VKKVMVIVTDGESHDHHNLKSVIDQC---QEDGIERFAVAVLGD 306 Query: 373 PNGQRL-----------LKTCVSS--PEYHYNVVNADSLIHVFQNISQLM 409 N Q ++ S ++ +NV + +L+ + + + Sbjct: 307 YNRQNKSIDEIKKFIEEIEYIASETKSDHFFNVSDERALVTIVDTLGSKI 356 >gi|256962322|ref|ZP_05566493.1| von Willebrand factor [Enterococcus faecalis Merz96] gi|256952818|gb|EEU69450.1| von Willebrand factor [Enterococcus faecalis Merz96] Length = 1154 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 319 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 358 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 359 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 418 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 419 GH------------------KKVIVLLTDG 430 >gi|256618671|ref|ZP_05475517.1| von Willebrand factor [Enterococcus faecalis ATCC 4200] gi|256598198|gb|EEU17374.1| von Willebrand factor [Enterococcus faecalis ATCC 4200] Length = 1154 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 319 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 358 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 359 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 418 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 419 GH------------------KKVIVLLTDG 430 >gi|255973237|ref|ZP_05423823.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T1] gi|257418922|ref|ZP_05595916.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T11] gi|255964255|gb|EET96731.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T1] gi|257160750|gb|EEU90710.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T11] Length = 1154 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 319 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 358 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 359 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 418 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 419 GH------------------KKVIVLLTDG 430 >gi|262384208|ref|ZP_06077344.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|262295106|gb|EEY83038.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] Length = 1137 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 25/157 (15%), Positives = 53/157 (33%), Gaps = 12/157 (7%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYT----TRVEKNIEPSWGTEKVRQYVTRDMDS 286 K L + V + G + + + +++ +W V+ + R + S Sbjct: 14 AKKGEATLLGASSNYLSVNTETNYGALSFDPTTLSDLKELNPATWTVSNVKTSLGRTLYS 73 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF--------QKFIIFLTD 338 + K T T ++ A I D +S S + ++ D Sbjct: 74 FVNKATGLTLSVDPATAIAVDDNNKSQKIANAAPLALGGSASEWVVETVSGKDILVSYID 133 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 GEN + + +KAK++ I I++++ Sbjct: 134 GENVVYIAQSGNKLYLEKAKKSAISTAAIALSSDQAS 170 >gi|300860089|ref|ZP_07106177.1| von Willebrand factor type A domain protein [Enterococcus faecalis TUSoD Ef11] gi|295112663|emb|CBL31300.1| Cna protein B-type domain./von Willebrand factor type A domain. [Enterococcus sp. 7L76] gi|300850907|gb|EFK78656.1| von Willebrand factor type A domain protein [Enterococcus faecalis TUSoD Ef11] gi|315144603|gb|EFT88619.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2141] Length = 1103 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|229116005|ref|ZP_04245400.1| hypothetical protein bcere0017_22960 [Bacillus cereus Rock1-3] gi|228667499|gb|EEL22946.1| hypothetical protein bcere0017_22960 [Bacillus cereus Rock1-3] Length = 452 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 81/223 (36%), Gaps = 29/223 (13%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + + +E+++D SGSM +N + KM A K A+ +LD Sbjct: 142 VHSKKPKEKSLNVEILLDASGSMAGKVNG---------------QVKMEAAKKAIYNYLD 186 Query: 241 SIDLLSHVKEDVYMGLIGYT-TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 I +V + + G+ + E + S G+ +V + + Sbjct: 187 KI----PNNANVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYKKEQFNAALSNFGP-- 240 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + + L S + G + ++ +I+ +DGE VN K +++ Sbjct: 241 KGWTPLASAIESVNNDFKEYTGEENLNVV---YIV--SDGEETCGGDPVNAAKNLNQSST 295 Query: 360 NFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHV 401 + + + I + + Q+ LK + + V NA+ L Sbjct: 296 HAV-VNIIGFDVKNSEQQQLKNTAEAGKGNYATVSNAEELYQT 337 >gi|268572467|ref|XP_002648969.1| Hypothetical protein CBG21291 [Caenorhabditis briggsae] Length = 427 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 65/220 (29%), Gaps = 47/220 (21%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LV+D S + P+ + K+ + + + H Sbjct: 240 LDLVLVLDFS-------------TTTDPVYN--------SYKDLSKRLVQQLKIGPH--- 275 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSD 308 + + + T + + ++ V R +D L K T +++A Sbjct: 276 YTQVAAVTFATVGRTRVRFNLKKYSTQEEVLRGIDKLQSKGGTTAIGAGIEKA------- 328 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 T P + + I+F TDG +N + A ++ T++ Sbjct: 329 -----LTQIDESEGARPGIATKVMIVF-TDGWSNKGPDPEKRAR---DAVNAGFEMYTVA 379 Query: 369 INASPNG-----QRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 A G L S + + V +L+ + Sbjct: 380 YTARAPGSVTLNNETLSAISGSSGHAFTDVTFQTLVDKIK 419 >gi|328882566|emb|CCA55805.1| putative exported protein [Streptomyces venezuelae ATCC 10712] Length = 543 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 69/206 (33%), Gaps = 46/206 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 GERP + VVD+SGSM + ++ +K +L L Sbjct: 166 TGERPPAALTFVVDISGSMA-------------------EPGRLDLVKESLGLL------ 200 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQ 303 +++D + L+ ++ E + + + R V ++SL T+ ++ Y Sbjct: 201 ADELRDDDSIALVTFSDEAETRLPMT-RVGEARGRVREVVNSLATTSSTNVEAGVRTGYD 259 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD--GENNNFKSNVNTIKICDKAKENF 361 + ++ ++ L+D ++ +I ++ K Sbjct: 260 VAVDGHRKDATNR----------------VVLLSDALANTGATEAGAILERIEEERKAYG 303 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE 387 I + + + S G ++ + Sbjct: 304 ITLFGVGV-GSDYGDAFMERLADRGD 328 >gi|317483399|ref|ZP_07942390.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915154|gb|EFV36585.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 401 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 69/208 (33%), Gaps = 38/208 (18%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VVD SGSM + +A + D+ + + DV + Sbjct: 226 VVDYSGSMSGEGKNGVVKGLNAALDPDQAKKSY----------------IEPASGDVNI- 268 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + T + ++ + GT + + TD + A L S+ + S +T Sbjct: 269 LIPFETEAHRPVKAT-GTS-TSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYT 326 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 I+ +TDG +N+ + K++ + I +I + Sbjct: 327 TA---------------IVLMTDGRSNSDHQDEFESSY--KSRGRDLPIFSIMFGDADPS 369 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 Q LK+ + + L VF+ Sbjct: 370 Q--LKSLATLSNAKVFDGRSGDLAAVFR 395 >gi|225435355|ref|XP_002285271.1| PREDICTED: hypothetical protein isoform 2 [Vitis vinifera] Length = 670 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 59/158 (37%), Gaps = 43/158 (27%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + V+D+SGSM TK+A LK A+ + S+ Sbjct: 274 RAPVDLVTVLDVSGSMAG--------------------TKLALLKRAMGFVIQSLGPCD- 312 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILT 306 + +I +++ + T+ RQ + ++SL+ T+ +++ +++ Sbjct: 313 -----RLSVISFSSTARRLFPLRRMTDTGRQQALQAVNSLVSNGGTNIAEGLRKGAKVML 367 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 K ++ ++ II L+DG++ Sbjct: 368 DRKWKNPVSS----------------IILLSDGQDTYT 389 >gi|225435353|ref|XP_002285265.1| PREDICTED: hypothetical protein isoform 1 [Vitis vinifera] Length = 729 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 59/158 (37%), Gaps = 43/158 (27%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + V+D+SGSM TK+A LK A+ + S+ Sbjct: 274 RAPVDLVTVLDVSGSMAG--------------------TKLALLKRAMGFVIQSLGPCD- 312 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILT 306 + +I +++ + T+ RQ + ++SL+ T+ +++ +++ Sbjct: 313 -----RLSVISFSSTARRLFPLRRMTDTGRQQALQAVNSLVSNGGTNIAEGLRKGAKVML 367 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 K ++ ++ II L+DG++ Sbjct: 368 DRKWKNPVSS----------------IILLSDGQDTYT 389 >gi|315160966|gb|EFU04983.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0645] Length = 1103 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|312903619|ref|ZP_07762795.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0635] gi|310632972|gb|EFQ16255.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0635] gi|315577195|gb|EFU89386.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0630] Length = 1103 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|293383944|ref|ZP_06629845.1| von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|293387429|ref|ZP_06631983.1| von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|312908057|ref|ZP_07767039.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|312910783|ref|ZP_07769620.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] gi|291078704|gb|EFE16068.1| von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|291083151|gb|EFE20114.1| von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|310625945|gb|EFQ09228.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|311288927|gb|EFQ67483.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] Length = 1103 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|227552873|ref|ZP_03982922.1| pilus subunit protein [Enterococcus faecalis HH22] gi|229550416|ref|ZP_04439141.1| pilus subunit protein [Enterococcus faecalis ATCC 29200] gi|257422992|ref|ZP_05599982.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis X98] gi|312952910|ref|ZP_07771770.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0102] gi|227178004|gb|EEI58976.1| pilus subunit protein [Enterococcus faecalis HH22] gi|229304444|gb|EEN70440.1| pilus subunit protein [Enterococcus faecalis ATCC 29200] gi|257164816|gb|EEU94776.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis X98] gi|310629158|gb|EFQ12441.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0102] gi|315154335|gb|EFT98351.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0031] gi|315156569|gb|EFU00586.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0043] gi|315573683|gb|EFU85874.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309B] gi|315582702|gb|EFU94893.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309A] gi|323480330|gb|ADX79769.1| Endocarditis and Biofilm-Associated Pilus subunitA [Enterococcus faecalis 62] Length = 1103 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|110832907|ref|YP_691766.1| von Willebrand factor type A domain-containing protein [Alcanivorax borkumensis SK2] gi|110646018|emb|CAL15494.1| protein containing a von Willebrand factor type A domain [Alcanivorax borkumensis SK2] Length = 698 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 51/327 (15%), Positives = 98/327 (29%), Gaps = 61/327 (18%) Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 ++ + +L +A + A N S T+ ++ V Sbjct: 219 IQQLGSQQWQVTLDNRNASTTIKGKGR-SEKGDNNFAPPATNGHPPSAFTLDQDIVVYWR 277 Query: 154 YDYRLQFIEHL-----LNQRYNQKIVSFIPA--LLRIEMGERPIFLIELVVDLSGSMHCA 206 + L L + ++S P L I G +F V+D+SGSM+ Sbjct: 278 HQQDLPGSVDLVAYKAPGKDRGTFMLSITPGDDLPPITTGSDWVF----VLDISGSMNAK 333 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 + + + V AL I L E++ G + T Sbjct: 334 LATLGDGVRQ-------------ALGKLRGNDRFRIVLFDDRAEELTSGFVDAT------ 374 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 +RQY + M T+ + A L +D+ Sbjct: 375 ------PNNIRQYTQKIMQLQSRGGTNLFGGLSLALTPLDADRPTG-------------- 414 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 I+ +TDG N K D + + +++ T + + + +L + Sbjct: 415 ------IVLVTDGVANVGK--TRQKDFIDLLENHDVRLFT-FVMGNSANRPMLTAMTDAS 465 Query: 387 EYH-YNVVNADSLIHVFQNISQLMVHR 412 +V N+D + N + + H+ Sbjct: 466 NGFAISVSNSDDIAGQILNATSKVTHQ 492 >gi|46190338|ref|ZP_00121620.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium longum DJO10A] gi|189440236|ref|YP_001955317.1| hypothetical protein BLD_1374 [Bifidobacterium longum DJO10A] gi|189428671|gb|ACD98819.1| Hypothetical protein BLD_1374 [Bifidobacterium longum DJO10A] Length = 380 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 70/210 (33%), Gaps = 38/210 (18%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VVD SGSM + +A + D+ + + DV + Sbjct: 205 VVDYSGSMSGEGKNGVVKGLNAALDPDQAKKSY----------------IEPASGDVNI- 247 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + T + ++ + GT + + TD + A L S+ + S +T Sbjct: 248 LIPFETEAHRPVKAT-GTS-TSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYT 305 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 I+ +TDG +N+ + K++ + I +I + Sbjct: 306 TA---------------IVLMTDGRSNSDHQDEFESAY--KSRGRDLPIFSIMFGDADPS 348 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 Q LK+ + + L VF+ + Sbjct: 349 Q--LKSLATLSNAKVFDGRSGDLAAVFRQV 376 >gi|322690259|ref|YP_004219829.1| hypothetical protein BLLJ_0067 [Bifidobacterium longum subsp. longum JCM 1217] gi|320455115|dbj|BAJ65737.1| conserved hypothetical protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 380 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 70/210 (33%), Gaps = 38/210 (18%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VVD SGSM + +A + D+ + + DV + Sbjct: 205 VVDYSGSMSGEGKNGVVKGLNAALDPDQAKKSY----------------IEPASGDVNI- 247 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + T + ++ + GT + + TD + A L S+ + S +T Sbjct: 248 LIPFETEAHRPVKAT-GTS-TSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYT 305 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 I+ +TDG +N+ + K++ + I +I + Sbjct: 306 TA---------------IVLMTDGRSNSDHQDEFESAY--KSRGRDLPIFSIMFGDADPS 348 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 Q LK+ + + L VF+ + Sbjct: 349 Q--LKSLATLSNAKVFDGRSGDLAAVFRQV 376 >gi|314948817|ref|ZP_07852188.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0082] gi|313644760|gb|EFS09340.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0082] Length = 1129 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 52/152 (34%), Gaps = 41/152 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM ++ +K + F+D++ S + + Sbjct: 291 LDLVLVVDWSGSM-------------------NDNNRIGEVKIGVDRFVDTLAD-SGITD 330 Query: 251 DVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + MG +GY++ + + V+ V S T + A++ A L+ Sbjct: 331 KINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSRLSV 390 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 391 PNGH------------------KKVIVLLTDG 404 >gi|307274607|ref|ZP_07555787.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2134] gi|306508759|gb|EFM77849.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2134] Length = 1103 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|119593589|gb|EAW73183.1| hCG25234 [Homo sapiens] Length = 195 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 39/240 (16%), Positives = 81/240 (33%), Gaps = 62/240 (25%) Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 N+ +++ + + L + + R +LL+ + ++ LV+D S Sbjct: 3 NHRSTWDVIMSSEDFQHLSPMTEINLPR-------PTFSLLKSKQR-----VVCLVLDKS 50 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 GSM + + +M A L + I+ S V GL+ + Sbjct: 51 GSM----------------NAEDRLFRMNQA--AELYLIQIIEKGSLV------GLVTFD 86 Query: 261 TRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 + + + + Q +T ++ T +K +Q + + +F + Sbjct: 87 SFAKIQSKLIKIIDDNTYQKITANLPQEADGGTSICRGLKAGFQAIPQSNQSTFGSE--- 143 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRL 378 II LTDGE+ +C + K++ I TI++ + L Sbjct: 144 -------------IILLTDGEDYQIS-------LCFGEVKQSGTVIHTIAL-GPSADEEL 182 >gi|78060312|ref|YP_366887.1| hypothetical protein Bcep18194_C7199 [Burkholderia sp. 383] gi|77964862|gb|ABB06243.1| hypothetical protein Bcep18194_C7199 [Burkholderia sp. 383] Length = 423 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 18/129 (13%), Positives = 52/129 (40%), Gaps = 6/129 (4%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+ +R ++ + ++ +II LS+ + +G + + + ++ ++++ +A Sbjct: 1 MNSAARHPKVMRRSLHRQRGAVAIIVGLSLAVMIGFVGLALDLGKLYVTRSELQNSADAC 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L+ A + S +S + I A F +N + + + F ++ Sbjct: 61 ALSAARDLTSAISLSVAEADGI------AAGHLNFVFFQNKSVQMSTNANVTFSDSLTDP 114 Query: 121 IVNSSRISM 129 + S ++ Sbjct: 115 FLTRSAVTT 123 >gi|19552620|ref|NP_600622.1| hypothetical protein NCgl1349 [Corynebacterium glutamicum ATCC 13032] gi|62390288|ref|YP_225690.1| Mg-chelatase subunit [Corynebacterium glutamicum ATCC 13032] gi|21324171|dbj|BAB98796.1| Hypothetical protein [Corynebacterium glutamicum ATCC 13032] gi|41325625|emb|CAF21414.1| secreted Mg-chelatase subunit [Corynebacterium glutamicum ATCC 13032] Length = 525 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 30/215 (13%), Positives = 63/215 (29%), Gaps = 42/215 (19%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D+SGSM ++ LK+ + + + Sbjct: 344 VLDVSGSMLGQ--------------------RITLLKDTMSDLISGGATTDLANVSLR-- 381 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 I S+G +V ++L + S ++Q + L +D + Sbjct: 382 ----DREKVSIIPFSFGPHEVIS------ETLGAVGSPSRTDLQQRVEALQADGGTGIYD 431 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGE----NNNFKSNVNTIKICDKAKENFIKIVTISI-N 370 + + I+ +TDGE + + + I + I Sbjct: 432 AVLAAYAESAGGDYIPSIVLMTDGELTAGRTYDQFLTEWNALPSNIRS--IPVFVILYGE 489 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 A+ L ++ ++ +N L F+ I Sbjct: 490 ANVADMEQLA--ATTGGETFDAIN-GDLDEAFKEI 521 >gi|327538509|gb|EGF25172.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 764 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 45/326 (13%), Positives = 106/326 (32%), Gaps = 71/326 (21%) Query: 91 IDDAKRFIKNHIKESLSGYSAVFYNTEIQN---------IVNSSRISMTHMANNRLDSSN 141 + DA R + + E + + VF + +I + + I T + + N Sbjct: 229 VPDASRITPDSLPEGMRNGNDVFVSVKIDAAMPIQQIVPVTHELDIQQTSDTHAAITLKN 288 Query: 142 NTIFYNMDVMTSY-----DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 + + D + Y D L + H + + ++ + IE E + LV Sbjct: 289 QSTIADKDFIIEYRLAGDDSTLASLTHRESDAEDGYVMLALQPKWSIEPTEITPREVILV 348 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 +D SGSM ++ L+ LD ++ + + Sbjct: 349 LDTSGSM--------------------NGPAISQLRLFADHVLDHLNPNDEFR------V 382 Query: 257 IGYTTRVEKNIEPS-----WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 I ++ R + + +Q+V S T+ PA+K A Sbjct: 383 IAFSNRTTAFQPDAIAATDANIQSAKQFVRGLRAS---GGTNLLPALKLAL--------- 430 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 G + ++++ +TD ++ + ++ + + ++ I+ A Sbjct: 431 ---------GGEADESARPRYMVLMTDA---LVGNDHSILRYLRQPEFQDARVFPIAFGA 478 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADS 397 +PN + + + +++ + Sbjct: 479 APNDYLISR--AAEMGRGFSMQVTNQ 502 >gi|326797335|ref|YP_004315155.1| hypothetical protein Marme_4119 [Marinomonas mediterranea MMB-1] gi|326548099|gb|ADZ93319.1| hypothetical protein Marme_4119 [Marinomonas mediterranea MMB-1] Length = 187 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 55/164 (33%), Gaps = 5/164 (3%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ +A+E I FALS+ G L+ D + N M A AS + + Sbjct: 3 RRFLANESGATVIEFALSLPIMF---GILLVSTDLYNI-NRMRGDMEQASHNLASILANQ 58 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI-VNSSRISMT 130 D F+ + H + + I + + S + EI ++ S Sbjct: 59 QEWNADSFDYLIEHTIDNSVGEEYELIVSKVNIDRSMDWSPIRRGEISDVCAEKSSGKYY 118 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 D +NT + + + Y+ L LL + + Sbjct: 119 SDQMPEEDPDSNTASFLVIQLCRYNDDLIINSGLLGSKKMESTS 162 >gi|229577024|ref|NP_001153314.1| collagen type XXVIII alpha 1 a [Danio rerio] gi|228007387|emb|CAQ51228.1| collagen type XXVIII alpha 1 a [Danio rerio] Length = 1208 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 36/160 (22%), Positives = 57/160 (35%), Gaps = 16/160 (10%) Query: 260 TTRVEKNIEPSWGTEKVR--QYVTRDMDSLILKPTD----STPAMKQAYQILTSDKKRSF 313 + R++ + SW ++ +V + K T+ S M AY I Sbjct: 92 SIRLDTGRKLSWRAALLQYSSHVIIEQTLKQWKGTENFKSSIAPM--AY-IGHGTYTTYA 148 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI---- 369 TN + V+ S K + LTDG N + AK +K+ TI I Sbjct: 149 ITNMTKIFVEESSPERIKIALLLTDG--FFHPRNPDIFSAMADAKNQGVKVFTIGITRTA 206 Query: 370 NASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQL 408 N N L + YN+ + + + V I+QL Sbjct: 207 NDPVNAANLRLLSSTPASRFLYNLQDTNVMEKVITQIAQL 246 >gi|255598079|ref|XP_002536925.1| conserved hypothetical protein [Ricinus communis] gi|223518102|gb|EEF25458.1| conserved hypothetical protein [Ricinus communis] Length = 451 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 45/340 (13%), Positives = 104/340 (30%), Gaps = 75/340 (22%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + +++ S++ +S+ + L ++G + ++ + +A A A++ +SN Sbjct: 3 RSFPHAQRGGISLMVIVSLTTLLAVVGLAFSAGLSYLVRSKL-NAATDAAGLAAARAISN 61 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + D+ + +R F N++ N + +I + S Sbjct: 62 GTTQADQIANAKAAGQRFFHA---NFPSNYL----------MSNATLNDISVTFSGSEV- 107 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 + V S + PA++ +R Sbjct: 108 ---------------TIGVSASASLPAALFGGF-------GTSALAPAVVTET--KRKDL 143 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +V+D SGS+ + + + + R ++ + A +D + Sbjct: 144 DMIVVMDTSGSLSPSAANVRSSAITFLNQFNATRDRVGLVHFAFGAIVD---------DA 194 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + G+ T ++ Y T S M A Q + S Sbjct: 195 IRQTARGFDRASM--------TNHIKAY-------AFSGSTASAEGMYTARQQINSVPTA 239 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + + + I+F +DG N+F + +N Sbjct: 240 NLNRSNM------------RVIVFFSDGAPNSFGAYLNWK 267 >gi|330808169|ref|YP_004352631.1| hypothetical protein PSEBR_a1432 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327376277|gb|AEA67627.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 2855 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 69/225 (30%), Gaps = 43/225 (19%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL------VVDLSGSMHCA 206 S + Q + L N +VS + + + ER + +E+ V+D+SGSM Sbjct: 2008 SESFTYQATDSLGNSTTGSLVVSIVDDVPKAVASERSVTAVEIDSNLLIVLDVSGSMADD 2067 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 +++ K A+ LD D L DV + L+ +++ Sbjct: 2068 SGVPGL-------------SRLDLAKQAISALLDKYDDL----GDVKVQLVTFSSSATDQ 2110 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + ++ T+ A+ A + + + N Sbjct: 2111 TSVWVDVATAKSLLSSLSAD---GGTNYDAAVATAKTAFVTSGQLTGAQNIGY------- 2160 Query: 327 LPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 F +DG+ N+ + N IK I + Sbjct: 2161 --------FFSDGKPNSGLETGTADEAAWKAFLDANGIKNYAIGL 2197 >gi|315158311|gb|EFU02328.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0312] Length = 1103 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|170724217|ref|YP_001751905.1| Na-Ca exchanger/integrin-beta4 [Pseudomonas putida W619] gi|169762220|gb|ACA75536.1| Na-Ca exchanger/integrin-beta4 [Pseudomonas putida W619] Length = 5962 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 65/190 (34%), Gaps = 39/190 (20%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + N +V+ + L + + + I +VD SGSM Sbjct: 5421 DSITGTDGNDIVVADVSGLHVVPGQD---YNIAFIVDTSGSMGS---------------- 5461 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKE--DVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + + A K +L ++ + V + L+ + T+V+ ++ + ++ Sbjct: 5462 ----SGVDAAKKSLESVFKTLAASVKGAQSGTVNILLVDFATQVKSSVSVTLNDAGLKTL 5517 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 ++ T+ A K ++F N G + F F+TDG Sbjct: 5518 LSALGTLNSNGGTNYEDAFKTT---------ANWFANLKAAGSTGSNQTF-----FITDG 5563 Query: 340 ENNNFKSNVN 349 E ++++ Sbjct: 5564 EPTYYQTSEQ 5573 >gi|29375667|ref|NP_814821.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis V583] gi|29343128|gb|AAO80891.1| von Willebrand factor type A domain protein [Enterococcus faecalis V583] Length = 1103 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 268 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 307 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 308 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 368 GH------------------KKVIVLLTDG 379 >gi|260841558|ref|XP_002613979.1| hypothetical protein BRAFLDRAFT_67440 [Branchiostoma floridae] gi|229299369|gb|EEN69988.1| hypothetical protein BRAFLDRAFT_67440 [Branchiostoma floridae] Length = 1796 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 36/292 (12%), Positives = 91/292 (31%), Gaps = 46/292 (15%) Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI-VNSSRISMTHMANNRLDSSNNT 143 + R L A F K S VF ++ N+ + +T + N + + Sbjct: 21 VSARNLAQVASTFQNQVNKYQDSRADIVFMLDSSGSVGENNFQTEITFVENFLSQLTISP 80 Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRY----NQKIVSFIPALLRIEMGERPIFLIELVVDL 199 + +++ D I+++ N + +++ + + EL+ Sbjct: 81 QASRVAIVSFDDAARTHIDYIKNPKNKCSFLRELKTVKYTGGSTNAEDAFRLAQELLRPK 140 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 S Q K+T++ + FL + + + ++ + Sbjct: 141 SA---------------FANNQPVKQTEI----TFVENFLSQLTISPQAS---RVAVVSF 178 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFF 318 +I+ + ++ T++ + A ++L Sbjct: 179 DNHARTHIDYINSPKNKCSFLRELKAVKYTGGSTNAEDGFRLAQELLRP----------- 227 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN-FIKIVTISI 369 Q P ++ +++LTDG K + + + + K +I +I + Sbjct: 228 -QSAFTNYQPVKQVVVYLTDG-----KPDKDPVGRANNLKSVYNAEIYSIGV 273 >gi|297618081|ref|YP_003703240.1| hypothetical protein Slip_1922 [Syntrophothermus lipocalidus DSM 12680] gi|297145918|gb|ADI02675.1| protein of unknown function DUF2134, membrane [Syntrophothermus lipocalidus DSM 12680] Length = 310 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 20/110 (18%), Positives = 46/110 (41%), Gaps = 14/110 (12%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 +K I E +++FALS+ + L + ++ + +A ++A+LAGA ++ N Sbjct: 5 RKLIGGEGGTAAVLFALSLTALLGFTALVTDSGLLFLNQSRVANAVDSAVLAGAQELPGN 64 Query: 72 ------LSRLGDRFESISNHAK--------RALIDDAKRFIKNHIKESLS 107 ++R +++ R ++ +A R + + L Sbjct: 65 PDAALAVARTYAEANGVTDAEAVFSVSEDGRQIVGEATRRVGLYFARVLG 114 >gi|218528924|ref|YP_002419740.1| hypothetical protein Mchl_0894 [Methylobacterium chloromethanicum CM4] gi|218521227|gb|ACK81812.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 477 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 51/359 (14%), Positives = 115/359 (32%), Gaps = 26/359 (7%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + ++ + + +++FAL+V+ + L+G + K +++A +AA LAG Sbjct: 10 RLKGRATALASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGV 69 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 ++ + + ++ K K F N K + EI + Sbjct: 70 VTAKEFIAANAQQSDVMTAGIKAGEYQALKAFNVNASKVPF--ATVSLSQLEIVRSGQTL 127 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 ++++ A + + ++ +T+ R+ + +V ++ Sbjct: 128 DATVSYTATVQ-STFGRLFGLSVTTLTN---RVNASADIAGYLDFYLMVDVSGSMGLPTT 183 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 L V+ G+ C + + K + + A+ NA+ LD Sbjct: 184 DSDAALLASKSVEDQGN--CQFACHFPNRKGWNLAAGKIQLRSDAVNNAVCALLDRASK- 240 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 V +G+ + R+ S T + + D P T + L Sbjct: 241 PIVPNQYRIGIYPFINRLATLAPLSDTTTSLAS-LKTTADCGKAWPLAFTNLLDTGSTQL 299 Query: 306 TSDKKRSFFTNFFRQGVKI----------------PSLPFQKFIIFLTDGENNNFKSNV 348 ++ + T + S + F+ +TDG N+ + Sbjct: 300 FTNNDPTTGTGSGGTHFETALPQMKSTIRTFGNGSSSANPKPFVFLITDGMQNSQTYSS 358 >gi|260808371|ref|XP_002598981.1| hypothetical protein BRAFLDRAFT_221835 [Branchiostoma floridae] gi|229284256|gb|EEN54993.1| hypothetical protein BRAFLDRAFT_221835 [Branchiostoma floridae] Length = 193 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 38/233 (16%), Positives = 73/233 (31%), Gaps = 42/233 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + V+D SGS+ + A +K + D +++ Sbjct: 1 KHQQLDMVFVLDGSGSI--------------------QAVNFAKVKKFAVDLSDGLNISP 40 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +GLI YT + + T K + S T + A+ A Sbjct: 41 TA---TRVGLIEYTDSPTVEFKLADHTNKASLATAINNVSYQSGGTQTGRALDAA----- 92 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQ--KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 T + +P++ F + I +TDG S N + ++N I Sbjct: 93 -------RTQMDWRQPPVPNVCFSLLQAAIVVTDG-----MSGDNVQQPAKALRDNDISA 140 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + I + N L + + + + N D L + IS + ++ Sbjct: 141 YGVGIGPAINANELNEIAGGDAGHVFYIPNYDKLEKEMEKISNSVCSGVLQIL 193 >gi|7258382|emb|CAB77598.1| putative protein [Arabidopsis thaliana] Length = 676 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 61/166 (36%), Gaps = 43/166 (25%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +I R + V+D+SGSM TK+A LK A+ + + Sbjct: 234 QISQYRRAPIDLVTVLDISGSMGG--------------------TKLALLKRAMGFVIQN 273 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQ 300 + + +I +++ + + ++ RQ + ++SL+ T+ +++ Sbjct: 274 LGSSD------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRK 327 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 +++ +R+ + II L+DG + + Sbjct: 328 GAKVMEDRLERNSVAS----------------IILLSDGRDTYTTN 357 >gi|28393354|gb|AAO42101.1| unknown protein [Arabidopsis thaliana] Length = 650 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 61/166 (36%), Gaps = 43/166 (25%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +I R + V+D+SGSM TK+A LK A+ + + Sbjct: 208 QISQYRRAPIDLVTVLDISGSMGG--------------------TKLALLKRAMGFVIQN 247 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQ 300 + + +I +++ + + ++ RQ + ++SL+ T+ +++ Sbjct: 248 LGSSD------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRK 301 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 +++ +R+ + II L+DG + + Sbjct: 302 GAKVMEDRLERNSVAS----------------IILLSDGRDTYTTN 331 >gi|79607904|ref|NP_974433.2| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645764|gb|AEE79285.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 632 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 61/166 (36%), Gaps = 43/166 (25%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +I R + V+D+SGSM TK+A LK A+ + + Sbjct: 233 QISQYRRAPIDLVTVLDISGSMGG--------------------TKLALLKRAMGFVIQN 272 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQ 300 + + +I +++ + + ++ RQ + ++SL+ T+ +++ Sbjct: 273 LGSSD------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRK 326 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 +++ +R+ + II L+DG + + Sbjct: 327 GAKVMEDRLERNSVAS----------------IILLSDGRDTYTTN 356 >gi|79315048|ref|NP_001030861.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645765|gb|AEE79286.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 633 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 61/166 (36%), Gaps = 43/166 (25%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +I R + V+D+SGSM TK+A LK A+ + + Sbjct: 234 QISQYRRAPIDLVTVLDISGSMGG--------------------TKLALLKRAMGFVIQN 273 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQ 300 + + +I +++ + + ++ RQ + ++SL+ T+ +++ Sbjct: 274 LGSSD------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRK 327 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 +++ +R+ + II L+DG + + Sbjct: 328 GAKVMEDRLERNSVAS----------------IILLSDGRDTYTTN 357 >gi|30694117|ref|NP_191038.2| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645763|gb|AEE79284.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 675 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 61/166 (36%), Gaps = 43/166 (25%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +I R + V+D+SGSM TK+A LK A+ + + Sbjct: 233 QISQYRRAPIDLVTVLDISGSMGG--------------------TKLALLKRAMGFVIQN 272 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQ 300 + + +I +++ + + ++ RQ + ++SL+ T+ +++ Sbjct: 273 LGSSD------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRK 326 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 +++ +R+ + II L+DG + + Sbjct: 327 GAKVMEDRLERNSVAS----------------IILLSDGRDTYTTN 356 >gi|293348732|ref|XP_001072793.2| PREDICTED: collagen, type XXII, alpha 1 [Rattus norvegicus] gi|293360639|ref|XP_243609.5| PREDICTED: collagen, type XXII, alpha 1 [Rattus norvegicus] Length = 1613 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 53/159 (33%), Gaps = 19/159 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +D +G++ Y+ R E + + + T++ A++ Sbjct: 74 VDTFEVGPGHTRVGVVRYSDRPTTAFELGHFSSREEVKAAARRITYHGGNTNTGDALRYI 133 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 S + G + + F++ I LTDG + + + Sbjct: 134 TSRSFSAQA----------GGRPGNRAFKQVAILLTDGRSQDLVLDAAAAAH-----AAG 178 Query: 362 IKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 I+I + + A + L S P + ++V + +++ Sbjct: 179 IRIFAVGVGA--ALKEELDEIASEPKSAHVFHVSDFNAI 215 >gi|156404157|ref|XP_001640274.1| predicted protein [Nematostella vectensis] gi|156227407|gb|EDO48211.1| predicted protein [Nematostella vectensis] Length = 476 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 56/171 (32%), Gaps = 27/171 (15%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVD SGSM + ++ + R ++A L + + E Sbjct: 212 DVILVVDYSGSMGGSRLPIAKEAAKTVLDTLNPRDRVAFLAFESGVRRVKVTSGDAKDEK 271 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + + K + +++++ + S T A A+ IL Sbjct: 272 C------FESSLAKASPV--NIDILKKFLDGEYAS---GGTMYAVAFNAAFDIL------ 314 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K + + I+F+TDG + TI K + + Sbjct: 315 -------DKYYKEKNTTRRPVILFMTDG---APNDDPGTILNTVKMRNQGL 355 >gi|157962337|ref|YP_001502371.1| cell wall anchor domain-containing protein [Shewanella pealeana ATCC 700345] gi|157847337|gb|ABV87836.1| LPXTG-motif cell wall anchor domain [Shewanella pealeana ATCC 700345] Length = 789 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 62/187 (33%), Gaps = 45/187 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM + K AL L + Sbjct: 403 LVIDTSGSMSGDA--------------------IIQAKTALKYALAGLRPTDKFN----- 437 Query: 255 GLIGYTTRVEKNIEPS-----WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ-ILTSD 308 ++ + + V+K + + + + Y+ R + T+ + A+ A +D Sbjct: 438 -IVQFNSDVDKWSGMAMSATPYNLAQAQNYINRLEAN---GGTEMSIAINAALNIETVTD 493 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K+ + G + ++F+TDG +N I +A+ ++ TI Sbjct: 494 KETGTELDNNDLGSN-----LLRQVLFITDGAVSNESMLFELI----EAQLGDSRLFTIG 544 Query: 369 I-NASPN 374 I +A Sbjct: 545 IGSAPNA 551 >gi|73954248|ref|XP_546328.2| PREDICTED: similar to integrin, alpha 1 precursor [Canis familiaris] Length = 1182 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 72/184 (39%), Gaps = 25/184 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D ++ + + +G++ Y V + V + +I + T Sbjct: 194 AFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNL--NKYSSTEEVLVAANQIIQRGGRQT 251 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A I T+ K+ R+G +K ++ +TDGE+++ I+ C+ Sbjct: 252 ---MTALGIDTARKEAFTEARGARRG-------VKKVMVIVTDGESHDNHLLNKVIQDCE 301 Query: 356 KAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + I+ +I+I S N L +K+ S P ++ +NV + +L+ + + + Sbjct: 302 ---DENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEAL 358 Query: 406 SQLM 409 + + Sbjct: 359 GERI 362 >gi|170740935|ref|YP_001769590.1| hypothetical protein M446_2717 [Methylobacterium sp. 4-46] gi|168195209|gb|ACA17156.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 432 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 61/445 (13%), Positives = 135/445 (30%), Gaps = 71/445 (15%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + A + ++F ++++ LLL+G + + + +++ + A+LA S Sbjct: 17 RVFAADRSGSIGMMFVVTLVPVLLLVGAAVDFTSYQKARTELDAVADQAVLAAVS----- 71 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN--SSRISM 129 + A AK F S + V S S Sbjct: 72 -------AAGMKMSQADAEAAMAKLFTDAAAALPNVSASPRAATAPTTDGVRTASLTYSA 124 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 T + +T+ + + + F + L + P++ Sbjct: 125 TIRTGIMRLAGFSTVAFGGTATAASPNPI-FTDFYLLLDNS-------PSMGVAATTADI 176 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 ++ D M++ D + K ++ +++A +D+ + Sbjct: 177 ATMVANTSDQCAFACHDMSAGGNDYYAKAKNLGVKM-RIDVVRDATQQLMDTASAKAIAA 235 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST--PAMKQAYQILTS 307 M + + T V+ +L TD+ M YQ + Sbjct: 236 GQYRMAIYSFGTSCSGIGL---------NQVSALTANLSTSKTDAGALDLMTVPYQNYNN 286 Query: 308 DKKRSFFTNFFRQGVKIPS-------LPFQKFIIFLTDG--ENNNFKSNVN--------- 349 D+ F F R +P+ QK + F++DG + N + Sbjct: 287 DQCTDFDGIFARLNSAVPNPGSGASAASPQKVVFFVSDGVADANYPSTCTKPTTNGRCQE 346 Query: 350 --TIKICDKAKENFIKI---VTISINASPNG-------------QRLLKTCVSSPEYHYN 391 T+ C K+ I++ T + NG + C SP+ ++ Sbjct: 347 PITLANCQALKDRGIRVAVLYTTYLPLPTNGWYNTWIAPFSSQIATNMAACA-SPDLYWP 405 Query: 392 VVNADSLIHVFQNISQLMVHRKYSV 416 V ++ + + + + +V + + Sbjct: 406 VSPSEGIADAMKGLFKKVVDSQRRI 430 >gi|311252831|ref|XP_003125289.1| PREDICTED: vitrin-like isoform 1 [Sus scrofa] Length = 656 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 50/182 (27%), Gaps = 25/182 (13%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + L + D +G + YT T K Sbjct: 490 RTVLQFVANLSKEFDISDTDTRVGAVQYTYEQRLEFGFDQYTTKPDVLNAIKRVGYWSGG 549 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ A + L K + +K +I +TDG +V Sbjct: 550 TSTGAAINYALEQLFKKSKPNK----------------RKLMILITDG---RSYDDVRIP 590 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 + A + I + Q L+ + P ++ + V D+L I Q + Sbjct: 591 AM--VAHHKGVITYAIGVA--WAAQEELEIIATHPARDHAFFVDEFDNLYKSVPKIIQNI 646 Query: 410 VH 411 Sbjct: 647 CT 648 >gi|217970127|ref|YP_002355361.1| von Willebrand factor type A [Thauera sp. MZ1T] gi|217507454|gb|ACK54465.1| von Willebrand factor type A [Thauera sp. MZ1T] Length = 840 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 62/198 (31%), Gaps = 47/198 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 R+ P ++++VD SGSM +AA + AL + Sbjct: 259 RVPAAAHP-LAVKILVDCSGSMQGD--------------------SIAAARRALQAIIAG 297 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD---SLILKPTDSTPAM 298 + +E L + + VE W T + + L T+ A+ Sbjct: 298 L------REGERFSLSRFGSTVEHRSRALWRTSAATRQAGQRWAMQLQADLGGTEMENAL 351 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 L D + S T V ++ +TDG+ + V +A+ Sbjct: 352 ASTL-ALAGDAEPSPGTEEGAAAVD---------LLLITDGQIHAIDRTVK------RAR 395 Query: 359 ENFIKIVTISI-NASPNG 375 +I + I +A G Sbjct: 396 ALGNRIFVVGIGSAPAEG 413 >gi|224285863|gb|ACN40645.1| unknown [Picea sitchensis] Length = 829 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 65/169 (38%), Gaps = 29/169 (17%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + G R + V+D+SGSM TK+A LK A+ + ++ Sbjct: 351 DPGCRAPIDLVTVLDVSGSMSG--------------------TKLALLKRAMAFVISNLS 390 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAY 302 + ++ +++ ++ T ++ R ++ L+ T+ +++ Sbjct: 391 PED------RLSVVVFSSTAKRVFSLKRMTPDGQRAANRVVERLLCTGGTNIAEGLRKGA 444 Query: 303 QILTSDKKRSFFTNFF--RQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 ++L ++R+ + G SL + ++F +D + + + + Sbjct: 445 KVLEDRRQRNPVASIMLLSDGQDTYSLSSRGVVLFPSDEQRRSARQSTR 493 >gi|87308834|ref|ZP_01090973.1| hypothetical protein DSM3645_11362 [Blastopirellula marina DSM 3645] gi|87288545|gb|EAQ80440.1| hypothetical protein DSM3645_11362 [Blastopirellula marina DSM 3645] Length = 616 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 63/196 (32%), Gaps = 48/196 (24%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++V I ERP + ++D+SGSM K Sbjct: 230 NAEHRLVRIGIKGKEIANAERPASNLVFLLDVSGSM-------------------NNARK 270 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + LK + L +D + V VY G G + + + + +D L Sbjct: 271 LPLLKQGMKLLVDQLGENDKVAIVVYAGAAG--------MVLNSTNGDDKSTIMEALDRL 322 Query: 288 ILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN---N 343 T+ ++ AYQ T + + +I TDG+ N Sbjct: 323 QAGGSTNGGQGIELAYQAATENFIKGGVNR----------------VILCTDGDFNVGVT 366 Query: 344 FKSNVNTIKICDKAKE 359 S++ T+ DKAK Sbjct: 367 STSDLVTMAA-DKAKS 381 >gi|309355882|emb|CAP38139.2| hypothetical protein CBG_21291 [Caenorhabditis briggsae AF16] Length = 430 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 65/220 (29%), Gaps = 47/220 (21%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LV+D S + P+ + K+ + + + H Sbjct: 243 LDLVLVLDFS-------------TTTDPVYN--------SYKDLSKRLVQQLKIGPH--- 278 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSD 308 + + + T + + ++ V R +D L K T +++A Sbjct: 279 YTQVAAVTFATVGRTRVRFNLKKYSTQEEVLRGIDKLQSKGGTTAIGAGIEKA------- 331 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 T P + + I+F TDG +N + A ++ T++ Sbjct: 332 -----LTQIDESEGARPGIATKVMIVF-TDGWSNKGPDPEKRAR---DAVNAGFEMYTVA 382 Query: 369 INASPNG-----QRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 A G L S + + V +L+ + Sbjct: 383 YTARAPGSVTLNNETLSAISGSSGHAFTDVTFQTLVDKIK 422 >gi|188527810|ref|YP_001910497.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Shi470] gi|188144050|gb|ACD48467.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Shi470] Length = 217 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 30/234 (12%), Positives = 73/234 (31%), Gaps = 43/234 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E + L++D SGSM+ ++ T++ AL + ++++ + Sbjct: 11 EERFIPVFLLLDTSGSMNESLG---------------NCTRIEALNLCIQKMIETLKQEA 55 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + M +I + P + + T A + A ++ Sbjct: 56 KKELFSKMAIITFGENGAVLHTPFDDVKNI-----NFKPLSASGGTPLDQAFRLAKDLIE 110 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 P+ ++ + I ++DGE N+ K + + Sbjct: 111 DKD-------------TFPTKFYKLYSILVSDGEPNDDKWQKALSNFHHDGRSAKSVCWS 157 Query: 367 ISI----NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I I + K + + + + L+ +F+ ++Q + S+ Sbjct: 158 IFIGDRNTNPQVNKDFGK------DGVFYADDVEKLVGLFEIMTQTISKGSTSI 205 >gi|167763115|ref|ZP_02435242.1| hypothetical protein BACSTE_01484 [Bacteroides stercoris ATCC 43183] gi|167699455|gb|EDS16034.1| hypothetical protein BACSTE_01484 [Bacteroides stercoris ATCC 43183] Length = 342 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 77/225 (34%), Gaps = 45/225 (20%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 ++ Y ++F + +++ ++E +R + + +D+S SM Sbjct: 50 VSKYRPDVKFWMVFVAIGLFSVLLARPQFGSKLETVKRQGVEVMIALDISNSMLA----- 104 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 QD + +++ K + +D ++ + V G+I + + + Sbjct: 105 ----------QDVQPSRLQKAKRLVAQLVDKME-----NDKV--GMIVFAGDAFTQLPIT 147 Query: 271 WGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + ++ SLI K T A+ A + T P Sbjct: 148 SDYISAKMFLESIDPSLISKQGTAIGAAINLASRSFT------------------PQEGV 189 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 + +I +TDGEN+ + ++ A E I++ + + Sbjct: 190 GRAVIVITDGENHEGDA----VEAAKDAAEKGIQVNVLGVGMPEG 230 >gi|145295537|ref|YP_001138358.1| hypothetical protein cgR_1465 [Corynebacterium glutamicum R] gi|140845457|dbj|BAF54456.1| hypothetical protein [Corynebacterium glutamicum R] Length = 354 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 30/215 (13%), Positives = 63/215 (29%), Gaps = 42/215 (19%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D+SGSM ++ LK+ + + + Sbjct: 173 VLDVSGSMLGQ--------------------RITLLKDTMSDLISGGATTDLANVSLR-- 210 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 I S+G +V ++L + S ++Q + L +D + Sbjct: 211 ----GREKVSIIPFSFGPHEVIS------ETLGAVGSPSRIDLQQRVEALQADGGTGIYD 260 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGE----NNNFKSNVNTIKICDKAKENFIKIVTISI-N 370 + + I+ +TDGE + + + I + I Sbjct: 261 AVLAAYAESAGGDYIPSIVLMTDGELTAGRTYDQFLTEWNALPSNIRS--IPVFVILYGE 318 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 A+ L ++ ++ +N L F+ I Sbjct: 319 ANVADMEQLA--ATTGGKTFDAIN-GDLDEAFKEI 350 >gi|297154321|gb|ADI04033.1| hypothetical protein SBI_00912 [Streptomyces bingchenggensis BCW-1] Length = 423 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 41/233 (17%), Positives = 74/233 (31%), Gaps = 43/233 (18%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++S A+ E +ELV+D+SGSM ++MAA K Sbjct: 22 VLSPAAAVADDEPARESP-KVELVLDVSGSMRAR--------------DVDGMSRMAAAK 66 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A +D++ V+ + Y + K + Sbjct: 67 QAFNEVIDAVP--DEVRLGIRTLGATYPGQDRKTGCLD------------SKQLYPVGRV 112 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 D T A K A L + + I+ +TDGE++ + + Sbjct: 113 DRTEA-KTAVATLRPTGWTPIGLALRGASKDLAGGDATRRIVLITDGEDSCGQPDP---- 167 Query: 353 ICDKAKENFIK-----IVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 CD A+E + + T+ + + L C++ + V + D L Sbjct: 168 -CDVARELAAQGTHLVVDTLGLTLDSKVRDQLS-CIAEATGGTYTAVQHTDQL 218 >gi|148656885|ref|YP_001277090.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568995|gb|ABQ91140.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 412 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 58/177 (32%), Gaps = 45/177 (25%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM A K+AALK+A+ ++++ V Sbjct: 46 LVLDRSGSMQGA--------------------KLAALKDAVKRVIETLTPQDIV------ 79 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 ++ + V+ + ++ T+K + + T + M L + Sbjct: 80 AIVLFDDTVQTLVPATFATDK-ATLIAQVDAIEEAGGTAMSGGMAAGIVELRKNHDPGRV 138 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ LTDG+ + + + + ++I + + A Sbjct: 139 GA----------------MLLLTDGQT--WGDEDRCRALAQELARDGVRITALGLGA 177 >gi|91201645|emb|CAJ74705.1| hypothetical protein kuste3942 [Candidatus Kuenenia stuttgartiensis] Length = 336 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 61/196 (31%), Gaps = 50/196 (25%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ I V+D+S SM D K ++ K + + + Sbjct: 87 EKEGLEIVFVLDVSMSMLAE---------------DVKPNRLECAKMEIANLVRGL---- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTE-------KVRQYVTRDMDSLILKPTDSTPAMK 299 ED +GL+ + R + T+ ++ V + T+ A+ Sbjct: 128 ---EDDRVGLVVFAAR--AFSLLPYPTKDYEMVFLRILNMVNEHYVRFVPYGTNIGNALI 182 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + +++ +K II LTDGE + + I ++ Sbjct: 183 AAMETFSNEA-------------------GKKIIILLTDGEEQLLRRSQVVEAIRLLLEK 223 Query: 360 NFIKIVTISINASPNG 375 N I I I N Sbjct: 224 NDISTYIIGIGDPNNS 239 >gi|90406967|ref|ZP_01215158.1| putative RTX toxin [Psychromonas sp. CNPT3] gi|90312009|gb|EAS40103.1| putative RTX toxin [Psychromonas sp. CNPT3] Length = 3350 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 63/365 (17%), Positives = 122/365 (33%), Gaps = 64/365 (17%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESIS 83 ++FA V F+ + + ++ N L G+ M+ + + D I Sbjct: 2619 LVFADGVYLVQADGSFIAEGISVVEYDVDINASLND--LDGSESMIIEIHDVPDAAIVIG 2676 Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 + K I E S + V ++ N + + A + D+ N Sbjct: 2677 ATKSEEGVWVIK------IDEGQSDFDGVVSIRLPEDQNNMFTLDIKVTATEQNDNENGQ 2730 Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM 203 TS + ++ + N I E + LV+D+SGSM Sbjct: 2731 N------TTSTTQSVTGTPIIVEETVNLVISEP----------EIAETNLILVLDISGSM 2774 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + ++ + + ++ K AL L+ + L V + L+ + + Sbjct: 2775 NGSI-------------EGSDQNRLDFAKTALSNLLEIQNTL----GVVNVNLVAFENNI 2817 Query: 264 EKNIEPSW-----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + + G + QY+ ++++ T+ A+K + F Sbjct: 2818 FSSHWVTLDGGPEGLASILQYI-ENLNADAYGGTNYQDALKT-------------VMSEF 2863 Query: 319 RQGVKIPSLPFQK--FIIFLTDGE-NNNFKSNVNTIKICDKAKENFI-KIVTISINASPN 374 QGV + K I+FL+DG+ + +N + D I I T+ I S Sbjct: 2864 EQGVASGDIDVSKDTNIVFLSDGKPGQSIINNPVEQEWNDFTSNYNIDSINTVGIQISAG 2923 Query: 375 GQRLL 379 Q L+ Sbjct: 2924 DQVLV 2928 >gi|319787647|ref|YP_004147122.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] gi|317466159|gb|ADV27891.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] Length = 585 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 58/188 (30%), Gaps = 43/188 (22%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 ++ E P + L+VD SGSM K+ LK A + + Sbjct: 208 VDKRELPPANLVLLVDTSGSMD-------------------DPAKLPLLKRAFAQLVPQL 248 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + ++ Y + P+ G + + T+ ++ AY Sbjct: 249 RAKD------RVSIVAYAGHAGLVLPPTPGNRH-GEILAALEGLHAAGSTNGGEGLRLAY 301 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-TIKICDKAKENF 361 RQG + I+ TDG+ N ++ N + + + + Sbjct: 302 -------------AMARQGHVEGGVNR---ILLATDGDFNVGITDRNALLTLVADQRRSG 345 Query: 362 IKIVTISI 369 I + T+ Sbjct: 346 IALSTLGF 353 >gi|296111730|ref|YP_003622112.1| hypothetical protein LKI_08020 [Leuconostoc kimchii IMSNU 11154] gi|295833262|gb|ADG41143.1| hypothetical protein LKI_08020 [Leuconostoc kimchii IMSNU 11154] Length = 894 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 38/262 (14%), Positives = 89/262 (33%), Gaps = 50/262 (19%) Query: 95 KRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 F ++I + + Y T I + + + H + + + ++ D + Sbjct: 37 TAFADSNITPQYTNDNKGVYPTNSWTIPGQNTV-INHQGGDASNGWDKNSSWSGDSSDTS 95 Query: 155 DYRLQFIEHLLNQ-----RYNQKIVSFIPALLRIEMGER-----PIFLIELVVDLSGSMH 204 L+F N +Y ++ + + + I LVVD+SGSM+ Sbjct: 96 KSYLKFGTDTSNPDYQIRKYAKETSTPGLYDVYLNAKGNEVKNIKPIDIVLVVDMSGSMN 155 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT--R 262 ++N + ++ A + + FL +I+ + + V +G++G+++ Sbjct: 156 SSVNGGND--------------RVGAARQGVKNFLKTINDA-GIGKYVNVGVVGFSSPGY 200 Query: 263 VEKNIEPSWGTEKV-----RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 + + S + + + + T + ++ +L D Sbjct: 201 ISSSGTLSENIDASDNQAHITRINNLLANDFKGGTFTQLGIRTGQSMLAGDSND------ 254 Query: 318 FRQGVKIPSLPFQKFIIFLTDG 339 +K +I LTDG Sbjct: 255 -----------HKKMMILLTDG 265 >gi|229125374|ref|ZP_04254461.1| hypothetical protein bcere0016_56090 [Bacillus cereus 95/8201] gi|228658081|gb|EEL13834.1| hypothetical protein bcere0016_56090 [Bacillus cereus 95/8201] Length = 452 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 79/217 (36%), Gaps = 29/217 (13%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + +E+++D SGSM +N + KM A K A+ +LD I Sbjct: 148 KEKSLNVEILLDASGSMAGKVNG---------------QVKMEAAKKAIYNYLDKI---- 188 Query: 247 HVKEDVYMGLIGYT-TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +V + + G+ + E + S G+ +V + + + + L Sbjct: 189 PDNANVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYKKEQFNAALSNFGP--KGWTPL 246 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S + G + ++ +I+ +DGE VN K +++ + + + Sbjct: 247 ASAIESVNDDFKEYTGEENLNVV---YIV--SDGEETCGGDPVNAAKNLNQSSTHAV-VN 300 Query: 366 TISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHV 401 I + + Q+ LK + + V NAD L Sbjct: 301 IIGFDVKNSEQQQLKNTAEAGKGNYATVSNADELYQT 337 >gi|260818868|ref|XP_002604604.1| hypothetical protein BRAFLDRAFT_126772 [Branchiostoma floridae] gi|229289932|gb|EEN60615.1| hypothetical protein BRAFLDRAFT_126772 [Branchiostoma floridae] Length = 2219 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 42/261 (16%), Positives = 87/261 (33%), Gaps = 59/261 (22%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI----- 219 L Q SF+ LL G+ + V+D S S+ + + Sbjct: 49 LEQMALTLNTSFVTPLLNTLNGQ---VDLIFVLDRSASIGWYNGAWDSMLQFVENLLYEF 105 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + T+++ + + + +D +D LS + +EK Sbjct: 106 SVNSVHTRVSIITYSTTVSVD-VDYLSDGSASSRLTKCKLNDDIEK-------------- 150 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 R + + T +T A+++A Q+L +K I LTDG Sbjct: 151 --RLRNKALHGWTATTTALERAKQVLL-----------------TSRPTAKKAIFLLTDG 191 Query: 340 ENNNFK---------SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV--SSPEY 388 +N + SN+ A+ ++I + I + L++ S ++ Sbjct: 192 RSNIGQPPSIPAREISNLRWSGWDMAAQGRQVEIFALGIE--DAVEAELRSIAYPSQQDH 249 Query: 389 HYNVVNADSLIHVFQNISQLM 409 ++ + N F +++L+ Sbjct: 250 YFLLDNFQD----FSLLARLL 266 >gi|330503959|ref|YP_004380828.1| hemolysin-type calcium-binding repeat-containing protein [Pseudomonas mendocina NK-01] gi|328918245|gb|AEB59076.1| hemolysin-type calcium-binding repeat-containing protein [Pseudomonas mendocina NK-01] Length = 3977 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 34/161 (21%), Positives = 59/161 (36%), Gaps = 30/161 (18%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + +V+DLSGSM AP TK+A K+A+ +DS D L V V Sbjct: 3157 LMVVLDLSGSMD-----------DAPSGVSGFSTKLALAKDAVQRLIDSYDNLGDV--MV 3203 Query: 253 YMGLIGYTTRVEKNIEPSWGTEK-VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + + + W T + ++T ++ T+ A+ +A Sbjct: 3204 RI--VTF-ANTASAVGNVWMTASDAKAWLTALANNAGNGSTNYDDALIKA---------- 3250 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 N + K+ Q FL+DG+ +N + Sbjct: 3251 ---MNAYDSTGKLTGTGVQSVSYFLSDGQPTLSNANPGSNN 3288 >gi|227833260|ref|YP_002834967.1| hypothetical protein cauri_1436 [Corynebacterium aurimucosum ATCC 700975] gi|262184244|ref|ZP_06043665.1| hypothetical protein CaurA7_09649 [Corynebacterium aurimucosum ATCC 700975] gi|227454276|gb|ACP33029.1| putative membrane protein [Corynebacterium aurimucosum ATCC 700975] Length = 688 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 65/199 (32%), Gaps = 40/199 (20%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 LV+D SGSM+ +T++ A K+A F+ + + Sbjct: 70 MLVLDSSGSMNVQDAGG--------------QTRLDAAKDATKKFVSEL------GGTIP 109 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +GL+ Y V++ E + V+ + + T A++ D + Sbjct: 110 LGLVTYGGTVDEAPENQEAGCQDIHVVSGPKEDVGDSFTGPIDALQAKGYTPIGDSLKKA 169 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK---ENFI--KIVTIS 368 Q I+ ++DG +C+ AK E I I TI Sbjct: 170 AEELGGQHGT---------IVLVSDG-----IDTCAPPPVCEVAKELHEQGIDLVINTIG 215 Query: 369 INASPNGQRLLKTCVSSPE 387 N ++ L C++ Sbjct: 216 FNVDEEARKELS-CIAEAA 233 >gi|258405287|ref|YP_003198029.1| hypothetical protein Dret_1163 [Desulfohalobium retbaense DSM 5692] gi|257797514|gb|ACV68451.1| Protein of unknown function DUF2134, membrane [Desulfohalobium retbaense DSM 5692] Length = 323 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 25/109 (22%), Positives = 43/109 (39%), Gaps = 2/109 (1%) Query: 14 GIASEKANFSIIFALSV-MSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + E + +II AL V S L G I + KN +++ +AA LAGA ++ N Sbjct: 10 ILRDECGSVAIITALFVLFSLLATAGIAIDIGRQATAKNELQNTLDAAALAGAIELGQN- 68 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 + E+ ++ ++ N IK F T ++ Sbjct: 69 GPANVKSEAKEAAENNSIDNNGLILGDNDIKVGNWTEPNFFSKTPYNSV 117 >gi|91203253|emb|CAJ72892.1| hypothetical protein kustd2147 [Candidatus Kuenenia stuttgartiensis] Length = 701 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 34/228 (14%), Positives = 79/228 (34%), Gaps = 44/228 (19%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + +V+D S SM ++ T+MA L + DL Sbjct: 318 TQDDFQLQVMIVLDYSASM---------------YEKNGDITRMA---------LSAKDL 353 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQ 303 + + + +G++ + R ++ + Y ++++ + Sbjct: 354 IESLNDTHQVGVVEF-HRPDEPPAI---LQDFTTYKNAAIEAVSQFSSGKIYRDFSSCWD 409 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAKENFI 362 + +Q + P K ++FL+DG +N++F + N I + AKE + Sbjct: 410 AVL---------KGLKQFPEKPDPDIFKTLVFLSDGFDNSSFSTPGNVISL---AKERDV 457 Query: 363 KIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLM 409 I + I + + +LK + + + N F+ + + Sbjct: 458 HIYILGI-GRGSEEEVLKNIALETGGTYVHAENIAVFRERFKQTIKDI 504 >gi|293569888|ref|ZP_06680975.1| putative pilus subunit protein PilB [Enterococcus faecium E1071] gi|291587636|gb|EFF19513.1| putative pilus subunit protein PilB [Enterococcus faecium E1071] Length = 1277 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 53/152 (34%), Gaps = 41/152 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM ++ +K + F+D++ S + + Sbjct: 291 LDLVLVVDWSGSM-------------------NDNNRIGEVKIGVDRFVDTLAD-SGITD 330 Query: 251 DVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + MG +GY++ + + V+ V S T + A++ A +L+ Sbjct: 331 KINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSWTNGGTFTQKALRDAGNMLSV 390 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 391 PNGH------------------KKVIVLLTDG 404 >gi|224097862|ref|XP_002311085.1| predicted protein [Populus trichocarpa] gi|222850905|gb|EEE88452.1| predicted protein [Populus trichocarpa] Length = 713 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 62/177 (35%), Gaps = 49/177 (27%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + +L ++ R + V+D+SGSM TK+A LK A+ Sbjct: 250 LASLPQLSQTPRAPVDLVTVLDISGSMAG--------------------TKLALLKRAMG 289 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP----SWGTEKVRQYVTRDMDSLILKPT 292 + ++ + +I +++ + G + Q V + + T Sbjct: 290 FVIQNLGSND------RLSVIAFSSTARRLFSLRRMSDAGRQHALQAVNSLVAN---GGT 340 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 + +++ +++ ++++ + II L+DG++ S + Sbjct: 341 NIAEGLRKGAKVMEERREKNPVAS----------------IILLSDGQDTYTVSGSS 381 >gi|298207016|ref|YP_003715195.1| hypothetical protein CA2559_02145 [Croceibacter atlanticus HTCC2559] gi|83849650|gb|EAP87518.1| hypothetical protein CA2559_02145 [Croceibacter atlanticus HTCC2559] Length = 346 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 71/206 (34%), Gaps = 46/206 (22%) Query: 173 IVSFIPALL-RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 I P + ++E +R + +D+S SM D ++ Sbjct: 70 IALVNPKIGTKLETVKREGVDVVFAIDVSKSMLAE---------------DVAPNRLEKS 114 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILK 290 + + ++S+ +G+I Y + + + ++ + D L + Sbjct: 115 QQLVTQIINSLASD-------RVGIIAYAGSAFPQLPITTDYASAKMFLQNMNTDMLSSQ 167 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T A++ A D++ + + + ++DGE ++ Sbjct: 168 GTAINEAIQLAKTYYNDDEQTN------------------RVLFIISDGE----DHEGDS 205 Query: 351 IKICDKAKENFIKIVTISINASPNGQ 376 + I ++A E I+I TI + + G+ Sbjct: 206 VNIAEEASEEGIRIFTIGVGTTKGGR 231 >gi|330901495|gb|EGH32914.1| von Willebrand factor, type A [Pseudomonas syringae pv. japonica str. M301072PT] Length = 218 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 56/164 (34%), Gaps = 41/164 (25%) Query: 192 LIELVVDLSGSMHC---AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + VD+SGSM SD +++ ++ L FL+ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEV-------------SRLVLVQQLLGDFLE-------G 130 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 131 RKGDRVGLILFGSQAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRLRMR 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 S + ++ +TDG NN + + T Sbjct: 191 PATS------------------RALVLVTDGANNAGQIDPITAA 216 >gi|325297740|ref|YP_004257657.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317293|gb|ADY35184.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 341 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 65/195 (33%), Gaps = 45/195 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++E +R + +D+S SM D +++ K + +++ Sbjct: 81 KMETVKRQGVETVVALDISNSMLAE---------------DVTPSRLEKSKKLISRLVET 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQ 300 + + V M I + + + + ++ SLI + TD A+ Sbjct: 126 FN-----NDKVAM--IVFAGEAFTQLPITSDYISAKMFLETITPSLISTQGTDIRGAIDL 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + T P+ + I+ +TDGEN ++ +A + Sbjct: 179 AMKSFT------------------PNEGVGRAIVLITDGEN----HEGGAVEAAQEAAKK 216 Query: 361 FIKIVTISINASPNG 375 +++ + + + Sbjct: 217 GVRVFVLGVGSPDGS 231 >gi|319778364|ref|YP_004129277.1| PpkA [Taylorella equigenitalis MCE9] gi|317108388|gb|ADU91134.1| PpkA [Taylorella equigenitalis MCE9] Length = 656 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 58/180 (32%), Gaps = 28/180 (15%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSH--------------------VKEDVYMGLIGYTTR 262 + +K+ K A++ +DS + + + V GL+ + + Sbjct: 212 AQPSKLTEFKAAVVFVIDSTISMDPYINKTREAVRELYKQIEKDDLLDQVKFGLVAFRSS 271 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 E + + + YV + D ++KQA + S+ Sbjct: 272 TEAVPDLEY---TSKMYVNPNEVKDGKDFMDKVASLKQAKVSSKEFNEDSYAGINQALND 328 Query: 323 KIPSLPFQKFIIFLT-----DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 + ++I+ +T DG N + ++ +A+ + I T+ + + Sbjct: 329 INWNDFGARYIVLITDAGAIDGNNPLSSTGFGAQQLRQEAQHKGVAIYTLHLKTQAGSKN 388 >gi|256820365|ref|YP_003141644.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] gi|256581948|gb|ACU93083.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] Length = 347 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 58/189 (30%), Gaps = 45/189 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +IE +R I +D+S SM D ++ K + Sbjct: 81 KIETVKREGVDIVFAIDVSKSMLAE---------------DVAPNRLEKAKRIAFETISQ 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQ 300 + +G++ Y + + + ++ + D L + T A++ Sbjct: 126 LK-------GDRVGIVAYAASAYPQLALTTDHSAAKMFLQDMNTDMLSSQGTAIQEAIRM 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + P + + LTDGE + +I +A+E Sbjct: 179 ASNYF------------------DENTPTARLLFILTDGE----DHEMGATEIATEAQEK 216 Query: 361 FIKIVTISI 369 + I TI I Sbjct: 217 GVHIYTIGI 225 >gi|188590759|ref|YP_001922423.1| von Willebrand factor type A domain protein [Clostridium botulinum E3 str. Alaska E43] gi|188501040|gb|ACD54176.1| von Willebrand factor type A domain protein [Clostridium botulinum E3 str. Alaska E43] Length = 984 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 30/157 (19%), Positives = 59/157 (37%), Gaps = 43/157 (27%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P I LV+D SGSM K +K+ +KNA + F++ I + ++ Sbjct: 88 PKKEIVLVLDTSGSM--------------------KDSKIKKMKNAAMEFVNKIKKIPNL 127 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 D+ + Y+T + E + + + T++ +++A IL + Sbjct: 128 DIDI----VTYSTSGYTYLNNGNTEEDLLKIINSIKAD---GGTNTGEGLRKANYILDLE 180 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 K ++ K I+F++DG + Sbjct: 181 KNKNAD----------------KSIVFMSDGMPTYYS 201 >gi|157265460|ref|YP_001468018.1| Von Willebrand factor type A domain [Thermus phage P74-26] gi|156905355|gb|ABU96998.1| Von Willebrand factor type A domain [Thermus phage P74-26] Length = 563 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 43/243 (17%), Positives = 80/243 (32%), Gaps = 25/243 (10%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 + + + I +RIE E + ++VD SGSM ++ + A Sbjct: 341 KALAANISPFMRIEPEEERSLHLVVLVDESGSMGTSIYGLTDISQGRSGEPFLGAASGVA 400 Query: 231 LKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 A L L SI L +K DV M GY G +YV + Sbjct: 401 SNRAFLAKLTSIILYEGLKNADVRMQFFGYGDTALPPPLDELGNH---EYVRELV----- 452 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ-KFIIFLTDGENNNFKSNV 348 + A +T + + + P + + +I+L DGE S+ Sbjct: 453 --------LPYALATITHKSNNGDLSALSHAFGVLRASPAKHRVVIYLADGE----ISSS 500 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 ++ K + IK+ + ++ + + + V D ++ Q I + Sbjct: 501 RLVEAFRKVTSSGIKVYWLDLSGRKRPY---SSSALAAARRFVVNTFDDVLKAIQAIFEE 557 Query: 409 MVH 411 + Sbjct: 558 VAE 560 >gi|157265344|ref|YP_001467903.1| Von Willebrand factor type A domain [Thermus phage P23-45] gi|156905239|gb|ABU96883.1| Von Willebrand factor type A domain [Thermus phage P23-45] Length = 563 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 43/243 (17%), Positives = 80/243 (32%), Gaps = 25/243 (10%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 + + + I +RIE E + ++VD SGSM ++ + A Sbjct: 341 KALAANISPFMRIEPEEERSLHLVVLVDESGSMGTSIYGLTDISQGRSGEPFLGAASGVA 400 Query: 231 LKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 A L L SI L +K DV M GY G +YV + Sbjct: 401 SNRAFLAKLTSIILYEGLKNADVRMQFFGYGDTALPPPLDELGNH---EYVRELV----- 452 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ-KFIIFLTDGENNNFKSNV 348 + A +T + + + P + + +I+L DGE S+ Sbjct: 453 --------LPYALATITHKSNNGDLSALSHAFGVLRASPAKHRVVIYLADGE----ISSS 500 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 ++ K + IK+ + ++ + + + V D ++ Q I + Sbjct: 501 RLVEAFRKVTSSGIKVYWLDLSGRKRPY---SSSALAAARRFVVNTFDDVLKAIQAIFEE 557 Query: 409 MVH 411 + Sbjct: 558 VAE 560 >gi|83312059|ref|YP_422323.1| hypothetical protein amb2960 [Magnetospirillum magneticum AMB-1] gi|82946900|dbj|BAE51764.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 1171 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 29/223 (13%), Positives = 69/223 (30%), Gaps = 55/223 (24%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + +++D S SM A + + + A FL + Sbjct: 63 LDVVMLLDHSSSMGAA-----------------PGSPLQMMLRAAGNFLRQL------SP 99 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 D + ++G+ + + + R T A+ QA ++L Sbjct: 100 DSRVAVVGFNQVPSVHCTLAATPAQARS---ALQAISPGGATSIAAALNQAVELLAH--- 153 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 K ++ +DG+++ I A I ++ + Sbjct: 154 --------------GRPGMDKVVVLCSDGQDD--------IAEIADALARLKAIPSVRVL 191 Query: 371 ASPNGQRLLKTC----VSSPEYHYNVVNADSLIHVFQNISQLM 409 A G ++ V+ + ++++ A + VFQ +++ + Sbjct: 192 AVGFGDEVIHATFLAMVADRQDYFHLTRARDMDDVFQRLAKEV 234 >gi|257880953|ref|ZP_05660606.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257892525|ref|ZP_05672178.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] gi|257816611|gb|EEV43939.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257828904|gb|EEV55511.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] Length = 1107 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 53/152 (34%), Gaps = 41/152 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM ++ +K + F+D++ S + + Sbjct: 269 LDLVLVVDWSGSM-------------------NDNNRIGEVKIGVDRFVDTLAD-SGITD 308 Query: 251 DVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + MG +GY++ + + V+ V S T + A++ A +L+ Sbjct: 309 KINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSV 368 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 369 PNGH------------------KKVIVLLTDG 382 >gi|189461338|ref|ZP_03010123.1| hypothetical protein BACCOP_01988 [Bacteroides coprocola DSM 17136] gi|189431867|gb|EDV00852.1| hypothetical protein BACCOP_01988 [Bacteroides coprocola DSM 17136] Length = 341 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 66/194 (34%), Gaps = 45/194 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++E +R + +D+S SM QD +++ K + +++ Sbjct: 81 KMETVKRQGVETVVALDISNSMLA---------------QDVTPSRLEKSKKLISRLVET 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQ 300 + + V M I + + + + ++ SLI + TD A+ Sbjct: 126 FN-----NDKVAM--IVFAGEAFTQLPITSDYISAKMFLETINPSLISTQGTDIAGAINL 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + T P+ + I+ +TDGEN ++ +A + Sbjct: 179 AMKSFT------------------PNEGVGRAIVLITDGEN----HEGGAVEAAQEAAKK 216 Query: 361 FIKIVTISINASPN 374 +++ + + + Sbjct: 217 GVRVFVLGVGSPDG 230 >gi|149178272|ref|ZP_01856865.1| hypothetical protein PM8797T_16765 [Planctomyces maris DSM 8797] gi|148842921|gb|EDL57291.1| hypothetical protein PM8797T_16765 [Planctomyces maris DSM 8797] Length = 169 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 17/128 (13%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T+ P + + + + + ++ L+DG + Sbjct: 41 LHPVIGGGGTNMAPGL-------------FISREILERPIFPSQIYLRPVVVVLSDGLTS 87 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + T +I + K++ IVT++ + L + +S E+ Y+ L F Sbjct: 88 HP---AKTSEIATQLKKD-ADIVTVAF-GDDADEPYLISLATSSEHFYHCRTGTDLRAFF 142 Query: 403 QNISQLMV 410 ++ + Sbjct: 143 ASVGTTLS 150 >gi|332291973|ref|YP_004430582.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332170059|gb|AEE19314.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 344 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 68/201 (33%), Gaps = 45/201 (22%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++E +R + +D+S SM D ++ K + +++ Sbjct: 81 KLETVKREGVDVVFAIDVSKSMLAE---------------DIAPNRIEKSKQLVTQIINN 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQ 300 + +G+I Y + + + ++++ D L + T A++ Sbjct: 126 LGSD-------RIGIIAYAGSAYPQLPITTDYSSAKLFLSQMNTDMLSSQGTAIGEAIEL 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A +++ + + + ++DGE + I ++A + Sbjct: 179 AKTYYNDEEQTN------------------RVLFIISDGE----DHVGESSNIAEQANDE 216 Query: 361 FIKIVTISINASPNGQRLLKT 381 I+I TI + S G LK Sbjct: 217 GIRIFTIGVGKSEGGPIPLKR 237 >gi|308062304|gb|ADO04192.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Cuz20] Length = 217 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 31/234 (13%), Positives = 72/234 (30%), Gaps = 43/234 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E + L++D SGSM+ ++ T++ AL + ++ + + Sbjct: 11 EERFIPVFLLLDTSGSMNESLG---------------NCTRIEALNLCIQKMIEILKQEA 55 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + M +I + P + + T A + A ++ Sbjct: 56 KKELFSKMAIITFGENGAVLHTPFDDVKNI-----NFKPLSASGGTPLDQAFRLAKDLIE 110 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 P+ ++ + I ++DGE N+ K + + Sbjct: 111 DKD-------------TFPTKFYKLYSILVSDGEPNDDKWQKALSNFHHDGRSAKSVCWS 157 Query: 367 ISI----NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I I + K + + N + L+ +F+ ++Q + S+ Sbjct: 158 IFIGDRNTNPQVNKDFGK------DGVFYADNVEKLVGLFEIMTQTISKGSTSI 205 >gi|261876473|dbj|BAI47562.1| collagen type VI alpha 2 subunit [Mesocricetus auratus] Length = 1026 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 60/189 (31%), Gaps = 49/189 (25%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL--S 246 + V+D S S+ T KN ++ ++ + + Sbjct: 619 GALDVVFVIDSSESIG--------------------YTNFTLEKNFVINVVNRLGAIAKD 658 Query: 247 HVKED-VYMGLIGYTTR-----VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 E +G++ Y+ + + E ++ V I T + A+K Sbjct: 659 PKSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNL--EWIAGGTWTPSALKF 716 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AY L + +R + F + +TDG ++ ++N +CD Sbjct: 717 AYNQLIKESRRQKT---------------RVFAVVITDGRHDPRDDDLNLRALCD----R 757 Query: 361 FIKIVTISI 369 + + I I Sbjct: 758 DVTVTAIGI 766 >gi|281427229|ref|NP_001094211.1| collagen, type VI, alpha 2 [Rattus norvegicus] gi|149043684|gb|EDL97135.1| procollagen, type VI, alpha 2, isoform CRA_a [Rattus norvegicus] Length = 1027 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 60/189 (31%), Gaps = 49/189 (25%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL--S 246 + V+D S S+ T KN ++ ++ + + Sbjct: 620 GALDVVFVIDSSESIG--------------------YTNFTLEKNFVINVVNRLGAIAKD 659 Query: 247 HVKED-VYMGLIGYTTR-----VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 E +G++ Y+ + + E ++ V I T + A+K Sbjct: 660 PKSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNL--EWIAGGTWTPSALKF 717 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AY L + +R + F + +TDG ++ ++N +CD Sbjct: 718 AYNQLIKESRRQKT---------------RVFAVVITDGRHDPRDDDLNLRALCD----R 758 Query: 361 FIKIVTISI 369 + + I I Sbjct: 759 DVTVTAIGI 767 >gi|22203747|ref|NP_666119.1| collagen alpha-2(VI) chain precursor [Mus musculus] gi|125987813|sp|Q02788|CO6A2_MOUSE RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor gi|21706759|gb|AAH34414.1| Collagen, type VI, alpha 2 [Mus musculus] gi|148699895|gb|EDL31842.1| procollagen, type VI, alpha 2 [Mus musculus] Length = 1034 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 60/189 (31%), Gaps = 49/189 (25%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL--S 246 + V+D S S+ T KN ++ ++ + + Sbjct: 627 GALDVVFVIDSSESIG--------------------YTNFTLEKNFVINVVNRLGAIAKD 666 Query: 247 HVKED-VYMGLIGYTTR-----VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 E +G++ Y+ + + E ++ V I T + A+K Sbjct: 667 PKSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNL--EWIAGGTWTPSALKF 724 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AY L + +R + F + +TDG ++ ++N +CD Sbjct: 725 AYNQLIKESRRQKT---------------RVFAVVITDGRHDPRDDDLNLRALCD----R 765 Query: 361 FIKIVTISI 369 + + I I Sbjct: 766 DVTVTAIGI 774 >gi|49809|emb|CAA46541.1| alpha-2 collagen [Mus musculus] Length = 1029 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 60/189 (31%), Gaps = 49/189 (25%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL--S 246 + V+D S S+ T KN ++ ++ + + Sbjct: 622 GALDVVFVIDSSESIG--------------------YTNFTLEKNFVINVVNRLGAIAKD 661 Query: 247 HVKED-VYMGLIGYTTR-----VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 E +G++ Y+ + + E ++ V I T + A+K Sbjct: 662 PKSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNL--EWIAGGTWTPSALKF 719 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AY L + +R + F + +TDG ++ ++N +CD Sbjct: 720 AYNQLIKESRRQKT---------------RVFAVVITDGRHDPRDDDLNLRALCD----R 760 Query: 361 FIKIVTISI 369 + + I I Sbjct: 761 DVTVTAIGI 769 >gi|49907|emb|CAA44206.1| alpha-2 collagen type VI, subunit [Mus musculus] Length = 764 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 60/189 (31%), Gaps = 49/189 (25%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL--S 246 + V+D S S+ T KN ++ ++ + + Sbjct: 357 GALDVVFVIDSSESIG--------------------YTNFTLEKNFVINVVNRLGAIAKD 396 Query: 247 HVKED-VYMGLIGYTTR-----VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 E +G++ Y+ + + E ++ V I T + A+K Sbjct: 397 PKSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNL--EWIAGGTWTPSALKF 454 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AY L + +R + F + +TDG ++ ++N +CD Sbjct: 455 AYNQLIKESRRQKT---------------RVFAVVITDGRHDPRDDDLNLRALCD----R 495 Query: 361 FIKIVTISI 369 + + I I Sbjct: 496 DVTVTAIGI 504 >gi|149624862|ref|XP_001517471.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Ornithorhynchus anatinus] Length = 238 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 65/178 (36%), Gaps = 23/178 (12%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F++ I D L +++ +GL+ Y++ V + T K + + K T Sbjct: 36 VKKFINQIVDSLDVSEQNAQVGLVQYSSSVRQEFPLGRFTSKRDIKAAVKKMTYMEKGTM 95 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ + QK I TDG + ++ S+ Sbjct: 96 TGTALNYLIDNTFAIS-------------SGARPGAQKVGIVFTDGRSQDYISDAA---- 138 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 KAK+ K+ + + + L+ S P E+++ + ++ + + + + + Sbjct: 139 -KKAKDLGFKMFAVGVG--NAVEDELREIASDPVAEHYFYTADFKTINQIGKKLQKKI 193 >gi|322435250|ref|YP_004217462.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] gi|321162977|gb|ADW68682.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 515 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 25/199 (12%), Positives = 66/199 (33%), Gaps = 21/199 (10%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 ++R F + ++ I L + +IG ++ V + + ++++++AA LA Sbjct: 1 MNRLNKLFCLFLRDQRGQVLPIAGLMMFVITAMIGLVVDVGHIYLCQRELQASSDAAALA 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 GA + + + ++ + + + T I + Sbjct: 61 GAEIIPTATTAAAVYAKATA------------------YSSTTGAANVYKNMTNITMVSG 102 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + S + YN + + ++ + + +S + Sbjct: 103 YPILKCLSTMQTQGISCVGPLSYNSIQVMQQAVVPLYFARIIGR--SSMTISATSTAAKG 160 Query: 184 EMGERPIFLIELVVDLSGS 202 RP + + LV+D + S Sbjct: 161 GASSRP-YNVALVLDTTYS 178 >gi|114765751|ref|ZP_01444846.1| hypothetical protein 1100011001350_R2601_23570 [Pelagibaca bermudensis HTCC2601] gi|114541858|gb|EAU44894.1| hypothetical protein R2601_23570 [Roseovarius sp. HTCC2601] Length = 493 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 15/129 (11%), Positives = 47/129 (36%), Gaps = 9/129 (6%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + E + + A+ + L+ + L +L + ++ +++ + A L A Sbjct: 1 MLRDESGSVTAATAVLLPGILIGMAMLFDLLWLNNHRSHLQAQADMAALEAA-------R 53 Query: 74 RLGDRFESISNHAKRALIDDAKR--FIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 G+R ++ ++D+ R + + E F + + + + +T Sbjct: 54 YTGERPSAVRQARVSVAVNDSFRAERLASRQIELGRWQDGSFSDMDASDPRRPNAARVTV 113 Query: 132 MANNRLDSS 140 + + + + Sbjct: 114 RSEAKTNLT 122 >gi|91205150|ref|YP_537505.1| hypothetical protein RBE_0335 [Rickettsia bellii RML369-C] gi|91068694|gb|ABE04416.1| unknown [Rickettsia bellii RML369-C] Length = 516 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 59/383 (15%), Positives = 111/383 (28%), Gaps = 66/383 (17%) Query: 47 HYKKNSMESANNAA---ILAGASKMVSNLSRLGDRFESISNHAKRAL-IDDAKRFIKNHI 102 KN ++ A +L K +LS++ E I+ K D + I Sbjct: 129 LSFKNPIKKDIKEATLKLLQATIKDEKSLSKILVNEEDITPFQKAIYHPTDFSQLITQIS 188 Query: 103 KESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 + + + N I V T + +S+ + +D Y L Sbjct: 189 SNEENSLNFIMNNGAIAQSVQVY----TADGKAPIIASDLKDGFIIDKQYLLKYLLPIFN 244 Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERP-IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + + F P ++ GE I L++D+SGSM + Sbjct: 245 GFIWNEEGKFPIMFAPKNPKVLDGENNYAHNISLLIDISGSMEKDFSVY----------- 293 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQY 279 KN +L LD + + + + ++ + E ++ Y Sbjct: 294 ----------KNNILKILDKLAEIPNW----QINIVVFNDESTARSFSNQENNIEDIKVY 339 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + T +K+A + S I+F TDG Sbjct: 340 INNLKAN---GYTKLYGTIKEALESFKGKIDESSTL-----------------IVF-TDG 378 Query: 340 ENNNFKSNVNTIKICD----KAKENFIKIVTISINASPNGQRLLKTCVSSPE----YHYN 391 ++ SNV + D K + T+ N Q + + + Sbjct: 379 KDEGTNSNVTEKDVVDVTSEVIKNPQFNMYTVGFGQYYN-QEFFEQVATRGGFTHVSLND 437 Query: 392 VVNADSLIHVFQNISQLMVHRKY 414 L NI Q ++ + Sbjct: 438 PTGMHQLQQYIDNIEQKVMTFEI 460 >gi|256419952|ref|YP_003120605.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256034860|gb|ACU58404.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 345 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 73/225 (32%), Gaps = 45/225 (20%) Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 + + +F+ + + ++ + R+E R + + +D+S SM Sbjct: 52 SRRRFTFRFLLIFIAFLFGAIGLANLQKGSRMEKITRKGVDVVIALDVSKSMLAG----- 106 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 D K ++ K + D +D +GL+ + + + Sbjct: 107 ----------DVKPDRLTRAKQLISKLADKLDND-------RVGLVVFAGNAYLQMPLTI 149 Query: 272 GTEKVRQYVTR-DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + Y+T D + + T A++ A + K+R Sbjct: 150 DYSAAKMYLTTVSPDMIPTQGTAIGQAIQVA-NDAFNKKERKH----------------- 191 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 K +I ++DGE + I A E+ + I TI I + Sbjct: 192 KSLIIISDGE----DHDEAAISKARAAFEDGVVINTIGIGSPTGS 232 >gi|69244153|ref|ZP_00602689.1| von Willebrand factor, type A:Cna B-type [Enterococcus faecium DO] gi|293560613|ref|ZP_06677101.1| von Willebrand factor type A domain protein [Enterococcus faecium E1162] gi|294621687|ref|ZP_06700851.1| von Willebrand factor type A domain protein [Enterococcus faecium U0317] gi|314940439|ref|ZP_07847593.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a04] gi|314943384|ref|ZP_07850154.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133C] gi|314953415|ref|ZP_07856334.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133A] gi|314993087|ref|ZP_07858476.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133B] gi|314997388|ref|ZP_07862342.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a01] gi|68196610|gb|EAN11036.1| von Willebrand factor, type A:Cna B-type [Enterococcus faecium DO] gi|209491032|gb|ACI49667.1| putative pilus tip protein [Enterococcus faecium] gi|291598696|gb|EFF29749.1| von Willebrand factor type A domain protein [Enterococcus faecium U0317] gi|291605436|gb|EFF34882.1| von Willebrand factor type A domain protein [Enterococcus faecium E1162] gi|313588546|gb|EFR67391.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a01] gi|313592412|gb|EFR71257.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133B] gi|313594552|gb|EFR73397.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133A] gi|313597919|gb|EFR76764.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133C] gi|313640359|gb|EFS04940.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a04] Length = 1129 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 53/152 (34%), Gaps = 41/152 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM ++ +K + F+D++ S + + Sbjct: 291 LDLVLVVDWSGSM-------------------NDNNRIGEVKIGVDRFVDTLAD-SGITD 330 Query: 251 DVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + MG +GY++ + + V+ V S T + A++ A +L+ Sbjct: 331 KINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSV 390 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 391 PNGH------------------KKVIVLLTDG 404 >gi|313238340|emb|CBY13422.1| unnamed protein product [Oikopleura dioica] Length = 345 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 17/104 (16%), Positives = 35/104 (33%), Gaps = 11/104 (10%) Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + +Q + K I+ +TDG+ +K K++ I I Sbjct: 22 IQGKTNTGGALERAQQMLAEGRPSVPKIILLITDGD---ATDKERLDAQIEKLKKSNILI 78 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 TI + + L + ++ Y + F +IS++ Sbjct: 79 YTIGV-GDLIDRNELNRIATDEDFVYETRD-------FDSISKI 114 >gi|300783401|ref|YP_003763692.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] gi|299792915|gb|ADJ43290.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] Length = 535 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 36/221 (16%), Positives = 68/221 (30%), Gaps = 48/221 (21%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM A ++ +L++AL+ + L+ Sbjct: 348 VLDTSGSMAGA--------------------RIDSLRSALVGLTGADTSLTGRFRRFRSR 387 Query: 256 LIG----YTTRVEKNIEPS----WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + T + ++ Q T + T ++ +AYQ+L Sbjct: 388 EEVTMLPFNTGPGAPRTFTVPEENPAAELAQIKTFAEGLVARGGTAIYDSLSRAYQVLEP 447 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN---TIKICDKAKENFIKI 364 I+ +TDGEN N S + ++ A + + + Sbjct: 448 LMAADPDRFTS--------------IVLMTDGENANGSSLPDFLTSLASLPPAMKQ-VPV 492 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 T+ + L + + ++ N L VFQ I Sbjct: 493 FTVLF-GEGSSDELTQVATRTGGKVFDARNV-QLSRVFQEI 531 >gi|319784283|ref|YP_004143759.1| hypothetical protein Mesci_4600 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170171|gb|ADV13709.1| hypothetical protein Mesci_4600 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 407 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 15/98 (15%), Positives = 44/98 (44%), Gaps = 7/98 (7%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 F K +SE+ NF+I A +++ ++ + + ++ + + ++++ +AA LA +K Sbjct: 2 FALKAFWSSERGNFAITTAFAMLPIMIGLAGAVDLIGTSHDASQLQNSLDAAGLAIGTKF 61 Query: 69 VSNLSRLGDRF-------ESISNHAKRALIDDAKRFIK 99 +++ + ++S ++ + F Sbjct: 62 SPDMAAGDVQQLGLQFFAANMSAADQQEYLGSVSAFAA 99 >gi|315223476|ref|ZP_07865333.1| aerotolerance-related exported protein BatB [Capnocytophaga ochracea F0287] gi|314946649|gb|EFS98640.1| aerotolerance-related exported protein BatB [Capnocytophaga ochracea F0287] Length = 347 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 57/189 (30%), Gaps = 45/189 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +IE +R I +D+S SM D ++ K + Sbjct: 81 KIETVKREGVDIVFAIDVSKSMLAE---------------DVAPNRLEKAKRIAFETISQ 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQ 300 + +G++ Y + + + ++ D L + T A++ Sbjct: 126 LK-------GDRVGIVAYAASAYPQLALTTDHSAAKMFLQGMNTDMLSSQGTAIQEAIRM 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + P + + LTDGE + +I +A+E Sbjct: 179 ASNYF------------------DENTPTARLLFILTDGE----DHEMGATEIATEAQEK 216 Query: 361 FIKIVTISI 369 + I TI I Sbjct: 217 GVHIYTIGI 225 >gi|325963511|ref|YP_004241417.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] gi|323469598|gb|ADX73283.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] Length = 622 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 40/241 (16%), Positives = 80/241 (33%), Gaps = 47/241 (19%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + + L R ++D+S SM Q+ TK+ K+ Sbjct: 416 AAALTGLQDSFPEVRKRARALFLLDVSESM----------------VQEPGLTKLQRAKD 459 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDS----LI 288 A+L LD +GL ++ + + G V + T D Sbjct: 460 AVLKALDHFTAEDE------IGLAAFS--QVGDGPLTPGVVSPVAPFKTNKEDLIAKLNE 511 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF---K 345 LK D+TP + A D+ + + NF I+ L+DG+N+ Sbjct: 512 LKAVDATPLFE-AVSRFAGDQAKEYKDNFINA------------IVLLSDGKNDTTHPGD 558 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQN 404 + ++ + + + T++ L+ +S ++Y+ + + L V Sbjct: 559 LGGLSEQLGHQNHSTPVLVFTLAY-GPDADVPTLREIARASGAHYYDATDPNRLEEVLGE 617 Query: 405 I 405 + Sbjct: 618 L 618 >gi|221109528|ref|XP_002169888.1| PREDICTED: similar to collagen type VI alpha 6 [Hydra magnipapillata] Length = 366 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 61/193 (31%), Gaps = 37/193 (19%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 E I + ++D SGS+ + E + + D K Sbjct: 9 TTGKPSKESCSDAIVDVGFILDSSGSLRRDYKNLKEFLKTIASFFDIKI----------- 57 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 G+I ++ R E +I+ + ++ D L+ T Sbjct: 58 -------------NGSQAGVITFSHRSEHSIKLNDFSDGDSFEKAVDKIPLMGSTTRIDK 104 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ + ++ +++ L K +I LTDG + I D+ Sbjct: 105 ALRHSKNVMFTNQN-------------GGRLEATKLLILLTDGSQTFSAKQEDPSIIADE 151 Query: 357 AKENFIKIVTISI 369 + + + I+ I I Sbjct: 152 IRNDGVLIIAIGI 164 >gi|218709385|ref|YP_002417006.1| putative hemolysin-type calcium-binding region [Vibrio splendidus LGP32] gi|218322404|emb|CAV18557.1| putative hemolysin-type calcium-binding region [Vibrio splendidus LGP32] Length = 1883 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 35/195 (17%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + + ++L++D ++ + K+A +K A+ LD Sbjct: 1243 ISVSPETKSNTNVQLILD-----------------TSGSMSNSSNGKLAIMKAAVSKMLD 1285 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 H DV + LI + +R + +W T +Y+ + TD A+ Sbjct: 1286 QY----HDMGDVRVQLIDFNSRSTRLEFNGRAWMTVSEAKYLVNRL--TAGGGTDYDDAV 1339 Query: 299 KQAYQIL-TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 K+A Q + + N + DG ++ S+ K + Sbjct: 1340 KKARQSWDHDEHLQLDNANNVSYFISDGKPQ---------DGHDDATISDNEETKWANHL 1390 Query: 358 KENFIKIVTISINAS 372 N I +I IN+S Sbjct: 1391 ISNGITSQSIGINSS 1405 >gi|118353828|ref|XP_001010179.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291946|gb|EAR89934.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 511 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 32/220 (14%), Positives = 73/220 (33%), Gaps = 52/220 (23%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP + V+D SGSM K+ +K L L+ + Sbjct: 85 RPNLDLICVIDNSGSMEGE--------------------KIQNVKKTLEYLLELLGDND- 123 Query: 248 VKEDVYMGLIGYTTRVEKNIEP--SWGTEK--VRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + LI + ++ K + + K ++ + + + TD M+ A++ Sbjct: 124 -----RLCLILFNSKATKLCHLMRTNNSNKPAFKEIINKIEAN---GGTDINSGMELAFR 175 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK-AKENFI 362 +L K + ++ + L+DG+ + +++ + ++ + Sbjct: 176 VLKDRKYHNPVSS----------------VFLLSDGQ--DGSADLKVRQSLERHLPQECF 217 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I + + +G + K C Y V + + F Sbjct: 218 TIHSFGFGSDHDGPLMNKICSLKDGNFYYVEKINQVDEFF 257 >gi|325282943|ref|YP_004255484.1| von Willebrand factor type A [Deinococcus proteolyticus MRP] gi|324314752|gb|ADY25867.1| von Willebrand factor type A [Deinococcus proteolyticus MRP] Length = 535 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 65/220 (29%), Gaps = 50/220 (22%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL----SHVKED 251 V+D+SGSM ++ ALK AL + L + + Sbjct: 352 VLDVSGSMEGK--------------------RLEALKAALGNLSGADTSLGWRFAAFADR 391 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEK----VRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + LI ++ VE ++ T+ A+ +AY+ + Sbjct: 392 ERVTLIPFSGDVEAVRSFQVNKASRAADLQAIAAAGGALQAGGGTNIYGALSEAYRQAAA 451 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS--NVNTIKICDKAKENFIKIV 365 S+ + ++ +TDGE S A +K Sbjct: 452 APAGSYTS-----------------VVLMTDGEGTAGPSLNEFRDFYAALPAGARSVKTF 494 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 T+ + Q + + + ++ +L F+ I Sbjct: 495 TVLF-GDSDVQEMNEVAALTGGRTFDGQQ--NLAAAFKEI 531 >gi|311252837|ref|XP_003125292.1| PREDICTED: vitrin-like [Sus scrofa] Length = 595 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 50/182 (27%), Gaps = 25/182 (13%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + L + D +G + YT T K Sbjct: 429 RTVLQFVANLSKEFDISDTDTRVGAVQYTYEQRLEFGFDQYTTKPDVLNAIKRVGYWSGG 488 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ A + L K + +K +I +TDG +V Sbjct: 489 TSTGAAINYALEQLFKKSKPNK----------------RKLMILITDG---RSYDDVRIP 529 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 + A + I + Q L+ + P ++ + V D+L I Q + Sbjct: 530 AM--VAHHKGVITYAIGVA--WAAQEELEIIATHPARDHAFFVDEFDNLYKSVPKIIQNI 585 Query: 410 VH 411 Sbjct: 586 CT 587 >gi|198426873|ref|XP_002129255.1| PREDICTED: similar to Collagen alpha-1(XIV) chain [Ciona intestinalis] Length = 725 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 18/138 (13%), Positives = 43/138 (31%), Gaps = 15/138 (10%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +++L + +DV +G Y++ + + S Sbjct: 211 QDSLRFLASLTKRFTIGPDDVRVGFSVYSSTSTIHSHFNQHMNNSALEAEILGTSYTGGG 270 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +L + +GV + ++ +TDG++ + + Sbjct: 271 TSTGRAIN---DVLNNGFVERNGARPASEGVP-------RILVVMTDGQSGDSVKTPS-- 318 Query: 352 KICDKAKENFIKIVTISI 369 D K I + + I Sbjct: 319 ---DNVKAAGITVFGVGI 333 >gi|198274643|ref|ZP_03207175.1| hypothetical protein BACPLE_00795 [Bacteroides plebeius DSM 17135] gi|198272090|gb|EDY96359.1| hypothetical protein BACPLE_00795 [Bacteroides plebeius DSM 17135] Length = 339 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 66/195 (33%), Gaps = 45/195 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++E +R + +D+S SM QD +++ K + +++ Sbjct: 81 KMETVKRQGVETVVALDISNSMLA---------------QDVTPSRLEKSKKLISRLVET 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQ 300 + + V M I + + + + ++ SLI + TD A+ Sbjct: 126 FN-----NDKVAM--IVFAGEAFTQLPITSDYVSAKMFLETISPSLITTQGTDIRGAIDL 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + T P+ + I+ +TDGEN I+ +A + Sbjct: 179 AMKSFT------------------PNEGVGRAIVLITDGEN----HEGGAIEAAQEAAKK 216 Query: 361 FIKIVTISINASPNG 375 +++ + + + Sbjct: 217 GMRVFVLGVGSPDGS 231 >gi|94969532|ref|YP_591580.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551582|gb|ABF41506.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 430 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 26/129 (20%), Positives = 49/129 (37%), Gaps = 9/129 (6%) Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQ--GVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TDS +K+A + + + + + +K + +TDGE ++ N Sbjct: 241 TDSVAKLKEALEKYETRGGTALYDAVLASNAHLMKAPKLEKKVLFIVTDGE-DDASLNTL 299 Query: 350 TIKICDKAKENFIKIVTISINASPNG-----QRLLKTCV-SSPEYHYNVVNADSLIHVFQ 403 I +EN I TI I G QR L+ S+ + + D + + Q Sbjct: 300 EQTIRKVQQENGPTIYTIGILDETGGHKRRAQRALREMAESTGGVAFFPQSLDEVSRITQ 359 Query: 404 NISQLMVHR 412 I+ + ++ Sbjct: 360 QIAHDIRNQ 368 >gi|269105138|ref|ZP_06157832.1| protein TadG associated with Flp pilus assembly [Photobacterium damselae subsp. damselae CIP 102761] gi|268160588|gb|EEZ39087.1| protein TadG associated with Flp pilus assembly [Photobacterium damselae subsp. damselae CIP 102761] Length = 436 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 32/257 (12%), Positives = 75/257 (29%), Gaps = 31/257 (12%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 K +++ + SI+FA+ + + K +E A AA LA A+ Sbjct: 2 KLKKAQQGHASILFAIMIPVLFGIFTLASDGARAIQTKARIEDATEAASLAIAAH----- 56 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + + + +K A ++K +I + S S Y ++I S Sbjct: 57 NDPNVNSDGLGSGSK-VNRRIATDYLKAYITDIDSISSLKIYRRNCEDIPECSS------ 109 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + + F+ +V + Sbjct: 110 ----GLNKGKSRFFEYEVEALTTQN----SWFPGNNVISGFGDTFSTRGHSLARKYQSEA 161 Query: 193 IELVV--DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 +++V D S SM + + +++ ++I++ + Sbjct: 162 VDVVFAADFSKSMEEPWTGGRQKYKDLVRVINDVTSELEK--------FNNINIADKKNQ 213 Query: 251 DVYMGLIGYTTRVEKNI 267 + +G+ Y + Sbjct: 214 NT-IGISPYNSNTYSKF 229 >gi|260810653|ref|XP_002600071.1| hypothetical protein BRAFLDRAFT_79673 [Branchiostoma floridae] gi|229285356|gb|EEN56083.1| hypothetical protein BRAFLDRAFT_79673 [Branchiostoma floridae] Length = 1096 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 86/212 (40%), Gaps = 45/212 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+SGSM + +M L+ ++ ++ LS +++ Sbjct: 167 VVLVLDVSGSM-------------------TGQGRMERLRRSVSTYI-----LSTIEDGA 202 Query: 253 YMGLIGY--TTRVEKNIEPSWGTEKVRQYVTRDMDSLILK----PTDSTPAMKQAYQILT 306 ++G++ + T+ ++E G + + +D L + T+ T A+ A QIL Sbjct: 203 WLGIVTFRGTSHKICDLEQLNGDSVREEILNMTLDGLTNRTGKVGTNITRAVTLAVQILG 262 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 Q K+ + +I +TDG + ++V + D AK + I T Sbjct: 263 PAV----------QDRKLGDSTGPRQMILITDGRDRRLNNSVIFMLQNDTAK--GVVIDT 310 Query: 367 ISI-NASPNGQRLLKTCVSSPEYHYNVVNADS 397 I++ + + G LL + + ++D+ Sbjct: 311 IALGDGAEEGLPLLS--EVTGGQFFFSPDSDA 340 >gi|74011918|ref|XP_848765.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Canis familiaris] Length = 826 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 50/153 (32%), Gaps = 45/153 (29%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM K+ + AL+ LD + Sbjct: 270 VIFVIDKSGSMSGR--------------------KIQQTREALIKILDDLKPNDQFN--- 306 Query: 253 YMGLIGYTTRVEKNIEP-----SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 LI ++ V ++ ++Y + T+ AM A ++L S Sbjct: 307 ---LISFSGDVTHWKPLLVPASPENVDQAKRY---AANIEAHGGTNINDAMLTAVRLLQS 360 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 ++ ++ S+ II LTDG+ Sbjct: 361 ANQKELLSD--------GSVSL---IILLTDGD 382 >gi|118081959|ref|XP_417299.2| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 [Gallus gallus] Length = 955 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 41/233 (17%), Positives = 74/233 (31%), Gaps = 61/233 (26%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K AL L + H Sbjct: 312 VVFVLDSSASMVG--------------------TKLRQTKEALFTILQDLRPEDHFN--- 348 Query: 253 YMGLIGYTTRV-----EKNIEPS-WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +IG++ R+ ++ + + ++Y+ + T+ A++ ++L Sbjct: 349 ---IIGFSNRIKVWQQDRLVPVTPNNIRDAKKYI---HNMSPTGGTNINSALQTGAKLLN 402 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD-----GENNNFKSNVNTIKICDKAKENF 361 ++ IIFLTD GE + K NT A + Sbjct: 403 DYIAQNNI-----------DARSVSLIIFLTDGRPTVGETQSSKILSNTK----DAIRDK 447 Query: 362 IKIVTISINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVF-QNISQLM 409 + TI I + + L L+ C + +A S + F I + Sbjct: 448 FCLFTIGIGNDVDYKLLERMALENCGM-VRHFQEDEDAASHLKGFYDEIGTPL 499 >gi|27545301|ref|NP_775382.1| laminin subunit beta-1 [Danio rerio] gi|21538979|gb|AAM61767.1|AF468049_1 laminin beta 1 [Danio rerio] Length = 1785 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 52/169 (30%), Gaps = 16/169 (9%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS----GSMHCAMNSDPEDVNSAPICQD 222 + + I +L +P+ I+ +++ + M+ +N E ++ + Sbjct: 1217 INNVENSANSIRNILAQNPATQPLTEIQGLLEQATALMAEMNSNLNLTEETLSEISSDNN 1276 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 TK+ +LK ++ L E V + + V +Y + Sbjct: 1277 STDTKLKSLKEEAQKLEQTVKDLREQVEFVK------NSDIRGAR------ASVTRYYEQ 1324 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 ++ I +T Q T + N ++ F K Sbjct: 1325 SQNAEIRANASTTDPYNLVNQSATLRTETEELMNQTKEEFNQRQDEFSK 1373 >gi|332827795|gb|EGK00530.1| hypothetical protein HMPREF9455_03173 [Dysgonomonas gadei ATCC BAA-286] Length = 603 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 60/193 (31%), Gaps = 43/193 (22%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 ++V I+ P ++D+SGSM K+ Sbjct: 222 HRLVKIGLKAREIDTDNLPATNFVFLIDVSGSMDWD-------------------GKLDL 262 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 +K+++ L ++++ + V VY G G P K+ + + Sbjct: 263 VKSSMKLLVNNLRPIDRVAIVVYAGAAG----QVLPSTPGSEKSKILESLNGLTAG---G 315 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVN 349 T + AY + + N II TDG+ N SN Sbjct: 316 STAGGEGIVLAY-KIAKENLIEGGNNR---------------IILCTDGDFNVGVSSNDG 359 Query: 350 TIKICDKAKENFI 362 K+ + +++ + Sbjct: 360 LEKLIENERKSGV 372 >gi|301609302|ref|XP_002934202.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 904 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 79/235 (33%), Gaps = 53/235 (22%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 +P+ ++ +R ++ LV+D+SGSM ++ L A Sbjct: 288 PVPSFSLLQSSDR---VVTLVLDVSGSMASG-------------------GRIERLYQAA 325 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDMDSLILKPTDS 294 +FL I V+E Y+G++ ++T + E R+ + + + TD Sbjct: 326 EVFLMQI-----VEEGSYVGILTFSTSISLLSNLVQVIENTQRKQLKSLLPTNAFGGTDI 380 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 ++ N G + ++ LTDGE+N T K Sbjct: 381 CLGIR-----------EGIKINRQYDGSSYGTE-----LVLLTDGEDNYD-----TSKCF 419 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV---VNADSLIHVFQNIS 406 + I I++ + L + + V+ LI F +++ Sbjct: 420 PDITNSGAIIHVIAL-GPNAAKALETIVDMTEGLRFLATDKVDTQGLIEAFISLT 473 >gi|225418703|ref|ZP_03761892.1| hypothetical protein CLOSTASPAR_05927 [Clostridium asparagiforme DSM 15981] gi|225041758|gb|EEG52004.1| hypothetical protein CLOSTASPAR_05927 [Clostridium asparagiforme DSM 15981] Length = 1360 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 35/262 (13%), Positives = 87/262 (33%), Gaps = 50/262 (19%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL---IELVVDLSGSMHC 205 +T+Y +++ +++ + V A + E+ + + L D+SGSM Sbjct: 504 SEITAYVQIETPVDYSIDELKSHITVEDCGAQISEYNLEKVEYSSANMLLCCDVSGSMQG 563 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 + + A++ +S+ + +G+I + + V+ Sbjct: 564 RP--------------------IEDSRAAVISMAESM------SGNARLGVILFNSSVQG 597 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + + + +R + T+ + + K Sbjct: 598 LTDFTVQPDVIRSTAESMTAN---GGTNIFDTV----------------VHGLESFPKNG 638 Query: 326 SLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 ++ ++DG ENN + I AK+ I + + + + + L S Sbjct: 639 PEVLNTLVV-MSDGQENNAHSAEEIQTAIGQAAKDKSILVHCLGLGSEVDANYLQTIAQS 697 Query: 385 SPEYHYNVVNADSLIHVFQNIS 406 + + V ++ SL +QN++ Sbjct: 698 AGGTYQYVTDSSSLAVFYQNLA 719 >gi|307102442|gb|EFN50717.1| hypothetical protein CHLNCDRAFT_142575 [Chlorella variabilis] Length = 575 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 16/87 (18%), Positives = 34/87 (39%), Gaps = 4/87 (4%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED----VN 215 + Q ++ + RP + +++D+SGSM + +S D Sbjct: 94 PLLGAGGDNATQLYMAVGLDSGKASDFRRPRLNLLVLLDVSGSMGESFSSYYYDQLGQQV 153 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSI 242 + P +++R+KM K L + + Sbjct: 154 AQPSGSEEQRSKMDVAKEVLAGVVGKL 180 >gi|167646242|ref|YP_001683905.1| hypothetical protein Caul_2280 [Caulobacter sp. K31] gi|167348672|gb|ABZ71407.1| conserved hypothetical protein [Caulobacter sp. K31] Length = 249 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 49/101 (48%), Gaps = 9/101 (8%) Query: 279 YVTRDMDSLILK---PTDSTPAMK---QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK- 331 Y+ R + ++ TD + ++ Q + L + + F + F K P + +K Sbjct: 96 YIVRKVVGRVIGWNGGTDYSYVLERNLQHFGWLPASPEAGGFLSRFLGRPKTPEVRAKKR 155 Query: 332 -FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +IF+TDGEN+ +V T +I +++++ ++ + + A Sbjct: 156 SIVIFVTDGENS-TSDHVRTTRILEESEQRGDQVYFLFVGA 195 >gi|23465165|ref|NP_695768.1| hypothetical protein BL0580 [Bifidobacterium longum NCC2705] gi|23325787|gb|AAN24404.1| hypothetical protein BL0580 [Bifidobacterium longum NCC2705] Length = 383 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 69/208 (33%), Gaps = 38/208 (18%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VVD SGSM + +A + D+ + + DV + Sbjct: 208 VVDYSGSMSGEGKNGVVKGLNAALDPDQAKKSY----------------IEPASGDVNI- 250 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + T + ++ + GT + + TD + A L S+ + S +T Sbjct: 251 LIPFETEAHRPVKAT-GTS-TSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYT 308 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 I+ +TDG +N+ + K++ + I +I + Sbjct: 309 TA---------------IVLMTDGRSNSDHQDEFESAY--KSRGRDLPIFSIMFGDADPS 351 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 Q LK+ + + L VF+ Sbjct: 352 Q--LKSLATLSNAKVFDGRSGDLAAVFR 377 >gi|94969085|ref|YP_591133.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551135|gb|ABF41059.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 349 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 35/228 (15%), Positives = 72/228 (31%), Gaps = 54/228 (23%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I + +D SGS + + + K L D Sbjct: 123 LSIVIAIDASGSTKKDLKLET-----------------DSAKRFARDILRPQD------- 158 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + + ++ VE+ + T +R+ + R + +I S AM Y + K Sbjct: 159 --RLSVYAFSETVEEIVPF---TSDLRR-IDRGISEIIAG---SATAM---YDTIFLASK 206 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---FIKIVTI 367 + +K ++ +TDG + ++ E I +V + Sbjct: 207 ALMKHDG------------RKVMVLITDGGDTFSSTSYEQAARAATQSETLLYSIIVVPV 254 Query: 368 SINA---SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + +A + L++ + HY + SL F+ IS + + Sbjct: 255 ANSAGRDTGGEHALIQISQDTGGKHYYATDMGSLDVAFKQISDELRTQ 302 >gi|2159|emb|CAA78125.1| VLA-2 [Sus scrofa] Length = 191 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 60/190 (31%), Gaps = 30/190 (15%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 +A+ FL+ + L +GLI Y + + V + Sbjct: 2 DAVKNFLEKFVQGLDIGPTKTQVGLIQYANNPRVVFNL-NTFKTKAEMVEATSHTTQYGG 60 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T++ A++ A S K ++ +TDGE+++ Sbjct: 61 DLTNTFKAIQYARDSAYSAAA-------------GGRPGATKVMVVVTDGESHDGSMLKA 107 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNADSLI 399 I C+ + I I++ N L +K S P Y +NV + L+ Sbjct: 108 VIDQCN---NDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEADLL 164 Query: 400 HVFQNISQLM 409 + + + Sbjct: 165 EKAGTLGEQI 174 >gi|255011031|ref|ZP_05283157.1| putative outer membrane protein [Bacteroides fragilis 3_1_12] gi|313148836|ref|ZP_07811029.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313137603|gb|EFR54963.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 608 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 64/197 (32%), Gaps = 45/197 (22%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++V I P + ++D+SGSM + Sbjct: 224 NEQHRLVRIGLKAKEIPTENLPASNLIFLIDVSGSM-------------------YGPER 264 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K++L L ++++ V VY G G + + +Q + +D L Sbjct: 265 LDLVKSSLKLLVNNLRDKDKVAIVVYSGAAG--------EKLASTPGSDKQKIREAIDEL 316 Query: 288 ILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFK 345 T +K AY + + N II TDG+ N Sbjct: 317 EAGGSTAGGEGIKLAY-KIARKNFITGGNNR---------------IILCTDGDFNMGVS 360 Query: 346 SNVNTIKICDKAKENFI 362 S+ K+ ++ +++ + Sbjct: 361 SDQELKKLIEQKRKSGV 377 >gi|119575268|gb|EAW54873.1| integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor), isoform CRA_b [Homo sapiens] Length = 1179 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 64/191 (33%), Gaps = 32/191 (16%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILK 290 +A+ FL+ + L +GLI Y T K ++ + + Sbjct: 189 DAVKNFLEKFVQGLDIGPTKTQVGLIQYANNPRVVFNL--NTYKTKEEMIVATSQTSQYG 246 Query: 291 P--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T++ A++ A + S K ++ +TDGE+++ Sbjct: 247 GDLTNTFGAIQYARKYAYSAA-------------SGGRRSATKVMVVVTDGESHDGSMLK 293 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNADSL 398 I C+ + I I++ N L +K S P Y +NV + +L Sbjct: 294 AVIDQCN---HDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAAL 350 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 351 LEKAGTLGEQI 361 >gi|124942|sp|P17301|ITA2_HUMAN RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|33907|emb|CAA34894.1| unnamed protein product [Homo sapiens] Length = 1181 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 64/191 (33%), Gaps = 32/191 (16%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILK 290 +A+ FL+ + L +GLI Y T K ++ + + Sbjct: 189 DAVKNFLEKFVQGLDIGPTKTQVGLIQYANNPRVVFNL--NTYKTKEEMIVATSQTSQYG 246 Query: 291 P--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T++ A++ A + S K ++ +TDGE+++ Sbjct: 247 GDLTNTFGAIQYARKYAYSAA-------------SGGRRSATKVMVVVTDGESHDGSMLK 293 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNADSL 398 I C+ + I I++ N L +K S P Y +NV + +L Sbjct: 294 AVIDQCN---HDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAAL 350 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 351 LEKAGTLGEQI 361 >gi|210135186|ref|YP_002301625.1| phage/colicin/tellurite resistance cluster protein TerY [Helicobacter pylori P12] gi|210133154|gb|ACJ08145.1| phage/colicin/tellurite resistance cluster protein TerY [Helicobacter pylori P12] Length = 214 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 31/234 (13%), Positives = 71/234 (30%), Gaps = 43/234 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E + L++D SGSM ++ T++ L + ++++ + Sbjct: 11 EERFIPVFLLLDTSGSMSHSLG---------------NGTRIGVLNLCIQKMIETLKQEA 55 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + M ++ + P + V T A K A ++ Sbjct: 56 KKELFSKMAIVTFGENGVNLHTPFDDIKNV-----NFEPLSASGGTPLDQAFKLAKDLIE 110 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 P+ ++ + I ++DGE NN K + + Sbjct: 111 DKD-------------TFPTKFYKPYSILVSDGEPNNDKWQEPLFNFHHDGRSAKSVCWS 157 Query: 367 ISI----NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I I + K + + + + L+ +F+ ++Q + S+ Sbjct: 158 IFIGDREVNPQVNKDFGK------DGVFYADDVEKLVGLFEIMTQTISKGSASI 205 >gi|145493674|ref|XP_001432832.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124399947|emb|CAK65435.1| unnamed protein product [Paramecium tetraurelia] Length = 618 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 58/187 (31%), Gaps = 42/187 (22%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 F+ + I+ ++P F +++D SGSM A Sbjct: 422 FVENQVEIQAAQQPSFHYIILLDDSGSMSG-----------------------DRFNQAQ 458 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + S+ + + + +I + ++ ++ + ++ T Sbjct: 459 NGLISSLSSAKDNQN-IRVTIIIFNDNARCVVD----SQTINMQTIKNAVVCNGGGTSFQ 513 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A + AYQ + + K F + I F TDG ++ +N Sbjct: 514 SAFQLAYQKIAAVKNFEQFN--------------KHVIFFYTDGGDSYPTQALNQFANLP 559 Query: 356 KAKENFI 362 +A+ I Sbjct: 560 QAQRMKI 566 >gi|114600323|ref|XP_526928.2| PREDICTED: integrin alpha-2 [Pan troglodytes] Length = 1181 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 64/191 (33%), Gaps = 32/191 (16%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILK 290 +A+ FL+ + L +GLI Y T K ++ + + Sbjct: 189 DAVKNFLEKFVQGLDIGPTKTQVGLIQYANNPRVVFNL--NTYKTKEEMIVATSQTSQYG 246 Query: 291 P--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T++ A++ A + S K ++ +TDGE+++ Sbjct: 247 GDLTNTFGAIQYARKYAYSAA-------------SGGRRSATKVMVVVTDGESHDGSMLK 293 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNADSL 398 I C+ + I I++ N L +K S P Y +NV + +L Sbjct: 294 AVIDQCN---HDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAAL 350 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 351 LEKAGTLGEQI 361 >gi|109077204|ref|XP_001095246.1| PREDICTED: integrin alpha-2 [Macaca mulatta] Length = 1180 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 64/191 (33%), Gaps = 32/191 (16%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILK 290 +A+ FL+ + L +GLI Y T K ++ + + Sbjct: 189 DAVKNFLEKFVQGLDIGPTKTQVGLIQYANNPRVVFNL--NTYKTKEEMIVATSQTSQHG 246 Query: 291 P--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T++ A++ A + S K ++ +TDGE+++ Sbjct: 247 GDLTNTFGAIQYARKYAYSAA-------------SGGRRSATKVMVVVTDGESHDGSMLK 293 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNADSL 398 I C+ + I I++ N L +K S P Y +NV + +L Sbjct: 294 AVIDQCN---HDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAAL 350 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 351 LEKAGTLGEQI 361 >gi|116295258|ref|NP_002194.2| integrin alpha-2 precursor [Homo sapiens] gi|21105795|gb|AAM34795.1|AF512556_1 integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [Homo sapiens] gi|119575267|gb|EAW54872.1| integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor), isoform CRA_a [Homo sapiens] gi|151556518|gb|AAI48597.1| Integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [synthetic construct] gi|162319056|gb|AAI56716.1| Integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [synthetic construct] gi|168278403|dbj|BAG11081.1| integrin alpha-2 precursor [synthetic construct] Length = 1181 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 64/191 (33%), Gaps = 32/191 (16%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILK 290 +A+ FL+ + L +GLI Y T K ++ + + Sbjct: 189 DAVKNFLEKFVQGLDIGPTKTQVGLIQYANNPRVVFNL--NTYKTKEEMIVATSQTSQYG 246 Query: 291 P--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T++ A++ A + S K ++ +TDGE+++ Sbjct: 247 GDLTNTFGAIQYARKYAYSAA-------------SGGRRSATKVMVVVTDGESHDGSMLK 293 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNADSL 398 I C+ + I I++ N L +K S P Y +NV + +L Sbjct: 294 AVIDQCN---HDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAAL 350 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 351 LEKAGTLGEQI 361 >gi|256958585|ref|ZP_05562756.1| von Willebrand factor [Enterococcus faecalis DS5] gi|256949081|gb|EEU65713.1| von Willebrand factor [Enterococcus faecalis DS5] Length = 666 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 39/150 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM + ++ ++ + F+D++ S + Sbjct: 319 LDLVLVVDWSGSM-------------------NENNRIGEVQKGVNRFVDTLAD-SGITN 358 Query: 251 DVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ MG +GY++ N G + V+ + S T + A++ A +L + Sbjct: 359 NINMGYVGYSSDGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPN 418 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 419 GH------------------KKVIVLLTDG 430 >gi|254692857|ref|NP_081450.1| collagen, type XXII, alpha 1 [Mus musculus] Length = 1613 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 52/159 (32%), Gaps = 19/159 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +D +G++ Y+ R E + + T++ A++ Sbjct: 74 VDTFEVGPGHTRVGVVRYSDRPTTAFELGHFNSREEVKAAARRITYHGGNTNTGDALRYI 133 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 S + G + + F++ I LTDG + + + Sbjct: 134 TSRSFSAQA----------GGRPGNRAFKQVAILLTDGRSQDLVLDAAAAAH-----AAG 178 Query: 362 IKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 I+I + + A + L S P + ++V + +++ Sbjct: 179 IRIFAVGVGA--ALKEELDEIASEPKSAHVFHVSDFNAI 215 >gi|332254890|ref|XP_003276566.1| PREDICTED: integrin alpha-2 [Nomascus leucogenys] Length = 1181 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 64/191 (33%), Gaps = 32/191 (16%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILK 290 +A+ FL+ + L +GLI Y T K ++ + + Sbjct: 189 DAVKNFLEKFVQGLDVGPTKTQVGLIQYANNPRVVFNL--NTYKTKEEMIVATSQTSQYG 246 Query: 291 P--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T++ A++ A + S K ++ +TDGE+++ Sbjct: 247 GDLTNTFGAIQYARKYAYSAA-------------SGGRRSATKVMVVVTDGESHDGSMLK 293 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNADSL 398 I C+ + I I++ N L +K S P Y +NV + +L Sbjct: 294 AVIDQCN---HDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAAL 350 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 351 LEKAGTLGEQI 361 >gi|282864727|ref|ZP_06273782.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282560666|gb|EFB66213.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 424 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 44/213 (20%), Positives = 77/213 (36%), Gaps = 42/213 (19%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 +ELV+D+SGSM +++M+A K A D+L V E+V Sbjct: 41 VELVLDVSGSM--------------RTRDIDGQSRMSAAKQAFN------DVLDAVPEEV 80 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G+ + G + +Q + P D T A K A L Sbjct: 81 QLGIRTLGANYPGDDRKV-GCKDTKQ-------LYPVGPLDRTEA-KTAVATLAPTGWTP 131 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE---NFI--KIVTI 367 + + I+ +TDGE+ + C+ A++ I I T+ Sbjct: 132 IGPALLGAADDLDGGDATRRIVLITDGEDTCGPLDP-----CEVARDIAARGIHLVIDTL 186 Query: 368 SINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 + + ++ L TC++ + V +AD L Sbjct: 187 GLVPNAKIRQQL-TCIAEATGGTYTAVQHADEL 218 >gi|323342275|ref|ZP_08082507.1| hypothetical protein HMPREF0357_10688 [Erysipelothrix rhusiopathiae ATCC 19414] gi|322463387|gb|EFY08581.1| hypothetical protein HMPREF0357_10688 [Erysipelothrix rhusiopathiae ATCC 19414] Length = 1466 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 15/109 (13%), Positives = 39/109 (35%), Gaps = 14/109 (12%) Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Y + ++ + + + Q + ++++ + +SN + Sbjct: 269 YNAMVYGTGNEYYLDELGELRSQGN---QNYFVYMSSADAAIIESNQ--------IHQEQ 317 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS-LIHVFQNISQLM 409 I + I + G +LK S +Y+ ++ L +F+ IS + Sbjct: 318 IHLYAIGFDTDARGTDILKR--ISNNNYYDASSSRDNLDDIFKKISNNI 364 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 63/203 (31%), Gaps = 24/203 (11%) Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRL-QFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 ++ N+ T +D M + R+ + + + N +I + L+ E + Sbjct: 35 VSAEGNSSSSEKTITNSIQIDNMNEGEVRVFKTAKPIPNSINRWEISIDVFGRLKREPSD 94 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 I LV+D SGSM K + + A + ++ + Sbjct: 95 -----IVLVLDTSGSMDPQ----------------KNPQGIDRISKAKREAIHFVNEIFE 133 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + L+ Y T+V N + ++ + T + A+ +A +L Sbjct: 134 RDASARVALVSYGTKVSSNSFHT--KQESNLLINEIKSLKAEGGTFTQGALYEAKMLLNQ 191 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQ 330 + G P + Sbjct: 192 SSAPNKTIVLLSDGQPTYRYPLK 214 >gi|258616219|ref|ZP_05713989.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium DO] Length = 1095 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 53/152 (34%), Gaps = 41/152 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM ++ +K + F+D++ S + + Sbjct: 291 LDLVLVVDWSGSM-------------------NDNNRIGEVKIGVDRFVDTLAD-SGITD 330 Query: 251 DVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + MG +GY++ + + V+ V S T + A++ A +L+ Sbjct: 331 KINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSV 390 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 391 PNGH------------------KKVIVLLTDG 404 >gi|297159760|gb|ADI09472.1| hypothetical protein SBI_06352 [Streptomyces bingchenggensis BCW-1] Length = 545 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 51/173 (29%), Gaps = 42/173 (24%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +V+D SGSM TK+ A + A ++ +S +++ Sbjct: 84 IVIDCSGSMTW------------------PPTKIMAARKA------TVAAVSALRKGTRF 119 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 ++ T R E P+ G + K S AMK A Sbjct: 120 AVVQGTERAEVVYPPTGGMA---------VAGPDTKAAASHAAMKLA----ALGGTAIGT 166 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-----VNTIKICDKAKENFI 362 + + + + LTDG+N + ++ C I Sbjct: 167 WLDLARRLHAEQPTALRHTLLLTDGKNEHEDPGDLARVLDACAPCFTCDARGI 219 >gi|294055639|ref|YP_003549297.1| ATP synthase F1, alpha subunit [Coraliomargarita akajimensis DSM 45221] gi|293614972|gb|ADE55127.1| ATP synthase F1, alpha subunit [Coraliomargarita akajimensis DSM 45221] Length = 514 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 43/134 (32%), Gaps = 13/134 (9%) Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 +T A + + K + + P + + + N N Sbjct: 165 GDRATGKTTIAIDTIINQAKINNQHRKEDGSFEEGFRPVYSIYVAV-------GQKNSNI 217 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCV----SSPEYHYNVVNADSLIHVFQNIS 406 + + ++ TI + A + +S + D+LI V+ ++S Sbjct: 218 ARTINVLEKAGAMEYTIIVMAPAADNPANQYIAPFSGASMGEFFMNNGMDALI-VYDDLS 276 Query: 407 QLMVHRK-YSVILK 419 + V + S+ILK Sbjct: 277 KQAVAYRQISLILK 290 >gi|125973851|ref|YP_001037761.1| hypothetical protein Cthe_1336 [Clostridium thermocellum ATCC 27405] gi|256004304|ref|ZP_05429286.1| hypothetical protein ClothDRAFT_1147 [Clostridium thermocellum DSM 2360] gi|281418012|ref|ZP_06249032.1| hypothetical protein Cther_2696 [Clostridium thermocellum JW20] gi|125714076|gb|ABN52568.1| hypothetical protein Cthe_1336 [Clostridium thermocellum ATCC 27405] gi|255991738|gb|EEU01838.1| hypothetical protein ClothDRAFT_1147 [Clostridium thermocellum DSM 2360] gi|281409414|gb|EFB39672.1| hypothetical protein Cther_2696 [Clostridium thermocellum JW20] gi|316939952|gb|ADU73986.1| hypothetical protein Clo1313_0918 [Clostridium thermocellum DSM 1313] Length = 233 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 14/72 (19%), Positives = 33/72 (45%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 L + K + K +F++ A+ + + +L + LI V ++ Y+K+ +++ + Sbjct: 3 LKKSWRNCKGLVRDRKGSFTVEAAIIIPAVILTMFALILVSEFLYQKSCIQAIADRTAQR 62 Query: 64 GASKMVSNLSRL 75 GA S + Sbjct: 63 GAEIWNSPSKDM 74 >gi|226327519|ref|ZP_03803037.1| hypothetical protein PROPEN_01390 [Proteus penneri ATCC 35198] gi|225204045|gb|EEG86399.1| hypothetical protein PROPEN_01390 [Proteus penneri ATCC 35198] Length = 505 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 83/231 (35%), Gaps = 42/231 (18%) Query: 179 ALLRIEMGERP---IFLIELVVDLSGSMHCAMNSDPEDVNSAPIC------QDKKRTKMA 229 + ++ ERP + L+ D SGSM +MN D++ D + ++ Sbjct: 289 KIKQMCPAERPPELAPQVILIFDASGSMALSMNLTESDLDYIASTGQLFPGYDAEPRRIT 348 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 +NA I +++++ D+ + + + P++G + + + Sbjct: 349 TARNAA------IKIINNIPSDMKITTVVASDCGVVKSSPAYGGNERSKLLNYIKRIEPD 402 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T ++K+A ++ + + + I+ L+DG + + + Sbjct: 403 SGTPLAESIKRASNLIKGNNRDT-------------------IIVLLSDGLESCDQDPCS 443 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 + +A + I + I + G CV++ + NA+ + Sbjct: 444 AARTLKRAHPRAV-INVVDILGTGAG-----NCVANATGGKVFTARNANEV 488 >gi|312621090|ref|YP_003993818.1| protein tadg, associated with flp pilus assembly [Photobacterium damselae subsp. damselae] gi|311872811|emb|CBX86902.1| Protein TadG, associated with Flp pilus assembly [Photobacterium damselae subsp. damselae] Length = 436 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 32/257 (12%), Positives = 75/257 (29%), Gaps = 31/257 (12%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 K +++ + SI+FA+ + + K +E A AA LA A+ Sbjct: 2 KLKKAQQGHASILFAIMIPVLFGIFTLASDGARAIQTKARIEDATEAASLAIAAH----- 56 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + + + +K A ++K +I + S S Y ++I S Sbjct: 57 NDPNVNSDGLGSGSK-VNRRIATDYLKAYITDIDSISSLKIYRRNCEDIPECSS------ 109 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + + F+ +V + Sbjct: 110 ----GLNKGKSRFFEYEVEALTTQN----SWFPGNNVISGFGDTFSTRGHSLARKYQSEA 161 Query: 193 IELVV--DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 +++V D S SM + + +++ ++I++ + Sbjct: 162 VDVVFAADFSKSMEEPWTGGRQKYKDLVRVINDVTSELEK--------FNNINIADKKNQ 213 Query: 251 DVYMGLIGYTTRVEKNI 267 + +G+ Y + Sbjct: 214 NT-IGISPYNSNTYSKF 229 >gi|298372685|ref|ZP_06982675.1| BatB protein [Bacteroidetes oral taxon 274 str. F0058] gi|298275589|gb|EFI17140.1| BatB protein [Bacteroidetes oral taxon 274 str. F0058] Length = 345 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 72/226 (31%), Gaps = 44/226 (19%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 ++ Y ++F+ +L ++ L + E ++ LV+D+S SM Sbjct: 50 VSGYRPHVKFVLLMLAVALMIIALARPQMLKKQENVQKRGIEAMLVLDISNSMMA----- 104 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 QD +++ K L +D + D MGLI + + + Sbjct: 105 ----------QDIAPSRLDYAKMLLSQLIDRL-------TDDKMGLIVFAGDAFIQMPIT 147 Query: 271 WGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + ++ LI + T A+ A + K+ Sbjct: 148 SDKVSAKMFLKTIQPDLIQRQGTAIGSAIDLAVKSFNDTKQSGGRA-------------- 193 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 I LTD EN+ + ++ A++ I + + I Sbjct: 194 ---IFLLTDAENHEDNA----VEAAKMARDKNITVNVVGIGTPEGS 232 >gi|283769330|ref|ZP_06342229.1| hypothetical protein HMPREF9013_0305 [Bulleidia extructa W1219] gi|283103987|gb|EFC05371.1| hypothetical protein HMPREF9013_0305 [Bulleidia extructa W1219] Length = 209 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 15/44 (34%), Positives = 23/44 (52%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSM 53 +K I EK ++ +IFAL + L +I + V + KKN M Sbjct: 4 NIRKWIKEEKGSYIVIFALFLTVLLGMISLAVDVGMMYLKKNRM 47 >gi|254304217|ref|ZP_04971575.1| von Willebrand factor domain protein [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] gi|148324409|gb|EDK89659.1| von Willebrand factor domain protein [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] Length = 529 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 48/304 (15%), Positives = 101/304 (33%), Gaps = 34/304 (11%) Query: 118 IQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY-RLQFIEHLLNQRYNQKIVSF 176 +N+ + ++ + + +F + + YDY + I+ + + + Sbjct: 106 KENVQRELKKIEPALSEDASEEEIQHLFKQLLYIAGYDYTPFETIDRFSYVIFKNDMENP 165 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 ++ E +E+V+D SGSM + +T M K ++ Sbjct: 166 FT---HEKIEENMNVNVEIVLDASGSMVKKIG---------------DKTMMEIAKESIK 207 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 L + V G+ + + + + + + D + + Sbjct: 208 QVLSEM----PANAKV--GIRVFGHKGDNTASKKDESCGANELIYPIGDLNVEGIEKALE 261 Query: 297 AMK-QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 ++ + + + G K ++ +II TDG + V K Sbjct: 262 PIQPTGWTSIAKSIEYGVEDLKALDGEKTLNIL---YII--TDGIETCGGNPVEIAKQL- 315 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY-NVVNADSLIHVFQNISQL-MVHRK 413 K + I + I N N RLLK + +Y +V +AD L I++L K Sbjct: 316 KGENTNIVLGIIGFNVDANQNRLLKQIADAAGGYYSSVNDADKLTGELYRINELAFSDYK 375 Query: 414 YSVI 417 + V+ Sbjct: 376 WEVL 379 >gi|116254826|ref|YP_770662.1| hypothetical protein pRL100386 [Rhizobium leguminosarum bv. viciae 3841] gi|115259474|emb|CAK10612.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 644 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 45/204 (22%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++ +++ I P + ++D+SGSM + K Sbjct: 253 NHDTELMHVAIKGYDIAPATTPHANLVFLIDVSGSMD-------------------EPDK 293 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + LK+A L ++ + V Y G G V + + + +D L Sbjct: 294 LPLLKSAFRLLVNRLKPDDTVSIVTYAGNAG---TVLTPTRVA-----EKSKILSAIDRL 345 Query: 288 ILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFK 345 T ++ AY + +QG + ++ TDG+ N Sbjct: 346 EAGGSTGGAEGIEAAY-------------DLAKQGFVKDGVNR---VMLATDGDFNVGPS 389 Query: 346 SNVNTIKICDKAKENFIKIVTISI 369 S+ + +I ++ +++ I + + Sbjct: 390 SDEDLKRIIEERRKDGIFLTVLGF 413 >gi|311105412|ref|YP_003978265.1| hemolysin-type calcium-binding repeat family protein 2 [Achromobacter xylosoxidans A8] gi|310760101|gb|ADP15550.1| hemolysin-type calcium-binding repeat family protein 2 [Achromobacter xylosoxidans A8] Length = 1396 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 57/154 (37%), Gaps = 30/154 (19%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + I L++DLS SM A +++P+ T + A K AL L+S L+ + Sbjct: 764 YNIALLLDLSYSMGWANSANPDGD-----------TALDAAKKALKHLLES--QLATHEG 810 Query: 251 DVYMGLIGYTTR----VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + LI + + + + + V + V+ + T A+ + + Sbjct: 811 TINVSLITFNDEDIRVQKSISDLT--PDNVDEIVSSLLHLETGPNTPYGAALHETKKWFD 868 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 N P++ FLTDGE Sbjct: 869 GQPTVDDHGN-----------PYKNLTYFLTDGE 891 >gi|146284562|ref|YP_001165515.1| von Willebrand factor, type A [Enterobacter sp. 638] gi|145320695|gb|ABP62841.1| von Willebrand factor, type A [Enterobacter sp. 638] Length = 651 Score = 38.3 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 23/202 (11%), Positives = 67/202 (33%), Gaps = 27/202 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + K A+ I+ +++ V GL+ Y + V+ + + YV + Sbjct: 238 IDRTKEAIDKIYKQIEK-EQLQDKVKFGLVAYRSSVKAVPGLEY---DAKMYVDPNTVKD 293 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD-----GENN 342 +KQA + + ++ + ++++ +TD G ++ Sbjct: 294 GKDFLAKVHDLKQATVSSSKVDEDAYAGVMTALDKVDWTQFGARYVVLITDAGALDGTDS 353 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH------------- 389 +++ ++ +A + + T+ + K S+ + Sbjct: 354 LSSTHLGAEQVRQEAAYRGVALYTLHLKTPDGK----KNHASAAAQYQELTLNPFLHKPL 409 Query: 390 YNVVNADSLIHVFQNISQLMVH 411 Y +++ ++ F I + + Sbjct: 410 YYPIDSGD-VNSFGTIVDSLSN 430 >gi|288576300|ref|ZP_05978557.2| pilus-associated protein [Neisseria mucosa ATCC 25996] gi|288565840|gb|EFC87400.1| pilus-associated protein [Neisseria mucosa ATCC 25996] Length = 1081 Score = 38.3 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 22/55 (40%), Gaps = 2/55 (3%) Query: 359 ENFIKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + ++ T+ S G+ L S ++++N + L + F I L+ Sbjct: 298 KQLVQTFTVGFGQGISDAGKAFLTRGASQDDWYFNADKPEDLENAFNKIISLIST 352 >gi|302381356|ref|YP_003817179.1| hypothetical protein Bresu_0241 [Brevundimonas subvibrioides ATCC 15264] gi|302191984|gb|ADK99555.1| hypothetical protein Bresu_0241 [Brevundimonas subvibrioides ATCC 15264] Length = 416 Score = 38.3 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 26/104 (25%), Positives = 43/104 (41%), Gaps = 13/104 (12%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+ L R R + E+ N ++IFALS + +L+ + + + ++ + A Sbjct: 1 MNWLDRCRRWLAAFGRDERGNIALIFALSTPAVVLISVGAVELGSVQSNRAKLQDIADTA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 LAGA+ E A I+ AK FI H+ E Sbjct: 61 ALAGAN-------------ELALAIDDAAAIERAKVFIDGHVSE 91 >gi|125975601|ref|YP_001039511.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|125715826|gb|ABN54318.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] Length = 1300 Score = 38.3 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 43/314 (13%), Positives = 96/314 (30%), Gaps = 66/314 (21%) Query: 112 VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 +FY E + + + N + ++ Y + + + I+ +L++ N Sbjct: 2 IFYYDEENDTIKFLETEVDEETNTIKTTVDHFSIYGVIDIVRFAQSW-GIKDILDKLLNP 60 Query: 172 --KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + + + + + I V+D +GSM +N+ ++ + + Sbjct: 61 GGETIPPVAEIGQA--------DIVFVIDTTGSMGSVINNVKNNITNFANTLMEN----- 107 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR----VEKNIEPSWGTEKVRQYVTRDMD 285 DV +GLI Y ++ W + V ++ + Sbjct: 108 -------------------NVDVRLGLIDYKDLEEDGMDSTKNLGW-FDNVSDFIASVNN 147 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD---GENN 342 D+ + A + KFI+ TD E+ Sbjct: 148 MRATGGGDAPESTVDALEE---------------ARRMDFRPGVNKFIMLFTDVSYKEST 192 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 F+ + + +K KE+ I + I + L + + Y ++ F Sbjct: 193 RFEDVQSMKTVIEKLKEDKIVVSAI---VPSGYESLYRNLYTETGGVYA-----NITQAF 244 Query: 403 QNISQLMVHRKYSV 416 + Q ++ SV Sbjct: 245 SSALQSLISNIASV 258 >gi|309789848|ref|ZP_07684427.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308228152|gb|EFO81801.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 420 Score = 38.3 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 77/222 (34%), Gaps = 51/222 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 R + V+D SGSM K K+ ++ A+ L +D +D Sbjct: 38 AQVRTPVNVSFVLDRSGSM--------------------KGDKIDRVRQAISLAVDRLDA 77 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMKQAY 302 L+ + R E I + T++ ++ V+R D+ T PA+++ Sbjct: 78 QDIAS------LVIFDHRNEVLIPAAPVTDRRMIKDRVSRIRDA---GGTKIAPAVEKGL 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + D+ + + ++ LTDG+ N + D A + Sbjct: 129 REIEKDRSGAI-----------------RRLVLLTDGQTENEDECLRRA---DDAGRIGV 168 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I + + N L++ S + AD + FQN Sbjct: 169 PITALGVGQDWNEDLLIEMANRSGGTADYIARADEITEYFQN 210 >gi|325845103|ref|ZP_08168414.1| von Willebrand factor type A domain protein [Turicibacter sp. HGF1] gi|325488845|gb|EGC91243.1| von Willebrand factor type A domain protein [Turicibacter sp. HGF1] Length = 315 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 67/201 (33%), Gaps = 68/201 (33%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I LV+D SGSM TK+ LKNA F+D + + ++K Sbjct: 142 IVLVLDGSGSMSG--------------------TKLTNLKNAAKDFIDRLKGVDNLK--- 178 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQY---------------------------VTRDMD 285 + ++ +++ N GT K++ +T + Sbjct: 179 -VAIVVFSSNATINPISVSGTTKIKSTDKSSESSIPNYKTLQNEYFLDINDSRLITMINN 237 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T++ +++A + L S + K II ++DG + Sbjct: 238 IDAQGGTNTGDGLRKA-EYLLS---------------QKGDSVANKTIILMSDGLPTYYS 281 Query: 346 SNVNT-IKICDKAKENFIKIV 365 + + + + K++ + I Sbjct: 282 GSTESGVNYYKEIKDDVVGIF 302 >gi|85701714|ref|NP_001028371.1| calcium activated chloride channel [Mus musculus] gi|74202052|dbj|BAE23018.1| unnamed protein product [Mus musculus] gi|148680069|gb|EDL12016.1| mCG120741 [Mus musculus] gi|187951335|gb|AAI39089.1| Expressed sequence AI747448 [Mus musculus] gi|187957592|gb|AAI39090.1| Expressed sequence AI747448 [Mus musculus] Length = 925 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 81/235 (34%), Gaps = 58/235 (24%) Query: 195 LVVDLSGSMHCA--MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 LV+D+SGSM + +N + + R+ + + + + V E + Sbjct: 311 LVLDVSGSMSSSDRLNRMNQAAKYFLSQIIENRSWVGMVHFSSQATI--------VHELI 362 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 M + +E+N + + + + T +K A+Q+ + + ++ Sbjct: 363 QM-----NSDIERNKLL------------QTLPTSAIGGTSICSGIKTAFQVFKNGEYQT 405 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINA 371 T I+ L+DGE++ T K C D+ K++ + I++ Sbjct: 406 DGTE----------------ILLLSDGEDS-------TAKDCIDEVKDSGSIVHFIAL-G 441 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADS---LIHVFQNIS---QLMVHRKYSVILKG 420 + + + H + LI F ++ + + + KG Sbjct: 442 PSADLAVTNMSILTGGNHKLATDEAQNNGLIDAFGALASENTDITQKSLQLESKG 496 >gi|327313514|ref|YP_004328951.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326944388|gb|AEA20273.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 331 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 57/176 (32%), Gaps = 45/176 (25%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + +D+S SM D +++ K + ++ + Sbjct: 95 IALDISNSMLAE---------------DVSPSRLEKSKLLVENLMNKFSED-------KI 132 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSF 313 GLI + + + + ++ SLI + TD A++ + T + K Sbjct: 133 GLIVFAGDAFVQLPITGDYVSAKMFLDNINPSLIGTQGTDIGKALQLSINSFTPNSKVG- 191 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 K II +TDGE+N + + A+ IK+ + I Sbjct: 192 -----------------KAIILITDGEDNEGGAEAMAKQ----ARNKGIKVFILGI 226 >gi|315634664|ref|ZP_07889948.1| PpkA protein [Aggregatibacter segnis ATCC 33393] gi|315476612|gb|EFU67360.1| PpkA protein [Aggregatibacter segnis ATCC 33393] Length = 657 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 72/211 (34%), Gaps = 30/211 (14%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++ + + I+ + K+ V GL+G+ + + +YV++ Sbjct: 246 INRTRDVIKQVSEQIEKENLGKQ-VKFGLVGFRSSTKAVPGL--------EYVSKMFVD- 295 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ------KFIIFLTD--- 338 D+ +++A ++ + F+ G+ ++I+ +TD Sbjct: 296 PSTVKDTQDFLQKAVELKQAKVSSKEFSEDSFAGISQALNEINWNNFGGRYIVLVTDAGA 355 Query: 339 --GENNNFKSNVNTIKICDKAKENFIKIVTISINASPN----GQRLLKTCVSSPEYH--- 389 G N + ++ ++ +A+ + I T+ + Q + S + Sbjct: 356 LDGNNPLSSTGMDAKQLRLEAQHRGVAIYTLHLKTPAGVKNHEQAKTQYSDLSFNNYLNK 415 Query: 390 --YNVVNADSLIHVFQNISQLMVHRKYSVIL 418 Y VNA + + + L V L Sbjct: 416 PLYYAVNAGDVNEFGEKVGALARALTSQVKL 446 >gi|254430946|ref|ZP_05044649.1| structural toxin protein RtxA [Cyanobium sp. PCC 7001] gi|197625399|gb|EDY37958.1| structural toxin protein RtxA [Cyanobium sp. PCC 7001] Length = 2003 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 41/259 (15%), Positives = 70/259 (27%), Gaps = 30/259 (11%) Query: 56 ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 + ILAGA V + L ++ N +L A + + Sbjct: 789 SIGGTILAGA---VGQVIYLAQDHSAVDNAVTASLASGANDRATLQV-----NADGTYTF 840 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 T I N + S T + D I L + + Sbjct: 841 TLIDNFLLSDPGDTTEQTESISSLVGGINILVEDGDGDAANGTTGIALSLLVKDDIPTAV 900 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 I +F LV+D+SGSM A D T+M N+ Sbjct: 901 PITESGESFPTGTNLF---LVIDVSGSMANASGVDGM-------------TRMQLQINSA 944 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L +D + L +K +V + + T +W + + + + T+ Sbjct: 945 LELIDQYEALGPLKVNV----VTFATDASAPFSTTWQDADAVKTFIQTL--VPTSRTNYD 998 Query: 296 PAMKQAYQILTSDKKRSFF 314 A+ Sbjct: 999 AALNLTINTFNGGTASDID 1017 >gi|301381604|ref|ZP_07230022.1| ppkA-related protein [Pseudomonas syringae pv. tomato Max13] gi|302059660|ref|ZP_07251201.1| ppkA-related protein [Pseudomonas syringae pv. tomato K40] gi|302131911|ref|ZP_07257901.1| ppkA-related protein [Pseudomonas syringae pv. tomato NCPPB 1108] Length = 663 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 67/193 (34%), Gaps = 23/193 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDS 286 + + A+ I + ++ V GL+ + + + + T+ +D Sbjct: 253 IERTREAIAKVYAQIAKENLGRQ-VKFGLVAFRSSTQAVPGLEYVTKMYADPNTVKDGAD 311 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD-----GEN 341 + K D +KQA S + S+ S ++++ +TD G++ Sbjct: 312 FLAKAAD----LKQAKVSSKSFNEDSYAGVMQAIDKVDWSPFGARYVVLITDAGALDGDD 367 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISI----------NASPNGQRLLKTCVSSPEYHYN 391 + +N ++ +A + I T+ + A Q L ++ +Y Sbjct: 368 KLSGTGLNAEQVRIEASNPGVAIYTLHLKTAAGAKDHAKAEAQYQALSTYTGTNTSLYYP 427 Query: 392 VVNADSLIHVFQN 404 V +A L F + Sbjct: 428 V-DAGDLN-AFGS 438 >gi|118443040|ref|YP_877686.1| von Willebrand factor type A domain-containing protein [Clostridium novyi NT] gi|118133496|gb|ABK60540.1| von Willebrand factor type A domain protein [Clostridium novyi NT] Length = 708 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 18/114 (15%), Positives = 44/114 (38%), Gaps = 24/114 (21%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ L+ A F++ + S+ K +GL+ Y + E + +++ + + Sbjct: 182 SKIDELQKAAKNFVNKFETKSNTK----IGLVSYGNKGEVVHSLTNELDRINSSIDYGLS 237 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T+ +++A +L + K+I+ +TDG Sbjct: 238 --VYGATNIGDGIRKANGLLNNGSNA------------------DKYIVLMTDG 271 >gi|326911082|ref|XP_003201891.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Meleagris gallopavo] Length = 951 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 41/233 (17%), Positives = 74/233 (31%), Gaps = 61/233 (26%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D S SM TK+ K AL L + H Sbjct: 308 VVFVLDSSASMVG--------------------TKLRQTKEALFTILQDLRPEDHFN--- 344 Query: 253 YMGLIGYTTRV-----EKNIEPS-WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +IG++ R+ ++ + + ++Y+ + T+ A++ ++L Sbjct: 345 ---IIGFSNRIKVWQQDRLVPVTPNNIRDAKKYI---HNMSPTGGTNINSALQTGAKLLN 398 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD-----GENNNFKSNVNTIKICDKAKENF 361 ++ IIFLTD GE + K NT A + Sbjct: 399 DYIAQNNI-----------DARSVSLIIFLTDGRPTVGETQSSKILSNTK----DAIRDK 443 Query: 362 IKIVTISINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVF-QNISQLM 409 + TI I + + L L+ C + +A S + F I + Sbjct: 444 FCLFTIGIGNDVDYKLLERMALENCGM-VRHFQEDEDAASHLKGFYDEIGTPL 495 >gi|308472879|ref|XP_003098666.1| hypothetical protein CRE_04169 [Caenorhabditis remanei] gi|308268266|gb|EFP12219.1| hypothetical protein CRE_04169 [Caenorhabditis remanei] Length = 382 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 59/165 (35%), Gaps = 24/165 (14%) Query: 236 LLFLDSIDLLSHVKEDV-----YMGLIGYTTRVEKNIEP----SWGTEKVRQYVTRDMDS 286 +D + +S++ E V +G + Y + N + SWG + Q V ++ Sbjct: 47 NSVVDIQNSISNIFEIVPIPINRVGFVTYNSLATINADLNKFKSWG--DLSQGVNDSYNN 104 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + L +++ + QG I + + K II N Sbjct: 105 MNLSSENTS---------FIGTGLITAGELLQVQGSAIGRVYYPKVIIVYASAFNGTGLL 155 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEYH 389 + ++ + K I I+T++++ NG Q+ L + S Sbjct: 156 DPLSVA--NTLKSAGITIITVAVDTDNNGVIQKQLASIASPGSAF 198 >gi|308493174|ref|XP_003108777.1| hypothetical protein CRE_11006 [Caenorhabditis remanei] gi|308248517|gb|EFO92469.1| hypothetical protein CRE_11006 [Caenorhabditis remanei] Length = 425 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 32/220 (14%), Positives = 65/220 (29%), Gaps = 47/220 (21%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LV+D S + P+ + K+ + + + H Sbjct: 238 LDLVLVLDFS-------------TTTDPVYN--------SYKDLSKRLVSQLKIGPH--- 273 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSD 308 + + + T + + ++ V R +D L K T +++A Sbjct: 274 YTQVAAVTFATVGRTRVRFNLKKYTTQEEVLRGIDKLQSKGGTTAIGAGIEKA------- 326 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 T P + + I+F TDG +N K A ++ T++ Sbjct: 327 -----LTQIDESEGARPGIATKVMIVF-TDGWSNKGPDPEKRAK---DAVNAGFEMYTVA 377 Query: 369 I--NASPN---GQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 A + L S + + V SL+ + Sbjct: 378 YTARAPNSVTLNNETLSAISGSSGHAFTDVTFQSLVDKIK 417 >gi|262371395|ref|ZP_06064713.1| conserved hypothetical protein [Acinetobacter johnsonii SH046] gi|262313732|gb|EEY94781.1| conserved hypothetical protein [Acinetobacter johnsonii SH046] Length = 1327 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 36/259 (13%), Positives = 76/259 (29%), Gaps = 31/259 (11%) Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS--NNTIFYNMDVMTSYDY 156 + +SG + + + +VN + T+ N + +F + Sbjct: 473 SGTPNKVISGTTITISDADGTLVVNKLTGAYTYTLNAATTQGVDDIKVFSYIAKDQVTGQ 532 Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 +N + + + I L + + + + +++D+SGSM D S Sbjct: 533 TSAATNLTINVIDDAPVATDINQTLSMAATPQ-TYNLTIILDVSGSMA----QDENGRGS 587 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 A D ++ K A+ +D +D L +V + + + W + Sbjct: 588 ATAGFDPNTIRLMLAKQAIAQLIDKVDTLGNVNVKM-INFSSANSEEVS----QWYVDSA 642 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + T L++ K P + F+ F+ Sbjct: 643 TGAINFVNTQQADGGT----------SYLSAINKTMTDWG-------TPPAADKHFVYFI 685 Query: 337 TDG--ENNNFKSNVNTIKI 353 TDG + N Sbjct: 686 TDGAPDTNQGIDATTGTNW 704 >gi|171681714|ref|XP_001905800.1| hypothetical protein [Podospora anserina S mat+] gi|170940816|emb|CAP66465.1| unnamed protein product [Podospora anserina S mat+] Length = 648 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 66/167 (39%), Gaps = 17/167 (10%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P L + + L +D+SGSM P + + + + +++A Sbjct: 58 PELEDLRERNHVPLDLVLSIDVSGSMGAD---APVPAKNGTEGEHYGLSVLDLVRHAAKT 114 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTP 296 L+++D +G++ ++T + E ++ T + + + +D+L L T+ Sbjct: 115 ILETLDDHD------RLGIVTFSTSSKVVRELTYMTPANKAKILKQLDALQPLSMTNLWH 168 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 ++ + ++ K R+ +P ++ LTDG N+ Sbjct: 169 GIRDGLSLFNNNLKAVND----RRNPGSGRVPA---LLVLTDGMPNH 208 >gi|150019021|ref|YP_001311275.1| von Willebrand factor, type A [Clostridium beijerinckii NCIMB 8052] gi|149905486|gb|ABR36319.1| von Willebrand factor, type A [Clostridium beijerinckii NCIMB 8052] Length = 962 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 56/155 (36%), Gaps = 39/155 (25%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I LV+D SGSM K+ LK A F+ + + ++K Sbjct: 82 IVLVLDSSGSMADNY-------------------KLTNLKKAATDFITKMSTVKNLK--- 119 Query: 253 YMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + ++ + T+ + + + V + T++ ++QA +L++ + Sbjct: 120 -IAIVDFDTQATIINKLTDVSSSTNVTALKRSINNLTAGGGTNTGEGLRQAAYLLSNSSE 178 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 ++ K IIF++DGE + Sbjct: 179 QNPLA--------------SKNIIFMSDGEPTYYN 199 >gi|325860337|ref|ZP_08173459.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482216|gb|EGC85227.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 331 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 57/176 (32%), Gaps = 45/176 (25%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + +D+S SM D +++ K + ++ + Sbjct: 95 IALDISNSMLAE---------------DVSPSRLEKSKLLVENLMNKFSED-------KI 132 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSF 313 GLI + + + + ++ SLI + TD A++ + T + K Sbjct: 133 GLIVFAGDAFVQLPITGDYVSAKMFLDNINPSLIGTQGTDIGKALQLSINSFTPNSKVG- 191 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 K II +TDGE+N + + A+ IK+ + I Sbjct: 192 -----------------KAIILITDGEDNEGGAEAMAKQ----ARNKGIKVFILGI 226 >gi|281416613|ref|ZP_06247633.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|281408015|gb|EFB38273.1| von Willebrand factor type A [Clostridium thermocellum JW20] Length = 1363 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 43/314 (13%), Positives = 96/314 (30%), Gaps = 66/314 (21%) Query: 112 VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 +FY E + + + N + ++ Y + + + I+ +L++ N Sbjct: 65 IFYYDEENDTIKFLETEVDEETNTIKTTVDHFSIYGVIDIVRFAQSW-GIKDILDKLLNP 123 Query: 172 --KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + + + + + I V+D +GSM +N+ ++ + + Sbjct: 124 GGETIPPVAEIGQA--------DIVFVIDTTGSMGSVINNVKNNITNFANTLMEN----- 170 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR----VEKNIEPSWGTEKVRQYVTRDMD 285 DV +GLI Y ++ W + V ++ + Sbjct: 171 -------------------NVDVRLGLIDYKDLEEDGMDSTKNLGW-FDNVSDFIASVNN 210 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD---GENN 342 D+ + A + KFI+ TD E+ Sbjct: 211 MRATGGGDAPESTVDALEE---------------ARRMDFRPGVNKFIMLFTDVSYKEST 255 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 F+ + + +K KE+ I + I + L + + Y ++ F Sbjct: 256 RFEDVQSMKTVIEKLKEDKIVVSAI---VPSGYESLYRNLYTETGGVYA-----NITQAF 307 Query: 403 QNISQLMVHRKYSV 416 + Q ++ SV Sbjct: 308 SSALQSLISNIASV 321 >gi|19704610|ref|NP_604172.1| acriflavin resistance protein B [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] gi|19714908|gb|AAL95471.1| Acriflavin resistance protein B [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] Length = 1020 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 40/270 (14%), Positives = 89/270 (32%), Gaps = 30/270 (11%) Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + + S+T N D + + + + +S + + Sbjct: 550 DEGKYSLTAELQNGTDLNKAE---RIAKELEEIIKSDPHTQSYLMLVSTSSISVNANVGK 606 Query: 183 IEMGERPIFLIE--------LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + +F I V+D SM + + + + Q + ++ L Sbjct: 607 KNTRDDSVFTIMNDIRNKTSKVLDARVSMTNQFSGGQTNKDVQFLLQGSNQNEIKQLGKQ 666 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 LL L S + + + + G+I ++++ S+G V + + +L + Sbjct: 667 LLEKLQSYNGMVDISSTLDPGIIELRVNIDRDKIASYGISPTV--VAQTISYYMLGGDKA 724 Query: 295 TPA-MKQAYQIL-----TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 A +K + + KR+ +K+ F K L+D + Sbjct: 725 NTATLKTDTEEIDVLVRLPKDKRNDINTLASLNIKVGDNKFVK----LSDVATLQYAEGT 780 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRL 378 + I+ K+N I T++I+ + G L Sbjct: 781 SEIR-----KKNG--IYTVTISGNDGGVGL 803 >gi|213966921|ref|ZP_03395071.1| ppkA-related protein [Pseudomonas syringae pv. tomato T1] gi|213928243|gb|EEB61788.1| ppkA-related protein [Pseudomonas syringae pv. tomato T1] Length = 653 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 67/193 (34%), Gaps = 23/193 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDS 286 + + A+ I + ++ V GL+ + + + + T+ +D Sbjct: 243 IERTREAIAKVYAQIAKENLGRQ-VKFGLVAFRSSTQAVPGLEYVTKMYADPNTVKDGAD 301 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD-----GEN 341 + K D +KQA S + S+ S ++++ +TD G++ Sbjct: 302 FLAKAAD----LKQAKVSSKSFNEDSYAGVMQAIDKVDWSPFGARYVVLITDAGALDGDD 357 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISI----------NASPNGQRLLKTCVSSPEYHYN 391 + +N ++ +A + I T+ + A Q L ++ +Y Sbjct: 358 KLSGTGLNAEQVRIEASNPGVAIYTLHLKTAAGAKDHAKAEAQYQALSTYTGTNTSLYYP 417 Query: 392 VVNADSLIHVFQN 404 V +A L F + Sbjct: 418 V-DAGDLN-AFGS 428 >gi|307565332|ref|ZP_07627825.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307346001|gb|EFN91345.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 566 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 51/269 (18%), Positives = 85/269 (31%), Gaps = 49/269 (18%) Query: 106 LSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 S ++ + + S + + + I + S R +FI+ LL Sbjct: 2 FRFESPIYLWLLLLIPLLSIFYYILYQHRKQRIKKFGDIELVKQLSPSLSNRRRFIKFLL 61 Query: 166 NQRYNQKIVSFI--PALLRIEMGERPIFLIELV--VDLSGSMHCAMNSDPEDVNSAPICQ 221 Q +V I P + + IE V +D+S SM Q Sbjct: 62 MQSSIALMVLIIARPQIGNRISSTKNGKGIETVIALDISNSMLA---------------Q 106 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 D +++ K + L S D + V GLI + + + + ++ Sbjct: 107 DVIPSRLDKSKLLIEDLLRSFD-----NDKV--GLIVFAGDAFVQLPITSDFISAKMFLN 159 Query: 282 RDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 SLI + TD A+ A P+ K II +TDGE Sbjct: 160 DINPSLIGTQGTDIGKAINLA------------------MHSFSPTSKAGKAIIIITDGE 201 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISI 369 +N + KA+E I + I Sbjct: 202 DNEGGAEAMAK----KAQEAGFHIYILGI 226 >gi|198425808|ref|XP_002121992.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 431 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 48/149 (32%), Gaps = 25/149 (16%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K + D D + +G+I Y+ E T V+ Y+ + Sbjct: 197 DRVKQWVKNVTDRFD----ISTFANVGVIQYSHYYE--------TRTVQPYMK---VEIG 241 Query: 289 LKPTDSTPAMKQAYQIL-------TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 L + + A + + + F + +K I+ LTDG++ Sbjct: 242 LGQYKTQAEFQMAVDSIQFQGFTTFTAHALNRTVEEFMNSTRYSDPTTRKVIVLLTDGQS 301 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISIN 370 N+ + T A+ I I + + Sbjct: 302 NDREFLEETSAY---ARGLGITIFAVGVE 327 >gi|15603687|ref|NP_246761.1| PpkA [Pasteurella multocida subsp. multocida str. Pm70] gi|12722246|gb|AAK03906.1| PpkA [Pasteurella multocida subsp. multocida str. Pm70] Length = 674 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 69/206 (33%), Gaps = 20/206 (9%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + A+ D I+ + E V GL+ + + + + + +V Sbjct: 258 INRTREAIKQVYDQIEKENLG-EQVKFGLVAFRSSTKAVKGLEY---TSKMFVDPTTVKD 313 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSF-FTNFFRQGVKIPSLPFQKFIIFLTD-----GEN 341 ++KQA + ++ N ++ ++++ +TD G+N Sbjct: 314 GKDFMQKVASLKQAKVSSKEFSEDAYAGINQALNEIQWNQFGA-RYMVLITDAGAIEGDN 372 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPN----GQRLLKTCVSSPEYH-----YNV 392 + ++ ++ +A+ + I + + Q + + ++ Y Sbjct: 373 PISTTGLDAKQLRLEAQHRGVAIYALHLKTPSGAKNHEQAQAQYTDLAFNHYLNKALYYP 432 Query: 393 VNADSLIHVFQNISQLMVHRKYSVIL 418 VNA + + IS L V L Sbjct: 433 VNAGDVNEFGEKISTLAKALTAQVKL 458 >gi|254779584|ref|YP_003057690.1| hypothetical protein HELPY_0994 [Helicobacter pylori B38] gi|254001496|emb|CAX29512.1| Conserved hypothetical protein [Helicobacter pylori B38] Length = 214 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 30/234 (12%), Positives = 70/234 (29%), Gaps = 43/234 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E + L++D SGSM + T++ L + ++++ + Sbjct: 11 EERFIPVFLLLDTSGSMSHPLG---------------NSTRIGVLNLCIQKMIETLKQEA 55 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + M ++ + P + V T A + A ++ Sbjct: 56 KKELFSKMAIVTFGENGAVLHTPFDDIKNV-----NFEPLSTSGGTPLDQAFRLAKDLIE 110 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 P+ ++ + I ++DGE NN K + + Sbjct: 111 DKD-------------TFPTKFYKPYSILVSDGEPNNDKWQEPLSSFHHDGRSAKSVCWS 157 Query: 367 ISI----NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I I + K + + + + L+ +F+ ++Q + S+ Sbjct: 158 IFIGDREANPQVNKDFGK------DGVFYADDVEKLVKLFEIMTQTISKGSASI 205 >gi|219683166|ref|YP_002469549.1| FctX [Bifidobacterium animalis subsp. lactis AD011] gi|219620816|gb|ACL28973.1| FctX [Bifidobacterium animalis subsp. lactis AD011] Length = 879 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 75/244 (30%), Gaps = 66/244 (27%) Query: 216 SAPICQDKKRTKMAALKNALLLFLDSI----DLLSHVKEDVYMGLIGYT----------- 260 + + T++ ALK+A+ FLD + ++ + V + LI Y Sbjct: 197 QFYKSKQSEETRLDALKDAVTYFLDQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDT 256 Query: 261 --------TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + +W E +++ T + ++ A + L S Sbjct: 257 YNEDGYNYNYSQTVHSLAWTPEDLQKEQAAVNSLKAGGATRADFGLQHAVKQLNS----- 311 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENFIKIVTI 367 QK +F +DG + N IK + K + ++++I Sbjct: 312 ------------GRPGAQKLTVFYSDGSPTSSDGFEAKIANNAIKAAAQLKNDHSQVISI 359 Query: 368 SIN--ASPNG----QRLLKTCVS---------------SPEYHYNVVNADSLIHVFQNIS 406 A P+G + + S Y+Y V L +F+ I Sbjct: 360 GAMPGADPSGTDNANKFMNYVSSNYPKAQSMSEPHDRVEGTYYYAVSARTDLQTIFKEII 419 Query: 407 QLMV 410 ++ Sbjct: 420 SIVT 423 >gi|126316412|ref|XP_001380737.1| PREDICTED: similar to integrin, alpha 1 [Monodelphis domestica] Length = 1183 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 70/189 (37%), Gaps = 28/189 (14%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK- 290 ++ FL+S+ + + +G++ Y V + T + + Sbjct: 192 TSVTDFLNSLLGKMDIGPKQTQVGIVQYGENVTHEFNLNKYTTTEEVLIAANQIVQRQGR 251 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + + A + + + +K ++ +TDGE+++ Sbjct: 252 QTMTALGIDTARKEAFTKA-------------RGARSGVKKVMVIVTDGESHDNHRLNEV 298 Query: 351 IKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSP--EYHYNVVNADSLIH 400 I+ C+ + I+ +I+I N L +K+ S P ++ +NV + +L+ Sbjct: 299 IQDCE---DEDIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVT 355 Query: 401 VFQNISQLM 409 + + + + + Sbjct: 356 IAEVLGERI 364 >gi|291461066|ref|ZP_06026725.2| D-amino acid dehydrogenase large subunit [Fusobacterium periodonticum ATCC 33693] gi|291379168|gb|EFE86686.1| D-amino acid dehydrogenase large subunit [Fusobacterium periodonticum ATCC 33693] Length = 529 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 100/304 (32%), Gaps = 34/304 (11%) Query: 118 IQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY-RLQFIEHLLNQRYNQKIVSF 176 +++ + + + + +F + + YDY + I+ + + + Sbjct: 106 KEDVQRELKKIEPALGEDASEEEIQHLFKQLLYIAGYDYTPFETIDRFSYVIFKNDMENP 165 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 ++ E +E+V+D SGSM + +T M K ++ Sbjct: 166 FT---HEKIEENMNVNVEIVLDASGSMVKKIG---------------DKTMMEIAKESIK 207 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 L + V G+ + + + + + + D + + Sbjct: 208 QVLSEM----PTNAKV--GVRVFGHKGDNTASKKDESCGANELIYPIEDLNVEGIEKALE 261 Query: 297 AMK-QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 ++ + + + G K ++ +II TDG + V K Sbjct: 262 PIQPTGWTSIAKSIEYGVEDLKALDGEKTLNIL---YII--TDGIETCGGNPVEIAKQL- 315 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY-NVVNADSLIHVFQNISQL-MVHRK 413 K + I + I N N RLLK + +Y +V +AD L I++L K Sbjct: 316 KGENTNIVLGIIGFNVDANQNRLLKQIADAAGGYYSSVNDADKLTGELYRINELAFSDYK 375 Query: 414 YSVI 417 + V+ Sbjct: 376 WEVL 379 >gi|197337036|ref|YP_002157821.1| hypothetical protein VFMJ11_A0264 [Vibrio fischeri MJ11] gi|197314288|gb|ACH63737.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 423 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 66/451 (14%), Positives = 139/451 (30%), Gaps = 78/451 (17%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + ++ + +I+FA+ + + + K +E A+ A LA ++ + Sbjct: 2 RNLRKHQQGHAAILFAMMIPALFGIFALASDGARAIQTKARIEDASEVAALAISAHNDPD 61 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 G S N ++ + +++ +S AV T+I+ + + Sbjct: 62 QPDNGSYTPSTRN----------RQIVVDYVNAYISDVDAV---TDIKVAKRRCEL-IPE 107 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 D + + +DV T + E + + S L + Sbjct: 108 CVAGLYDGDMRYLEHEIDVTTRQNSWFPGNEAIEG---MGETFSTRGKSLARKYQSE-AV 163 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 D SGSM + +S P D + + + L + L + Sbjct: 164 DAMFAADFSGSMLDTWSG-----SSNPKYVD-----LIEIIRNISAELQKFNDLPENRNK 213 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDSTPAM------------ 298 MG+ ++T + + Q V R+ + +P + + Sbjct: 214 STMGISAFSTFTNSFTSDTGIQCSLSQGVNGRNGPATWFRPVKAANTVANIWNPKTEDYC 273 Query: 299 -KQAYQILTSDKKRSFFTNFFRQGVK----IPSLPFQ----------------KFIIFLT 337 AY S F Q + +Q + +I L+ Sbjct: 274 KSGAYAGFHDVNLTSNFNYLNGQVGSFYAGGGTASYQALIRGAQLLRKGNNSRRLLIVLS 333 Query: 338 DGENNNFK--SNVNTIKICDKAKEN------------FIKIVTISINASPNGQRLLKTCV 383 DG +N+ + + + +C + K+ I + +P + LK CV Sbjct: 334 DGMDNDTQLADGLVSAGMCRDIQNGLESDRTPDRRPIAAKMAVIGFDYNPFANKALKDCV 393 Query: 384 SSPEYHYNVVNADSLIHVFQN-ISQLMVHRK 413 + Y +AD + + I++ + H K Sbjct: 394 G-EKNVYKAEDADEVEDIILELINEEIGHLK 423 >gi|284052943|ref|ZP_06383153.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] gi|291569121|dbj|BAI91393.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 463 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 60/189 (31%), Gaps = 25/189 (13%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD-KKRTKMAALKNALLLFLDSID 243 RP V+D SGSM+ + + D K+ T++ K + ++S++ Sbjct: 35 SASRPSTTFSFVIDTSGSMYEVLEGEETIPTGNSYFLDGKQYTQVTGGKTKIDQVIESLE 94 Query: 244 LLS---HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 L + L+ + + + T+ T M++ Sbjct: 95 RLVSSGQADSRDRIALVRFDDSASVLLPLTASTDTASLKNAIGQLRNFSGGTRMALGMEE 154 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A IL K L ++ +IF TDG+ + + +A Sbjct: 155 ALNIL-----------------KNCDLSSRRTLIF-TDGQTFDESDCRDLATQFAEA--- 193 Query: 361 FIKIVTISI 369 I I + + Sbjct: 194 GIPITALGV 202 >gi|242042271|ref|XP_002468530.1| hypothetical protein SORBIDRAFT_01g047470 [Sorghum bicolor] gi|241922384|gb|EER95528.1| hypothetical protein SORBIDRAFT_01g047470 [Sorghum bicolor] Length = 686 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 60/178 (33%), Gaps = 47/178 (26%) Query: 172 KIVSFIPALLRIEMGERPI----FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + + + G+R + V+D+SGSM K Sbjct: 143 HVKAPGVIVNEAAAGDRDAPRAPLDLVTVLDVSGSMRWD--------------------K 182 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +A +K A+ + S+ + ++ +++ + + + T ++SL Sbjct: 183 LALVKQAMGFVIGSLGPHD------RLSVVSFSSGARRVTRLLRMSHTGKSLATEAVESL 236 Query: 288 -ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T+ ++ A ++L + R+ ++ +I L+DG +N Sbjct: 237 RAGGGTNIAEGLRTAAKVLGERRHRNAVSS----------------VILLSDGHDNYS 278 >gi|149909538|ref|ZP_01898192.1| TadG-like protein [Moritella sp. PE36] gi|149807443|gb|EDM67394.1| TadG-like protein [Moritella sp. PE36] Length = 405 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 55/441 (12%), Positives = 120/441 (27%), Gaps = 81/441 (18%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I + + +++FA+ + +F + K +E + + Sbjct: 4 IRKQSGHAAMLFAMMIPAFFGIFTLASDGARALQSKARLED--------------AAEAA 49 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + ++++ + + I + + +I + + + Sbjct: 50 VLAIAAHNADNSGSSSGSAINKKIASDWIGQYMQDMQAISDIKITKLNCN---DIAECKE 106 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF--- 191 + + Y + T+ HL N F + + F Sbjct: 107 GLENGESRYFQYEILAKTN---------HLSWFPGNNSTAGFGESFDVVGSATARKFQSE 157 Query: 192 --LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + V D SGSM+ + + L + + +D ++ Sbjct: 158 SVDVMFVSDFSGSMNNKWSGG------------SNSRRYKDLIKIIGDVIKELDKFNNAH 205 Query: 250 EDV--YMGLIGYTTRVEKN---------IEPSWGTEKVRQYVTRDMDSLILKPTDSTP-A 297 +G G+ T K + S G + + + S A Sbjct: 206 TTTTNRVGFTGFNTYTRKTADNSCYQDQYDRSAGRTVNKIFEVKGCKSRSSGGAKFHDIA 265 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKI----------PSLPFQKFIIFLTDGENNNFKSN 347 M Y + K I P ++ +I L+DG ++ Sbjct: 266 MTDNYNEFKNTIKYFKPGGGTASYQGIIRGAQMMDAAPEPRPRRIMIILSDGIDSKRSRA 325 Query: 348 VNTIK--ICDKA------------KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 ++ +C K K K+ + + +P L CV Y Sbjct: 326 NKLVEEGMCSKILLKLGNANTSDGKAIKTKMAVVGFDYNPASNPSLAKCVG-EHNVYGAN 384 Query: 394 NADS-LIHVFQNISQLMVHRK 413 N + L + + IS+ + H K Sbjct: 385 NPEDVLNKILELISEEIGHLK 405 >gi|330879011|gb|EGH13160.1| serine/threonine-protein kinase PpkA [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 663 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 67/193 (34%), Gaps = 23/193 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDS 286 + + A+ I + ++ V GL+ + + + + T+ +D Sbjct: 253 IERTREAIAKVYAQIAKENLGRQ-VKFGLVAFRSSTQAVPGLEYVTKMYADPNTVKDGAD 311 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD-----GEN 341 + K D +KQA S + S+ S ++++ +TD G++ Sbjct: 312 FLAKAAD----LKQAKVSSKSFNEDSYAGVMQAIDKVDWSPFGARYVVLITDAGALDGDD 367 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISI----------NASPNGQRLLKTCVSSPEYHYN 391 + +N ++ +A + I T+ + A Q L ++ +Y Sbjct: 368 KLSGTGLNAEQVRIEASNPGVAIYTLHLKTAAGAKDHAKAEAQYQALSTYTGTNTSLYYP 427 Query: 392 VVNADSLIHVFQN 404 V +A L F + Sbjct: 428 V-DAGDLN-AFGS 438 >gi|262195558|ref|YP_003266767.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262078905|gb|ACY14874.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 775 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 58/202 (28%), Gaps = 45/202 (22%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA--MNSDPEDVNSAPICQDKKRTK 227 + +++ + + E + LV+D SGSM A + D R Sbjct: 244 DAYLIANLATPPGLAATEIADKDVTLVLDRSGSMSGAPLARAKDAAKAVVARLGDGDRVN 303 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + A + + +S + + +Y+ R D Sbjct: 304 VMAFDDGVDALFLRPVPISAER---RSQAV--------------------EYIDRLSDG- 339 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 TD A+ +A + S P I+FLTDG +S+ Sbjct: 340 --GGTDLAGALAEALDA----------QHPSESEADTGSRPH--VILFLTDG-----QSD 380 Query: 348 VNTIKICDKAKENFIKIVTISI 369 + ++ TI + Sbjct: 381 SQATLQVARGDAGDARVFTIGV 402 >gi|242042269|ref|XP_002468529.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor] gi|241922383|gb|EER95527.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor] Length = 698 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 56/168 (33%), Gaps = 29/168 (17%) Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + ++ +N I ++ + ++ IE + + ++ R + Sbjct: 185 QSDMNDEHNAITGSVKIKAYSEFPA--IEQSVTKEIFAILIHLRAPKSSHSASSRAPLDL 242 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 V+D+SGSM TK+A LKNA+ + ++ Sbjct: 243 VTVLDVSGSMAG--------------------TKIALLKNAMSFVIQTLGPND------R 276 Query: 254 MGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + +I +++ + T +Q + + T+ +K+ Sbjct: 277 LSVIAFSSTARRLFPLRRMTLAGRQQALQAVSSLVASGGTNIADGLKK 324 >gi|150009321|ref|YP_001304064.1| hypothetical protein BDI_2723 [Parabacteroides distasonis ATCC 8503] gi|149937745|gb|ABR44442.1| hypothetical protein BDI_2723 [Parabacteroides distasonis ATCC 8503] Length = 1149 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 51/143 (35%), Gaps = 15/143 (10%) Query: 244 LLSHVKEDVYMGLIGYT----TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 ++ G + + + +++ +W V+ + R + S + K T T ++ Sbjct: 42 SVNTENNY---GALSFDPTTLSDLKELNPATWTVSNVKTSLGRTLYSFVNKATGLTLSVD 98 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPF--------QKFIIFLTDGENNNFKSNVNTI 351 A I D +S S + ++ DGEN + + + Sbjct: 99 PATAIAVDDNNKSQKIASAAPLALGGSASEWVVETVGGKDILVSYIDGENVVYIAQSGSK 158 Query: 352 KICDKAKENFIKIVTISINASPN 374 +KAK++ I I++++ Sbjct: 159 LYLEKAKKSAISTAAIALSSDQA 181 >gi|315613111|ref|ZP_07888021.1| collagen adhesion protein [Streptococcus sanguinis ATCC 49296] gi|315314673|gb|EFU62715.1| collagen adhesion protein [Streptococcus sanguinis ATCC 49296] Length = 863 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 51/320 (15%), Positives = 99/320 (30%), Gaps = 81/320 (25%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + I + Q ++ I + L E P + LV DLSGSM Sbjct: 47 KTITPISGQDDKYELSLDITSKLGTETQTDP-LDVVLVADLSGSM---------QNQDVQ 96 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHV--KEDVYMGLIGYTTRVEKN---------- 266 + +++ ALKN L L+ + + + ++G+ +++ Sbjct: 97 SFDGRTISRIDALKNTLRGTNGRKGLIDTILSNSNNRLSMVGFGGKIDNKKVDQYWDGNK 156 Query: 267 ---IEPSWGTEKVRQYVTRDMD----SLILKPTDSTPAMKQA-YQILTSDKKRSFFTNFF 318 P W E++ +Y + IL +++ A K A Y + + + Sbjct: 157 WRLFRPYWPYERMTKYYDGVSPWDDANTILGWSNNARAAKTAVYNMSIAGGNSIGTESGI 216 Query: 319 RQGVKIP-------------SLPFQKFIIFLTDG---------------ENNNF------ 344 G I +K +I L+DG NN Sbjct: 217 GTGTNIGAGLTLANQLMGSARSNAKKVVILLSDGFANMVYDANGYTIYNYNNEDPNIETA 276 Query: 345 --------KSNVNTIKICDKAKENFIKIVTISINASPN-------GQRLLKTCVSSPEYH 389 +N+N++ + +I S N + + S P Sbjct: 277 PQWFWDRLNNNLNSLSYSLAPTLDGF--YSIKFRYSNNVDSITSLQYYMRQHNASIPNEI 334 Query: 390 YNVVNADSLIHVFQNISQLM 409 ++ + D L F++I+ + Sbjct: 335 FSANDEDQLRDSFKDITDKI 354 >gi|254491469|ref|ZP_05104648.1| von Willebrand factor type A domain protein [Methylophaga thiooxidans DMS010] gi|224462947|gb|EEF79217.1| von Willebrand factor type A domain protein [Methylophaga thiooxydans DMS010] Length = 341 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 48/129 (37%), Gaps = 10/129 (7%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + +E + I L +D S SM P +NS P+ ++ R+K + Sbjct: 65 LAGPFTPEKTVERYRQGAEFIVL-LDRSRSMDDIFARRP--LNSLPVEKELIRSKRKVSR 121 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + L+ F+ + D G + ++ + + + ++ V V L T Sbjct: 122 DYLVEFVKR-------RPDDRFGYVLFSDKPTEILRLTYNKAAVLATVEAGGLGKGLSKT 174 Query: 293 DSTPAMKQA 301 + A+K A Sbjct: 175 NIFSALKLA 183 >gi|91974615|ref|YP_567274.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisB5] gi|91681071|gb|ABE37373.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisB5] Length = 372 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 63/211 (29%), Gaps = 40/211 (18%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 ++ + R P + RP + V+D +GSM Sbjct: 1 MKRITMLRALAFAALIAPLAIPSAARARPAVEVAFVLDTTGSMSGL-------------- 46 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY----TTRVEKNIEPSWGTEKV 276 + K + +I + + + MGL+ Y V +N+E T + Sbjct: 47 -------IEGAKRKIWSIATAIVDSNPGAD-IRMGLVAYRDIGDDYVTRNVEL---TPDI 95 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + R ++ D ++ +A + + + S + R + P Sbjct: 96 QDLYARLLELQARGGGDWPESVNEALDVAVNKLRWSKDGDTRRIVFLVGDAPP------- 148 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + ++ + A++ I + + Sbjct: 149 ----HMDYAQDTKYPTTLSVARQKDIIVNAV 175 >gi|312130321|ref|YP_003997661.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311906867|gb|ADQ17308.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 318 Score = 37.9 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 61/185 (32%), Gaps = 42/185 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +DLS SM D +++ KN L +D Sbjct: 79 DIFLAIDLSESM---------------NATDVVPSRIDRAKNELQGLIDRF-SAD----- 117 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+I + + ++ TE +R + +I K + M Sbjct: 118 -RIGIILFNSNAYLLTPLTFDTENIRNTIGNLKTHMIDKGSTDFSPM------------- 163 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + + + K I +TDGE +++ + K + K+N I + + + Sbjct: 164 ---LEMINEKLSVGTQNRGKVAIVVTDGET-HYQIDEQLAK---RLKQNNIHLFWLGVGT 216 Query: 372 SPNGQ 376 G+ Sbjct: 217 LGGGK 221 >gi|301065602|ref|YP_003787625.1| hypothetical protein LCAZH_0484 [Lactobacillus casei str. Zhang] gi|300438009|gb|ADK17775.1| Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [Lactobacillus casei str. Zhang] Length = 909 Score = 37.9 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 71/234 (30%), Gaps = 25/234 (10%) Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 + + T QN++N +N + + + D + Sbjct: 56 TNSWQVTGQQNVINQRGGDQVSGWDNNTIWNGDATDTTNSYLKFGD-PNNPDYQIRKYAK 114 Query: 170 NQKIVSFIPALLRIEMGER---PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 L ++ + I LVVD+SGSM N + Sbjct: 115 ETNTPGLYDVYLNVKGNTQQNVKPVDIVLVVDMSGSMESKNNGGTD-------------- 160 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR-VEKNIEPSWGTEKVRQYVTRDMD 285 + A++ + FL SI + + V +GLIG+++ + G +R V Sbjct: 161 RAGAVRTGVKNFLTSIQNA-GLGDYVNVGLIGFSSPGYIGGGNKTTGPGYIR--VGLGKA 217 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 I + A+ + + + +K +I LTDG Sbjct: 218 GNISQQQAINSALSPTFN---GGTYTQIGLRQGSAMLNTDTSGNKKMMILLTDG 268 >gi|94498564|ref|ZP_01305119.1| hypothetical protein SKA58_08324 [Sphingomonas sp. SKA58] gi|94422007|gb|EAT07053.1| hypothetical protein SKA58_08324 [Sphingomonas sp. SKA58] Length = 634 Score = 37.9 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 29/153 (18%), Positives = 52/153 (33%), Gaps = 30/153 (19%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 A+ + E+ RP + +V+D +GSM ++ T+M ALK A F Sbjct: 152 AVAKAELQPRP-LEVMVVLDDTGSMKANLSGGR--------------TRMVALKEAANDF 196 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 +D + + + D+ MG IGY V V + +K + + Sbjct: 197 VDILHQGASSRRDLAMGFIGYD---------------VTVNVGHLLPGNAVKKVEGFNSF 241 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 A S + +G + + Sbjct: 242 AGASAYGLSVPSHASGNYLAWKGCVMADTTVKD 274 >gi|326506938|dbj|BAJ91510.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 378 Score = 37.9 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 50/152 (32%), Gaps = 27/152 (17%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + D+ + Q + A R + ++D+SGSM Sbjct: 227 SSQDDFAVLIHLKAPCANPEQITSRPVNATSVGYPTSRAPVDLVTLLDVSGSMAG----- 281 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 TK+A LK A+ + + + +I +++ V + Sbjct: 282 ---------------TKLALLKRAMGFVIQHLGPSD------RLSVIAFSSTVRRLFHLR 320 Query: 271 WGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQA 301 + RQ + ++SL T+ A+K+A Sbjct: 321 RMSHSGRQQALQAVNSLGAGGGTNIADALKKA 352 >gi|153806292|ref|ZP_01958960.1| hypothetical protein BACCAC_00548 [Bacteroides caccae ATCC 43185] gi|149130969|gb|EDM22175.1| hypothetical protein BACCAC_00548 [Bacteroides caccae ATCC 43185] Length = 342 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 64/192 (33%), Gaps = 41/192 (21%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 ++ Y ++F + +++ + E +R + + +D+S SM Sbjct: 50 VSKYRPDIKFWIIFVVIGLFSVLLARPQFGSKQETVKRKGVEVIIALDISNSMLA----- 104 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 QD + +++ K + +D +D +G+I + + + Sbjct: 105 ----------QDVQPSRLEKAKRLISRLVDELDND-------KIGMIVFAGDAFTQLPIT 147 Query: 271 WGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + ++ SLI K T A+ A + T P Sbjct: 148 SDYISAKMFLESISPSLISKQGTAIGEAINLAVRSFT------------------PQEGV 189 Query: 330 QKFIIFLTDGEN 341 + I+ +TDGEN Sbjct: 190 GRAIVVITDGEN 201 >gi|89889806|ref|ZP_01201317.1| BatB [Flavobacteria bacterium BBFL7] gi|89518079|gb|EAS20735.1| BatB [Flavobacteria bacterium BBFL7] Length = 343 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 61/195 (31%), Gaps = 45/195 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +IE +R I VD+S SM D +++ + + +++ Sbjct: 81 KIETIKREGVDIVFAVDISKSMLAE---------------DIAPSRLEKSQQLVTQIINN 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQ 300 + +GLI Y + + + ++ L+ T A++ Sbjct: 126 LASD-------RIGLIAYAGSAVPQLPITTDYSSAKMFLQSMNTDLVSSQGTAIAEAIQL 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + K ++ ++DGE + + A EN Sbjct: 179 A------------------ESYYSEDTEASKVLVIISDGE----DHEGEALDYAEAAAEN 216 Query: 361 FIKIVTISINASPNG 375 I+I+TI + G Sbjct: 217 GIRIITIGVGTEKGG 231 >gi|159899681|ref|YP_001545928.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159892720|gb|ABX05800.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 550 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 42/246 (17%), Positives = 84/246 (34%), Gaps = 47/246 (19%) Query: 165 LNQRYNQKIVSFIPALLRIE---MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 NQ N ++ + R I LVVD SGSM Sbjct: 347 PNQPRNSLATPPADVIVAAKNAWANNRKPANIMLVVDSSGSM------------------ 388 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + KM K + +FL+ + +V G+IG+++ + + +E + Sbjct: 389 -RDDDKMDQAKLGVEVFLNRLPSKDNV------GMIGFSSSPAVLVPLATRSENMANLQM 441 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + T A+ A Q L + K+ I+ L+DG + Sbjct: 442 QTQGLVPDGNTSLYDAIDLARQELENLKQPDRINA----------------IVVLSDGAD 485 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + +++ ++ E+ I+I I+ A +L+ + + + Sbjct: 486 TASQLSID--QMLGNFGESSIQIFPIAYGA-DAETSILQQIADFSRTELVQGSTGDIDKI 542 Query: 402 FQNISQ 407 F+N+S+ Sbjct: 543 FENLSR 548 >gi|297279075|ref|XP_001109489.2| PREDICTED: calcium-activated chloride channel regulator 4-like [Macaca mulatta] Length = 931 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 78/233 (33%), Gaps = 54/233 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ + A FL I V+ ++ Sbjct: 309 LVLDKSGSMA-------------------GYDRLNQMNKAAKYFLLQI-----VENGSWV 344 Query: 255 GLIGYT-TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + T N + R + + + T +K A+Q++ Sbjct: 345 GMVHFDSTATIINKPIQIISSDERNTLLAWLPTYASGGTSICSGIKSAFQVIGELSSHLD 404 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINAS 372 + ++ LTDGE+ S C D+ K + + I++ + Sbjct: 405 GSE----------------VVLLTDGEDYTASS-------CIDEVKRSGAIVHFIALGTA 441 Query: 373 PNGQRLLKTCVSSPEYHYNVVNA--DSLIHVFQNIS---QLMVHRKYSVILKG 420 + + + ++ + Y A + LI F +++ + + + KG Sbjct: 442 ADKAVIEMSKITGGRHFYASDKAQNNGLIDAFGDLTSGNTELSQKSLQLESKG 494 >gi|293365338|ref|ZP_06612055.1| collagen adhesion protein [Streptococcus oralis ATCC 35037] gi|307703880|ref|ZP_07640821.1| von Willebrand factor type A domain protein [Streptococcus oralis ATCC 35037] gi|291316788|gb|EFE57224.1| collagen adhesion protein [Streptococcus oralis ATCC 35037] gi|307622715|gb|EFO01711.1| von Willebrand factor type A domain protein [Streptococcus oralis ATCC 35037] Length = 863 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 51/320 (15%), Positives = 99/320 (30%), Gaps = 81/320 (25%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + I + Q ++ I + L E P + LV DLSGSM Sbjct: 47 KTITPISGQDDKYELSLDITSKLGTETQTDP-LDVVLVADLSGSM---------QNQDVQ 96 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHV--KEDVYMGLIGYTTRVEKN---------- 266 + +++ ALKN L L+ + + + ++G+ +++ Sbjct: 97 SFDGRTISRIDALKNTLRGTNGRKGLIDTILSNSNNRLSMVGFGGKIDNKKVDQYWDGNK 156 Query: 267 ---IEPSWGTEKVRQYVTRDMD----SLILKPTDSTPAMKQA-YQILTSDKKRSFFTNFF 318 P W E++ +Y + IL +++ A K A Y + + + Sbjct: 157 WRLFRPYWPYERMTKYYDGVSPWDDANTILGWSNNARAAKTAVYNMSIAGGNSIGTESGI 216 Query: 319 RQGVKIP-------------SLPFQKFIIFLTDG---------------ENNNF------ 344 G I +K +I L+DG NN Sbjct: 217 GTGTNIGAGLTLANQLMGSARSNAKKVVILLSDGFANMVYDANGYTIYNYNNEDPNIETA 276 Query: 345 --------KSNVNTIKICDKAKENFIKIVTISINASPN-------GQRLLKTCVSSPEYH 389 +N+N++ + +I S N + + S P Sbjct: 277 PQWFWDRLNNNLNSLSYSLAPTLDGF--YSIKFRYSNNVDSITSLQYYMRQHNASIPNEI 334 Query: 390 YNVVNADSLIHVFQNISQLM 409 ++ + D L F++I+ + Sbjct: 335 FSANDEDQLRDSFKDITDKI 354 >gi|330830099|ref|YP_004393051.1| RTX toxin-like protein [Aeromonas veronii B565] gi|328805235|gb|AEB50434.1| RTX toxin-like protein [Aeromonas veronii B565] Length = 1553 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 41/252 (16%), Positives = 78/252 (30%), Gaps = 54/252 (21%) Query: 164 LLNQRYNQKIVSFIPA---LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 LL + IV + A + + + + ++D+SGS+ Sbjct: 1123 LLAPDVDGAIVGGVGADVLIGDVGGVTSGSYNLTFMIDMSGSISG--------------- 1167 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 T+ +K+A+ L +S ++ V +G + V +Q V Sbjct: 1168 -----TEFQLMKDAINNLLAKFSGISQLQ--VEIGTFADNSNVVGTYS---SVTAAQQAV 1217 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + S T+ A+ ++T D K++ FLTDGE Sbjct: 1218 SNLTRS--GGGTNYQAALTTLNTMMTVDPVADH-----------------KYVYFLTDGE 1258 Query: 341 N-----NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ--RLLKTCVSSPEYHYNVV 393 N N + + + I + I L ++P+ + V Sbjct: 1259 PTVGSWTNSTQIANGMAALNALTAPGVVINAVGIGVPSGASFGNNLNAIDNTPDNYLAVD 1318 Query: 394 NADSLIHVFQNI 405 N D L ++ Sbjct: 1319 NFDDLSSGLGSL 1330 >gi|290985353|ref|XP_002675390.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] gi|284088986|gb|EFC42646.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] Length = 923 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 39/241 (16%), Positives = 79/241 (32%), Gaps = 49/241 (20%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P + ER + LVVD SGSM K+ +K+ L Sbjct: 678 PCFEQQAQKERKGVDLVLVVDKSGSMAGQ--------------------KLDMVKSTLSF 717 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTP 296 +D + + ++ + T+V+ N++ + + ++ + ++ T+ + Sbjct: 718 MVDQLKEKD------RVAIVEFDTQVKTNLDLTKMDIEGKKKAKQVSSAISPGSCTNLSG 771 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKIC 354 A+ + ++L S ++ +I TDG N +N + Sbjct: 772 ALFTSLKLLASRQQEKNEVTS---------------VILFTDGLANRGLISTNEILQNMQ 816 Query: 355 DKAKE----NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLM 409 D E + + I T + L + + AD + F N I L+ Sbjct: 817 DLMDELLSTSNVTIHTFGFGQDTDANMLTSIAQKGNGLYDYLETADDIPKAFGNVIGNLV 876 Query: 410 V 410 Sbjct: 877 S 877 >gi|237747941|ref|ZP_04578421.1| predicted protein [Oxalobacter formigenes OXCC13] gi|229379303|gb|EEO29394.1| predicted protein [Oxalobacter formigenes OXCC13] Length = 1843 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 31/210 (14%), Positives = 64/210 (30%), Gaps = 34/210 (16%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 N +V + + + I LVVD+S SM M+ Sbjct: 1160 GHDVNDVLVGDFAGNTTVGAVQDVVNNIYLVVDVSASMKDGMDPG------------SDV 1207 Query: 226 TKMAALKNALLLFLDSIDLLSHVK-EDVYMGLIGYTTRVEKNIEP-----SWGTEKVRQY 279 +++ + +AL + +++ V + L ++ + E W K Sbjct: 1208 SRLKVVADALDKLFEQMNVADADPHTTVNVTLSIFSGGGDTPKEYNLSVSDW-AGKTADD 1266 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + +D T+ + A + + K + K I F+TD Sbjct: 1267 IYKMLDDFKHGDTNFELPLGNA-------------SEWADSQAK-GDGIYNKVI-FVTDA 1311 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + DK + + + I+I Sbjct: 1312 NGTEILNTGKAQDYVDKILGSGVDVEAIAI 1341 >gi|116624267|ref|YP_826423.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227429|gb|ABJ86138.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 306 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 34/239 (14%), Positives = 74/239 (30%), Gaps = 56/239 (23%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 +++ E + LV+D S SM K + +A L + Sbjct: 76 IKVFRQEDVPISLGLVIDTSASMS---------------------NKRDRVNSAALAMVK 114 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + + V V E ++ T+ V + SL + AM+ Sbjct: 115 ASNPEDEV-------------FVISFSEEAFITQDFTSDVKQLESSLRKLGSKGETAMRD 161 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + ++ P+ +K ++ +TDGE+N+ + A + Sbjct: 162 ALSLGLD-------------HLRAPARKDKKVLVVITDGEDNSSIQKQE--NLIRAAHLS 206 Query: 361 FIKIVTISINASPNGQRLLKTCVS-------SPEYHYNVVNADSLIHVFQNISQLMVHR 412 + I I + A+ + S + + N + + I+ + ++ Sbjct: 207 NVIIYGIGLLAAEAPASAQRAKASLDVLTLATGGRSWYPENVADIEKITPEIAHEIRNQ 265 >gi|260781663|ref|XP_002585923.1| hypothetical protein BRAFLDRAFT_90333 [Branchiostoma floridae] gi|229270991|gb|EEN41934.1| hypothetical protein BRAFLDRAFT_90333 [Branchiostoma floridae] Length = 2692 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 55/173 (31%), Gaps = 19/173 (10%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + ++ Y+ + + + K Y D S + T + A+ Sbjct: 1125 VQNFDISETATRVAVVQYSDQFSTEFSLNAFSTKTEVYNAIDNISYLTGGTFTGFAIDFV 1184 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q + + + + ++ +TDG + + S D A+ Sbjct: 1185 MQSVFTS-------------ISGERDGYPDLLVVVTDGLSTDDVSGPA-----DTARAQG 1226 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + I + + S L+ V + SL+ + Q +SQ + Y Sbjct: 1227 VTIYAVGV-GSDIDFNTLEQIAGLTSRVSQVSDFSSLVTLSQTLSQDICTACY 1278 Score = 37.5 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 28/227 (12%), Positives = 56/227 (24%), Gaps = 41/227 (18%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P F + ++D SGS+ K + Sbjct: 294 NPEFDLIFLLDESGSIGT-----------------------DNFKLVKSFTERMANNFDI 330 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +G++ Y+ + + + + D + Sbjct: 331 SPNSTRVGVVQYSNFPGTEFSLN-------AFTDKAAVLDAISKIDYNGG------STFT 377 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 F N V +I +TDG N +V+ I A I + Sbjct: 378 GAAIDFVRNNEFTSVNGDRDDVPNILIVITDG---NPNDDVSGPAI--SANNAGITTYAV 432 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 I ++ + L++ P + L V + + + Y Sbjct: 433 GIGSNVDQANLVQMTAGRPGRVLQAADFTDLTTVVGTLQENVCDAVY 479 >gi|327542746|gb|EGF29212.1| protein containing DUF58 [Rhodopirellula baltica WH47] Length = 346 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 23/119 (19%), Positives = 45/119 (37%), Gaps = 8/119 (6%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E L+VD SGSM A + + +S+ ++K+ A+ A+ + S Sbjct: 78 EETNLRCTLLVDRSGSMKYAGSRALQLTDSSTSIGNRKQDYATAVAVAIAYMMLSSQDA- 136 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +GL+ + R+ ++ P ++ + TD ++QA L Sbjct: 137 -------VGLMTFDDRIGDSLPPRTTPSHLQSLLAILAGDSGGGETDLGTVIRQAMLKL 188 >gi|227533248|ref|ZP_03963297.1| yvcC protein [Lactobacillus paracasei subsp. paracasei ATCC 25302] gi|227189098|gb|EEI69165.1| yvcC protein [Lactobacillus paracasei subsp. paracasei ATCC 25302] Length = 596 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 69/234 (29%), Gaps = 25/234 (10%) Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 + + T QN++N +N + + + D + Sbjct: 53 TNSWQVTGQQNVINQRGGDQVSGWDNNTIWNGDATDTTNSYLKFGD-PNNPDYQIRKYAK 111 Query: 170 NQKIVSFIPALLRIEMGER---PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 L ++ + I LVVD+SGSM N + Sbjct: 112 ETNTPGLYDVYLNVKGNTQQNVKPVDIVLVVDMSGSMESKNNGGTD-------------- 157 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + A++ + FL SI + + V +GLIG+++ Y+ + Sbjct: 158 RAGAVRTGVKNFLTSIQNA-GLGDYVNVGLIGFSSPGYIGGG---NKTTGPGYIHVGL-- 211 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP-FQKFIIFLTDG 339 T A+ A + + + +K +I LTDG Sbjct: 212 GKAGNTSQQQAINSALSPTFNGGTYTQIGLRQGSAMLNEDTSGNKKMMILLTDG 265 >gi|32474268|ref|NP_867262.1| hypothetical protein RB6446 [Rhodopirellula baltica SH 1] gi|32444806|emb|CAD74808.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 346 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 23/119 (19%), Positives = 45/119 (37%), Gaps = 8/119 (6%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E L+VD SGSM A + + +S+ ++K+ A+ A+ + S Sbjct: 78 EETNLRCTLLVDRSGSMKYAGSRALQLTDSSTSIGNRKQDYATAVAVAIAYMMLSSQDA- 136 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +GL+ + R+ ++ P ++ + TD ++QA L Sbjct: 137 -------VGLMTFDDRIGDSLPPRTTPSHLQSLLAILAGDSGGGETDLGTVIRQAMLKL 188 >gi|149188837|ref|ZP_01867127.1| hypothetical protein VSAK1_05790 [Vibrio shilonii AK1] gi|148837257|gb|EDL54204.1| hypothetical protein VSAK1_05790 [Vibrio shilonii AK1] Length = 504 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 16/55 (29%), Positives = 21/55 (38%), Gaps = 6/55 (10%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 I V D SGSM + D +A C K+A LK A+ D + Sbjct: 130 DKNIDIVFVSDFSGSMSWQWGGNSSDPCTATNC------KIADLKVAVKEIADKL 178 >gi|111025338|ref|YP_707758.1| hypothetical protein RHA1_ro08556 [Rhodococcus jostii RHA1] gi|110824317|gb|ABG99600.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 68/239 (28%), Gaps = 51/239 (21%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L +D+S SM D +++AA + F + Sbjct: 86 RNRATVILAIDVSLSM---------------RATDVPPSRLAAAQAGAKTF------ADN 124 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + +GL + + + T + A+ A Q + Sbjct: 125 LTPGINLGLEAF---AGTASMLVSPITDHTATDNALDHLQLAERTATGEAIFTALQAI-- 179 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS--NVNTIKICDK-AKENFIKI 364 + V P I+ +DG+ + + AKE + I Sbjct: 180 --------DTLAGVVGGGGTPPPARIVLESDGKQTVPTDLNDPRGAFTAARLAKEQGVPI 231 Query: 365 VTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 TIS + + L + S + +AD L +QN+ Q + Sbjct: 232 STISFGTTHGAIDLNGSHIPVPVDDESLRRIAELSGGSFFTATSADELQASYQNLQQQI 290 >gi|149031330|gb|EDL86328.1| rCG38899 [Rattus norvegicus] Length = 1029 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 39/231 (16%), Positives = 75/231 (32%), Gaps = 42/231 (18%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D+SGSM TK+ K A+ L + Sbjct: 248 VVFVIDVSGSMFG--------------------TKLQQTKKAMDKILSDLQTSDSFNIIT 287 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + + + YV+R + TD A+ A +L + Sbjct: 288 FSDTVNIWKAEGSIQATVQNIHNAKNYVSRMEAN---GWTDINAALLAAASVL------N 338 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIVTISINA 371 +G + +P I+FLTDGE ++ + I +A + + + +++ Sbjct: 339 HSNQEPGKGRGVGQIPL---IMFLTDGEPTAGETTPSVILSNVRQALAHRVSLFSLAF-G 394 Query: 372 SPNGQRLLKTCVSSPEY-----HYNVVNADSLIHVFQNISQLM---VHRKY 414 LL+ + + + A L ++ IS+ + VH Y Sbjct: 395 DDADFSLLRRLSLENQGEARRIYEDADAALQLEGLYAEISRPLLADVHLDY 445 >gi|157827515|ref|YP_001496579.1| hypothetical protein A1I_06110 [Rickettsia bellii OSU 85-389] gi|157802819|gb|ABV79542.1| hypothetical protein A1I_06110 [Rickettsia bellii OSU 85-389] Length = 446 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 53/334 (15%), Positives = 99/334 (29%), Gaps = 61/334 (18%) Query: 47 HYKKNSMESANNAA---ILAGASKMVSNLSRLGDRFESISNHAKRAL-IDDAKRFIKNHI 102 KN ++ A +L K +LS++ E I+ K D + I Sbjct: 129 LSFKNPIKKDIKEATLKLLQATIKDEKSLSKILVNEEDITPFQKAIYHPTDFSQLITQIS 188 Query: 103 KESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 + + + N I V T + +S+ + +D Y L Sbjct: 189 SNEENSLNFIMNNGAIAQSVQVY----TADGKAPIIASDLKDGFIIDKQYLLKYLLPIFN 244 Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERP-IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + + F P ++ GE I L++D+SGSM + Sbjct: 245 GFIWNEEGKFPIMFAPKNPKVLDGENNYAHNISLLIDISGSMEKDFSVY----------- 293 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQY 279 KN +L LD + + + + ++ + E ++ Y Sbjct: 294 ----------KNNILKILDKLAEIPNW----QINIVVFNDESTARSFSNQENNIEDIKVY 339 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + T +K+A + S I+F TDG Sbjct: 340 INNLKAN---GYTKLYGTIKEALESFKGKIDESSTL-----------------IVF-TDG 378 Query: 340 ENNNFKSNVNTIKICD----KAKENFIKIVTISI 369 ++ SNV + D K + T+ Sbjct: 379 KDEGTNSNVTEKDVVDVTSEVIKNPQFNMYTVGF 412 >gi|183602734|ref|ZP_02964097.1| hypothetical protein BIFLAC_00845 [Bifidobacterium animalis subsp. lactis HN019] gi|183217972|gb|EDT88620.1| hypothetical protein BIFLAC_00845 [Bifidobacterium animalis subsp. lactis HN019] Length = 839 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 75/244 (30%), Gaps = 66/244 (27%) Query: 216 SAPICQDKKRTKMAALKNALLLFLDSI----DLLSHVKEDVYMGLIGYT----------- 260 + + T++ ALK+A+ FLD + ++ + V + LI Y Sbjct: 157 QFYKSKQSEETRLDALKDAVTYFLDQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDT 216 Query: 261 --------TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + +W E +++ T + ++ A + L S Sbjct: 217 YNEDGYNYNYSQTVHSLAWTPEDLQKEQAAVNSLKAGGATRADFGLQHAVKQLNS----- 271 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENFIKIVTI 367 QK +F +DG + N IK + K + ++++I Sbjct: 272 ------------GRPGAQKLTVFYSDGSPTSSDGFEAKIANNAIKAAAQLKNDHSQVISI 319 Query: 368 SIN--ASPNG----QRLLKTCVS---------------SPEYHYNVVNADSLIHVFQNIS 406 A P+G + + S Y+Y V L +F+ I Sbjct: 320 GAMPGADPSGTDNANKFMNYVSSNYPKAQSMSEPHDRVEGTYYYAVSARTDLQTIFKEII 379 Query: 407 QLMV 410 ++ Sbjct: 380 SIVT 383 >gi|293374377|ref|ZP_06620704.1| LPXTG-motif cell wall anchor domain protein [Turicibacter sanguinis PC909] gi|292647040|gb|EFF65023.1| LPXTG-motif cell wall anchor domain protein [Turicibacter sanguinis PC909] Length = 1099 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 58/155 (37%), Gaps = 12/155 (7%) Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN-QKIVSFIPAL 180 +S+ + + S N+ + +D + D +I++L + I+S + Sbjct: 213 ADSNTSDEYTRLTSSMSSENDGAVW-LDKSVTLDSNFNYIDNLAISNGDFNVILSALGQG 271 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 +E + + V+D+SGSM + D T+ A+ A+ L + Sbjct: 272 YNVESTAKTPANVVFVLDVSGSMKYKLGKDEWATKD-------SETRAYAMVKAVNLAIK 324 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + + +G++ ++ ++ + + T Sbjct: 325 QLVSQNP---QTKIGIVTFSEGSQQFLPLNNYTST 356 >gi|289177626|gb|ADC84872.1| Collagen adhesion protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 905 Score = 37.5 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 75/244 (30%), Gaps = 66/244 (27%) Query: 216 SAPICQDKKRTKMAALKNALLLFLDSI----DLLSHVKEDVYMGLIGYT----------- 260 + + T++ ALK+A+ FLD + ++ + V + LI Y Sbjct: 223 QFYKSKQSEETRLDALKDAVTYFLDQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDT 282 Query: 261 --------TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + +W E +++ T + ++ A + L S Sbjct: 283 YNEDGYNYNYSQTVHSLAWTPEDLQKEQAAVNSLKAGGATRADFGLQHAVKQLNS----- 337 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENFIKIVTI 367 QK +F +DG + N IK + K + ++++I Sbjct: 338 ------------GRPGAQKLTVFYSDGSPTSSDGFEAKIANNAIKAAAQLKNDHSQVISI 385 Query: 368 SIN--ASPNG----QRLLKTCVS---------------SPEYHYNVVNADSLIHVFQNIS 406 A P+G + + S Y+Y V L +F+ I Sbjct: 386 GAMPGADPSGTDNANKFMNYVSSNYPKAQSMSEPHDRVEGTYYYAVSARTDLQTIFKEII 445 Query: 407 QLMV 410 ++ Sbjct: 446 SIVT 449 >gi|260818477|ref|XP_002604409.1| hypothetical protein BRAFLDRAFT_220331 [Branchiostoma floridae] gi|229289736|gb|EEN60420.1| hypothetical protein BRAFLDRAFT_220331 [Branchiostoma floridae] Length = 192 Score = 37.5 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 32/235 (13%), Positives = 70/235 (29%), Gaps = 46/235 (19%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 F P + + V+D SGS+ D KM Sbjct: 1 MFTPGPTTPAPICNALMDLFFVLDGSGSV-----------------TDANFDKMKQFAKN 43 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 ++ D +G++ Y+ +K D T + Sbjct: 44 VVNAFDISASS------TRVGVVQYSDSNTLEFNLGDHADKPSTLAAIDSIVYQGGGTRT 97 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A N +G +P K +I +TDG++ + ++ Sbjct: 98 GSALEFA------------RVNAAWRGESVP-----KVMIVVTDGKSADSVTSSA----- 135 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + + I + + +LL+ + + + ++L + I+Q + Sbjct: 136 NNLASQGVDVYAIGV-GNYRSTQLLEIAAGNQNNVIELTDFNALSAEIEQIAQAV 189 >gi|218672263|ref|ZP_03521932.1| hypothetical protein RetlG_11787 [Rhizobium etli GR56] Length = 256 Score = 37.5 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 33/272 (12%), Positives = 74/272 (27%), Gaps = 50/272 (18%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I+ NF I+ AL ++ + G + + + +A +AA + + S Sbjct: 9 ISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGS---IAEKSSA 65 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + N DA+ + + L+ + + Sbjct: 66 VAAAMAMNGNGTISLGKTDARNIFMSQVSGELA----------------EVHVDLGIDVT 109 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + N+ + + V T+ + + I E Sbjct: 110 KTANKLNSQVSFTATVPTT-------FMQIFGRDSIT-----ISGTATAEYQTAAFMDFY 157 Query: 195 LVVDLSGSM----------------HCAMNSDPEDVNSAPICQDKK---RTKMAALKNAL 235 +++D + SM CA D ++ K ++ ++ A Sbjct: 158 ILLDNTPSMGVGATPSDVSKLEAKTGCAFACHQMDKSTNNYTIAKSLGVAMRIDVVRQAT 217 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 D+ + MG+ + T+ E Sbjct: 218 QALTDTAKTERISSDQFRMGVYTFGTKAEDAK 249 >gi|271964249|ref|YP_003338445.1| hypothetical protein Sros_2742 [Streptosporangium roseum DSM 43021] gi|270507424|gb|ACZ85702.1| conserved hypothetical protein [Streptosporangium roseum DSM 43021] Length = 774 Score = 37.5 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 71/232 (30%), Gaps = 56/232 (24%) Query: 184 EMGERPIFLIELVVDLSGSM-HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + LV+D SGSM KM A + A +D++ Sbjct: 286 APATANAKDVVLVLDRSGSMTGW---------------------KMVAARRAAARIVDTL 324 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-----QYVTRDMDSLILKPTDSTPA 297 ++ + VE+ G + + V T+ Sbjct: 325 TGGD------RFAVLSFDGVVEQPEGLGEGLSEASDRNRYRAVEHLARLEARGGTEMLAP 378 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 ++QA +L+ + + ++ +TDG+ N + I A Sbjct: 379 LEQAVALLSESGRD-------------------RVLVLVTDGQVGNEDQILERIG----A 415 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + +++ T+ I+ + N L + V + D L ++I + + Sbjct: 416 RLAGVRVHTVGIDRAVNAGFLGRLAGLGSGRCELVESEDRLDEAMEHIHRRI 467 >gi|224106794|ref|XP_002314287.1| predicted protein [Populus trichocarpa] gi|222850695|gb|EEE88242.1| predicted protein [Populus trichocarpa] Length = 688 Score = 37.5 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 62/171 (36%), Gaps = 43/171 (25%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + R + V+D+SGSM TK+A LK A+ + Sbjct: 253 SPQSSQDSRAPVDLVTVLDVSGSMSG--------------------TKLALLKRAMGFVI 292 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAM 298 ++ + +I +++ ++ TE + + ++SL+ T+ + Sbjct: 293 QNLGPSD------RLSVIAFSSTARRHFPLRRMTETGKLEALQAVNSLVSSGGTNIAEGL 346 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 ++ ++++ K ++ + II L+DG++ S + Sbjct: 347 RKGFKVVVDRKWKNPVCS----------------IILLSDGQDTYTISGTS 381 >gi|113970537|ref|YP_734330.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. MR-4] gi|113885221|gb|ABI39273.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. MR-4] Length = 759 Score = 37.5 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 47/245 (19%), Positives = 84/245 (34%), Gaps = 60/245 (24%) Query: 182 RIEMGERPIFLIEL--VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 ++E E+P EL V+D SGSM + KNAL L Sbjct: 366 KVEASEQPNLPRELILVIDTSGSMAGD--------------------SIIQAKNALRYAL 405 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVE--KNIEP---SWGTEKVRQYVTRDMDSLILKPTDS 294 + +I + + V + + RQ+V R T+ Sbjct: 406 RGLRPQDSFN------IIEFNSDVSLLSSTPLPATATNLAMARQFVNRLQAD---GGTEM 456 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ--KFIIFLTDGENNNFKSNVNTIK 352 A+ A RQ S + + +IF+TDG N + I+ Sbjct: 457 AQALNSA---------------LPRQAFNTASGEDKSLRQVIFMTDGSVGNESALFELIR 501 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ +N ++ T+ I ++PN + + + + D + Q IS+L+ Sbjct: 502 --NQIGDN--RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGDVDEVE---QKISKLLAKI 554 Query: 413 KYSVI 417 +Y V+ Sbjct: 555 QYPVL 559 >gi|209527393|ref|ZP_03275900.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492184|gb|EDZ92532.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 463 Score = 37.5 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 58/193 (30%), Gaps = 33/193 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK--------RTKMAALKNALL 236 RP V+D SGSM+ + + D K +TK+ + +L Sbjct: 35 SASRPSTTFSFVIDTSGSMYEVLEGEETIPTGNSYFLDGKQYTQVTGGKTKIDQVIESLE 94 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + S S + L+ + + + T+ T Sbjct: 95 GLVSSGQADSRD----RIALVRFDDSASVLLPLTASTDTASLKNAIGQLRNFSGGTRMAL 150 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 M++A IL K L ++ +IF TDG+ + + + Sbjct: 151 GMEEALNIL-----------------KNCDLSSRRTLIF-TDGQTFDESDCRDLATQFAE 192 Query: 357 AKENFIKIVTISI 369 A I I + + Sbjct: 193 A---GIPITALGV 202 >gi|13471293|ref|NP_102862.1| hypothetical protein mll1222 [Mesorhizobium loti MAFF303099] gi|14022037|dbj|BAB48648.1| mll1222 [Mesorhizobium loti MAFF303099] Length = 638 Score = 37.5 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 34/291 (11%), Positives = 85/291 (29%), Gaps = 60/291 (20%) Query: 92 DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 + + F N + +L + F V+++ S + +++ Sbjct: 166 NRVQDFKTNPVHAALEDPVSTFSID-----VDTASYSFVRRSLKEGSVPQADTVRVEEMI 220 Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLR------------IEMGERPIFLIELVVDL 199 + Y + + + + V P + I+ E+P + ++D+ Sbjct: 221 NYFPYDWKGPDSVSTPFNSTVSVMPTPWNAQTKLMHVAIKGFDIKPTEQPKANLVFLIDV 280 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 SGSM + K+ LK+A L + + + Y G Sbjct: 281 SGSMD-------------------EPDKLPLLKSAFRLLVSKLRADDTISIVTYAG---- 317 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 + + + + + T +K+AY L N Sbjct: 318 ---EAGTVLMPTRAAEKDKILNAIDNLTPGGSTAGEAGIKEAY-KLAQQSFVKDGVNR-- 371 Query: 320 QGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENFIKIVTISI 369 ++ TDG+ N + + ++ ++ +++ + + Sbjct: 372 -------------VMLATDGDFNVGQSDDDDLKRLIEQERKSGVFLSVFGF 409 >gi|326435125|gb|EGD80695.1| hypothetical protein PTSG_01285 [Salpingoeca sp. ATCC 50818] Length = 1006 Score = 37.5 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 71/218 (32%), Gaps = 46/218 (21%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + +VD+SGS+ A + D ++D+L + Sbjct: 308 AVDVGFIVDVSGSVGAANFALVRD-----------------------FIASTVDMLPVGE 344 Query: 250 EDVYMGLIGYTTRVEKNIEP--SWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILT 306 V + L+ Y + + S+ V ++ + D T + A+ Sbjct: 345 NTVRVALMTYHSSNMPQFDFDDSFDRATVVSAISSLVYDDSRQYGTATGSALN------- 397 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 FF + Q +F TDG + +V + ++VT Sbjct: 398 ------FFADNMLQASAGYRGGPAIVYVF-TDG---ASQDDVTPGA--AALQATGAQVVT 445 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I + A+ N L+ SSP + V + DSL ++ Sbjct: 446 IGVTAAVNEAE-LQEIASSPSDVFIVADFDSLTDEVRD 482 >gi|67921226|ref|ZP_00514745.1| hypothetical protein CwatDRAFT_5710 [Crocosphaera watsonii WH 8501] gi|67857343|gb|EAM52583.1| hypothetical protein CwatDRAFT_5710 [Crocosphaera watsonii WH 8501] Length = 223 Score = 37.5 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 60/175 (34%), Gaps = 20/175 (11%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL-------- 244 I ++ D+SGSM C ++D + K + K A + L F++ Sbjct: 35 IIVLFDMSGSMQC--STDLNRKGNC-NKVPKGQRKYDAAIDTLKNFVEEAKTWKGKTQVS 91 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWG------TEKVRQYVTRDM-DSLILKPTDSTPA 297 + V G ++ + + ++ +++ D+ + T+ + Sbjct: 92 IVPFGNRVKTGNCNFSEPFVNVSPDNLNDFQNVESTELTKFLNTDLSNKTPCTATNIYES 151 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + ++ + LT+D F+ II TDG + + TI Sbjct: 152 LYKSVEFLTNDSDERFYPVNREGEPTDNQPRLS--IILFTDGFYTDTFNKDETIA 204 >gi|307352559|ref|YP_003893610.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307155792|gb|ADN35172.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 1022 Score = 37.5 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 16/85 (18%), Positives = 39/85 (45%), Gaps = 11/85 (12%) Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--ASPNGQRLLKTCVS-SP 386 +K+ F + N+ + ++ AK+N IKI +I+ S G+ +L+ + Sbjct: 700 EKYYTF---SDLNSSEQDMRIF-----AKDNDIKIFSIAYADGISSEGKAVLQALAEGTG 751 Query: 387 EYHYNVVNADSLIHVFQNISQLMVH 411 +Y + + L ++++I+ + Sbjct: 752 GKYYYAPSGEDLEEIYEDIAGELKE 776 >gi|238796988|ref|ZP_04640492.1| Flp pilus assembly protein TadG [Yersinia mollaretii ATCC 43969] gi|238719248|gb|EEQ11060.1| Flp pilus assembly protein TadG [Yersinia mollaretii ATCC 43969] Length = 536 Score = 37.5 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 44/263 (16%), Positives = 79/263 (30%), Gaps = 61/263 (23%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I ++K + F + + F+ L+ + + KK + A A LA + + Sbjct: 26 IKNDKGAILLPFVIFLPIFIGLLFLSFEISQFLQKKAKLSDAIEQATLA----LTVENND 81 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHI-KESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + D +S N D + ++ E S T I +I N+ Sbjct: 82 IPDANQSQKNR------DLVTHYATAYLPSEKFS--------TPIIDISNN--------- 118 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + Y + SY QF+ + I A + +G + + Sbjct: 119 -------KGHLLYKAETTMSY--PAQFLANSPLANTKISIADSGAARKDVAVGPSELTDV 169 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 VVD S SM + R+K+ L++ +I Sbjct: 170 VFVVDYSSSMINPFHGTYG-----------SRSKIDELRDIFYKLNGNILKND------- 211 Query: 254 MGLIGYTTRVEKNIEPSWGTEKV 276 I SWG +K+ Sbjct: 212 ------NINTIGFIPFSWGIKKI 228 >gi|223933146|ref|ZP_03625138.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis 89/1591] gi|330833239|ref|YP_004402064.1| LPXTG-motif cell wall anchor domain-containing protein [Streptococcus suis ST3] gi|223898207|gb|EEF64576.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis 89/1591] gi|329307462|gb|AEB81878.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis ST3] Length = 997 Score = 37.5 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 46/159 (28%), Gaps = 29/159 (18%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 G + F +VVD S SM MNS + + T L + Sbjct: 463 TGTKEPFDTLIVVDRSTSMTDPMNSVDTQARYLAVYKALNGT------AGRQGLLSKLVG 516 Query: 245 LSHVKEDVYMGLIGY----TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + +G GY + + + +WG T+ T ++ Sbjct: 517 FHPENQVAIVGFQGYPGYPSGDQDSTVIANWGRSTSVALSNIQPPYN--NGTNYTAGLRT 574 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 A +L +K +IF++DG Sbjct: 575 AGVVL-----------------DQNQSSRKKVMIFISDG 596 >gi|163850366|ref|YP_001638409.1| hypothetical protein Mext_0933 [Methylobacterium extorquens PA1] gi|163661971|gb|ABY29338.1| conserved hypothetical protein [Methylobacterium extorquens PA1] Length = 473 Score = 37.5 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 52/359 (14%), Positives = 115/359 (32%), Gaps = 26/359 (7%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + ++ + + +++FAL+V+ + L+G + K +++A +AA LAG Sbjct: 6 RLKGRATALASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGV 65 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 ++ + + ++ K K F N K A ++++ + + Sbjct: 66 VTAKEFIAANAQQSDVMTAGIKAGEYQALKAFNVNASKVPF----ATVSLSQLEIVRSGQ 121 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 + T + S+ +F T+ R+ + +V ++ Sbjct: 122 TLDATVSYTATVQSTFGRLFGLSA--TTLTNRVNASADIAGYLDFYLMVDVSGSMGLPTT 179 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 L V+ G+ C + + K + + A+ NA+ LD Sbjct: 180 DSDAALLASKSVEDQGN--CQFACHFPNRKGWNLAAGKIQLRSDAVNNAVCALLDRASK- 236 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 V +G+ + R+ S T + + D P T + L Sbjct: 237 PIVPNQYRIGIYPFINRLATLAPLSDTTTSLAS-LKTTADCGKAWPLAFTNLLDTGSTQL 295 Query: 306 TSDKKRSFFTNFFRQGVKI----------------PSLPFQKFIIFLTDGENNNFKSNV 348 ++ + T + S + F+ +TDG N+ + Sbjct: 296 FTNNDPTTGTGSGGTHFETALPQMKSTIRTFGNGSSSANPKPFVFLITDGMQNSQTYSS 354 >gi|156342098|ref|XP_001620878.1| hypothetical protein NEMVEDRAFT_v1g222619 [Nematostella vectensis] gi|156206294|gb|EDO28778.1| predicted protein [Nematostella vectensis] Length = 252 Score = 37.5 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 28/124 (22%), Positives = 48/124 (38%), Gaps = 16/124 (12%) Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 S T++ A++ A+Q+ T+ K S R ++ +TDG +N K Sbjct: 115 SRSGGKTNTQDALELAFQMFTTSKYGSTPGGLAR-------------VLVVTDGRSNIEK 161 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQN 404 K K N I++ I+I G L ++ + Y V + L V + Sbjct: 162 HRTERKAF--KLKANGIEVFVIAIGDYLEGMDELARMANTKYAHMYRVEDVKGLARVVKL 219 Query: 405 ISQL 408 I +L Sbjct: 220 IPRL 223 >gi|255068089|ref|ZP_05319944.1| neisseria PilC protein [Neisseria sicca ATCC 29256] gi|255047687|gb|EET43151.1| neisseria PilC protein [Neisseria sicca ATCC 29256] Length = 1097 Score = 37.5 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 11/65 (16%), Positives = 25/65 (38%), Gaps = 4/65 (6%) Query: 359 ENFIKIVTISI---NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + ++ T+ + Q L++ S +++N D L F+ I + S Sbjct: 317 KQTVQTFTVGFGNGISPAGEQYLIRG-ASHDGWYFNAAKPDDLYKAFEKIISQISDDNAS 375 Query: 416 VILKG 420 + +G Sbjct: 376 IPFEG 380 >gi|229188238|ref|ZP_04315314.1| hypothetical protein bcere0004_57400 [Bacillus cereus BGSC 6E1] gi|228595258|gb|EEK53002.1| hypothetical protein bcere0004_57400 [Bacillus cereus BGSC 6E1] Length = 425 Score = 37.5 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 78/218 (35%), Gaps = 29/218 (13%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + +E+++D SGSM +N + KM A K A+ +LD I Sbjct: 120 SKAKSLNVEILLDASGSMAGKVNGEV---------------KMEAAKKAIYNYLDKI--- 161 Query: 246 SHVKEDVYMGLIGYT-TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +V + + G+ + E + S G+ +V + + + Sbjct: 162 -PDNANVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYNKEQFNAALSKFGP--KGWTP 218 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L S + G + ++ +I+ +DGE VN K +++ + + + Sbjct: 219 LASAIESVNDDFKEYTGEENLNVV---YIV--SDGEETCGGDPVNAAKNLNQSNTHAV-V 272 Query: 365 VTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHV 401 I + + Q+ LK + + V +AD L Sbjct: 273 NIIGFDVKNSEQQQLKYTAEAGKGNYATVSSADELHQT 310 >gi|127512721|ref|YP_001093918.1| vault protein inter-alpha-trypsin subunit [Shewanella loihica PV-4] gi|126638016|gb|ABO23659.1| Vault protein inter-alpha-trypsin domain protein [Shewanella loihica PV-4] Length = 776 Score = 37.5 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 49/282 (17%), Positives = 96/282 (34%), Gaps = 47/282 (16%) Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 TH N +S ++ +N V++ ++ + + + +V +P + + R Sbjct: 344 THEFKNE-ESLASSQAHNDQVVSEANHPAEAQASDKEAKDSYALVMLMPP--QDKARVRL 400 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + LV+D SGSM +A K+A+L L + Sbjct: 401 PRELTLVIDTSGSMTGD--------------------SIAQAKSAILNALAGLGSQDTFN 440 Query: 250 EDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTS 307 +I + + V + T + SL T+ PA+ L + Sbjct: 441 ------VIAFDSSVRSLSPVALSATAANLGKANLFVQSLEADGGTEMAPAL------LRA 488 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + VK L K ++F+TDG N ++ A ++ T+ Sbjct: 489 LSQPESGVSSISSAVKPERL---KQVVFITDG----AVGNEASLFALIAANIGRQRLFTV 541 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I A+PNG + + + + V + I +L+ Sbjct: 542 GIGAAPNGYFMERAARAGRGTYTYVGKISEVDA---KIGELL 580 >gi|332798629|ref|YP_004460128.1| hypothetical protein TepRe1_0633 [Tepidanaerobacter sp. Re1] gi|332696364|gb|AEE90821.1| hypothetical protein TepRe1_0633 [Tepidanaerobacter sp. Re1] Length = 185 Score = 37.5 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 15/46 (32%), Positives = 24/46 (52%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN 57 KK + E N +IFA S++ IG + V + K+NSM++ Sbjct: 6 KKFLQKEDGNILLIFAGSMVLIAFFIGICLDVSMIYVKRNSMQNIL 51 >gi|168705263|ref|ZP_02737540.1| hypothetical protein GobsU_37375 [Gemmata obscuriglobus UQM 2246] Length = 987 Score = 37.5 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 43/350 (12%), Positives = 89/350 (25%), Gaps = 44/350 (12%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 + + + ++ + + + + ++ + A L G ++ ++N + Sbjct: 10 RGRRGTILPLLGVCLIGLFGFVALAVDLGMLAVSRTQSQNGADVAALVG-TRTLNNRDGV 68 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + A +A + N + +S Y + + + Sbjct: 69 AYNNLPAAVTAAQASVTSNPHLSTNFVSGEVSKMEVGQYLYDPTSQTFQVQNWTQVTGGG 128 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + + + M V F+ V+ +P+ RP I Sbjct: 129 AMSAPGGNSWTAMRVTLGVSQPTYFMRVF--------GVNSMPSGAVATAVYRPR-DIAF 179 Query: 196 VVDLSGSM--HCAMNSDPEDVNSAPICQDKK---------RTKMAALKNALLLFLDSIDL 244 V+D++GSM NS +S+ T + + A +S + Sbjct: 180 VLDMTGSMAFSSTFNSGNAQGSSSDYQSMNPDPLVPKAGHYTTVQSRIVAADNLANSSNE 239 Query: 245 LSHVKEDVYM---GLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMD------------SLI 288 G + WGT V VT D S Sbjct: 240 ALPRNNFTITTPGGPPIVRSYYYDPSN--WGTPSTVASPVTTKGDGSANLLPAFHRWSPP 297 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 DS + Y + + P +TD Sbjct: 298 ESGADSDNYIAPTYNF-----AGYNAFHKGNETTPQGPTPAPDTYGTMTD 342 >gi|67078187|ref|YP_245807.1| D-amino acid dehydrogenase, large subunit [Bacillus cereus E33L] gi|66970493|gb|AAY60469.1| conserved hypothetical protein [Bacillus cereus E33L] Length = 452 Score = 37.5 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 77/217 (35%), Gaps = 29/217 (13%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + +E+++D SGSM +N + KM A K A+ +LD I Sbjct: 148 KEKSLNVEILLDASGSMAGKVNG---------------QVKMEAAKKAIYNYLDKI---- 188 Query: 247 HVKEDVYMGLIGYT-TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +V + + G+ + E + S G+ +V + + + L Sbjct: 189 PDNANVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYKKEQFNAALSKFGP--KGWTPL 246 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S + G + ++ +I+ +DGE VN K +++ + + + Sbjct: 247 ASAIESVNDDFKEYTGEENLNVV---YIV--SDGEETCGGDPVNAAKNLNQSSTHAV-VN 300 Query: 366 TISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHV 401 I + + Q+ L + + V NAD L Sbjct: 301 IIGFDVKNSEQQQLMNTAEAGKGNYATVSNADELYQT 337 >gi|218192783|gb|EEC75210.1| hypothetical protein OsI_11473 [Oryza sativa Indica Group] Length = 365 Score = 37.5 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 11/54 (20%), Positives = 23/54 (42%), Gaps = 2/54 (3%) Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYH--YNVVNADSLIHVFQNISQLMV 410 +NF KI ++ + LL+ C +S + Y N + +F + + + Sbjct: 226 KNFSKIKKMAFSEPEILHNLLQKCTTSMANYIKYQADNGAQAVQIFDSWATELS 279 >gi|168186710|ref|ZP_02621345.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] gi|169295292|gb|EDS77425.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] Length = 693 Score = 37.5 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 29/233 (12%), Positives = 73/233 (31%), Gaps = 54/233 (23%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ L+ A F++ + +K + +GL+ Y ++ + +++ + + Sbjct: 184 KIDELQKAAKNFVNKFE----IKANTKIGLVSYGNNGKEVHSLTNELDRINSSIDSGLS- 238 Query: 287 LILKPTDSTPAMKQA---------------------------YQILTSDKKRSFFTNFFR 319 I T+ ++ A Y + R + Sbjct: 239 -IGGGTNVGDGIRMANGILNNGSDADKYIVLMTDGMPTAATCYNDIYYKNNRFYSKYGEI 297 Query: 320 QGVKIPSLPFQKFI--------------------IFLTDGENNNFKSNVN-TIKICDKAK 358 L + +I I + G+N+ +N + + +A Sbjct: 298 LNGNGNPLSYFNYIGNYKYKFEYNPNDYTHEDEKIIMNYGDNDYGNIALNYSKEALKRAS 357 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 E+ + I + N ++L Y+ ++ D L V++ I+ + + Sbjct: 358 ESGVNNFVIGFSNGINREKLNGIATEGNGYYREAMHGDELTDVYKRIADEINN 410 >gi|296506525|ref|YP_003667759.1| hypothetical protein BMB171_P0145 [Bacillus thuringiensis BMB171] gi|296327112|gb|ADH10039.1| hypothetical protein BMB171_P0145 [Bacillus thuringiensis BMB171] Length = 452 Score = 37.5 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 78/217 (35%), Gaps = 29/217 (13%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + +E+++D SGSM +N + KM A K A+ +LD I Sbjct: 148 KAKSLNVEILLDASGSMAGKVNGEV---------------KMEAAKKAIYNYLDKI---- 188 Query: 247 HVKEDVYMGLIGYT-TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +V + + G+ + E + S G+ +V + + + L Sbjct: 189 PDNANVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYNKEQFNAALSKFGP--KGWTPL 246 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S + G + ++ +I+ +DGE VN K +++ + + + Sbjct: 247 ASAIESINDDFKEYTGEENLNVV---YIV--SDGEETCGGDPVNAAKNLNQSSTHAV-VN 300 Query: 366 TISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHV 401 I + + Q+ LK + + V +AD L Sbjct: 301 IIGFDVKNSEQQQLKNTAEAGKGNYATVSSADELHQT 337 >gi|167617233|ref|ZP_02385864.1| hypothetical protein BthaB_13083 [Burkholderia thailandensis Bt4] Length = 396 Score = 37.5 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 9/63 (14%), Positives = 28/63 (44%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 SI+ AL + + +G + + + ++ ++++ ++ LA A + ++ Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADSCALAAARDLTGAINLSVPEAAG 60 Query: 82 ISN 84 I+ Sbjct: 61 ITA 63 >gi|238915987|ref|YP_002929504.1| hypothetical protein EUBELI_00011 [Eubacterium eligens ATCC 27750] gi|238871347|gb|ACR71057.1| Hypothetical protein EUBELI_00011 [Eubacterium eligens ATCC 27750] Length = 745 Score = 37.5 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 44/274 (16%), Positives = 81/274 (29%), Gaps = 58/274 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT-KMAALKNALLLFLDSIDLLSHVKED 251 + LV+D S SM C+++ D + AP + + + + A++ L FL Sbjct: 162 VILVLDRSFSMACSVDEDVDSDAMAPTYEKTRWSVTINAVEKFLNEFL-------PEGTS 214 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL-TSDKK 310 + +I Y I +K+ + + + + + + Sbjct: 215 NKVSVISYCGSARTEITNESSKDKIMSKLNSIYNRNMYNEDYKNSSKRYNVTQIGRGLGS 274 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDG-----EN------------------NNFKSN 347 + +Q +I +I TDG +N N + Sbjct: 275 ATNIQQGLKQVSEIAGDTTGASVILFTDGGANRYDNGEIGGYYYIKNNNETNRYNKPRDE 334 Query: 348 VNTIKICDKA----KENFIKIVTISINASPN----------GQRLLKTCVS--------- 384 VN KA K I TI + + + G + L S Sbjct: 335 VNGSYYAGKAGEELKAAGADIYTIVLMSKESDITDLVKVSLGNKSLTYEKSWEKTYFTFS 394 Query: 385 ---SPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + Y NA+ L + F+ I M + Sbjct: 395 DGGYAKEFYTAANAEQLNNRFKQIMTEMTSLPFE 428 >gi|194226347|ref|XP_001489610.2| PREDICTED: similar to Collagen, type VI, alpha 2 [Equus caballus] Length = 1019 Score = 37.5 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 62/189 (32%), Gaps = 49/189 (25%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL--S 246 + V+D S S+ T KN ++ ++ + + Sbjct: 612 GALDVVFVIDSSESIG--------------------YTNFTLEKNFVINVVNRLGAIAKD 651 Query: 247 HVKED-VYMGLIGYTTR-----VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 E +G++ Y+ ++ + E ++ V I T + A+K Sbjct: 652 PKSETGTRVGVVQYSHEGTFEAIQLDDERINSLSSFKEAVKNL--EWIAGGTWTPSALKF 709 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AY L + +R + F + +TDG ++ ++N +C+ + Sbjct: 710 AYNQLIKESRRQKT---------------RVFAVVITDGRHDPRDDDLNLRALCN----H 750 Query: 361 FIKIVTISI 369 + + I I Sbjct: 751 DVTVTAIGI 759 >gi|298484180|ref|ZP_07002346.1| BatB protein [Bacteroides sp. D22] gi|298269684|gb|EFI11279.1| BatB protein [Bacteroides sp. D22] Length = 342 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 66/192 (34%), Gaps = 41/192 (21%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 ++ Y ++F + +++ ++E +R + + +D+S SM Sbjct: 50 VSKYRPDVKFWIIFVAIGLFSVLLARPQFGSKLETVKRKGVEVIIALDISNSMLA----- 104 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 QD + +++ K + +D +D + V G+I + + + Sbjct: 105 ----------QDVQPSRLEKAKRLISRLVDELD-----NDKV--GMIVFAGDAFTQLPIT 147 Query: 271 WGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + ++ SLI K T A+ A + T P Sbjct: 148 SDYISAKMFLESINPSLISKQGTAIGEAINLAARSFT------------------PQEGV 189 Query: 330 QKFIIFLTDGEN 341 + II +TDGEN Sbjct: 190 GRAIIVITDGEN 201 >gi|237716506|ref|ZP_04546987.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262408104|ref|ZP_06084652.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|294645098|ref|ZP_06722824.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294809499|ref|ZP_06768202.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|229444153|gb|EEO49944.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262354912|gb|EEZ04004.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292639604|gb|EFF57896.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294443317|gb|EFG12081.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] Length = 342 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 66/192 (34%), Gaps = 41/192 (21%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 ++ Y ++F + +++ ++E +R + + +D+S SM Sbjct: 50 VSKYRPDVKFWIIFVAIGLFSVLLARPQFGSKLETVKRKGVEVIIALDISNSMLA----- 104 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 QD + +++ K + +D +D + V G+I + + + Sbjct: 105 ----------QDVQPSRLEKAKRLISRLVDELD-----NDKV--GMIVFAGDAFTQLPIT 147 Query: 271 WGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + ++ SLI K T A+ A + T P Sbjct: 148 SDYISAKMFLESINPSLISKQGTAIGEAINLAARSFT------------------PQEGV 189 Query: 330 QKFIIFLTDGEN 341 + II +TDGEN Sbjct: 190 GRAIIVITDGEN 201 >gi|19703911|ref|NP_603473.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] gi|19714079|gb|AAL94772.1| hypothetical protein FN0576 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] Length = 369 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 80/238 (33%), Gaps = 30/238 (12%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 ++ E +E+V+D SGSM + +T M K ++ L + Sbjct: 8 EKIEENMNVNVEIVLDASGSMVKKIG---------------DKTMMEIAKESIKKVLSEM 52 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK-QA 301 V G+ + + + + + + D + + ++ Sbjct: 53 ----PANAKV--GIRVFGHKGDNTASKKDESCGANELIYPIGDLNVEGIEKALEPIQPTG 106 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + + G K ++ +II TDG + V K K + Sbjct: 107 WTSIAKSIEYGVEDLKALDGEKTLNIL---YII--TDGIETCGGNPVEIAKQL-KGENTN 160 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHY-NVVNADSLIHVFQNISQL-MVHRKYSVI 417 I + I N N RLLK + +Y +V +A+ L I++L K+ V+ Sbjct: 161 IVLGIIGFNVDANQNRLLKQIADAAGGYYSSVNDANKLTGELYRINELAFSDYKWEVL 218 >gi|291059650|gb|ADD72385.1| putative von Willebrand factor type A domain protein [Treponema pallidum subsp. pallidum str. Chicago] Length = 650 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 66/202 (32%), Gaps = 34/202 (16%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 +R + VVD SGSM + ++ M +LK+ + D Sbjct: 420 VRACTLRPAQIEVSFVVDNSGSM---------NKEKIASAREALAVSMLSLKD-FGEYSD 469 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI------LKPTDS 294 + + ++ + + + K G K + + + + T+ Sbjct: 470 MLAAGRRERTTIHSEVYYFGSSFIKVKSF--GKSKSKDFNSAQLIKASVNLDGRFGGTND 527 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K + +D +R R+ F K ++ +TDG ++ + TI Sbjct: 528 AEVLKH----ILADVER-------RRARVSSDTSFVKVVLVITDGCSSYPHESRRTI--- 573 Query: 355 DKAKENFIKI--VTISINASPN 374 ++ + + I I + + Sbjct: 574 EELRRRGVMIFGFQIGLMSPEE 595 >gi|160882769|ref|ZP_02063772.1| hypothetical protein BACOVA_00730 [Bacteroides ovatus ATCC 8483] gi|237720675|ref|ZP_04551156.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|260170238|ref|ZP_05756650.1| hypothetical protein BacD2_00060 [Bacteroides sp. D2] gi|293373991|ref|ZP_06620332.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|299145609|ref|ZP_07038677.1| BatB protein [Bacteroides sp. 3_1_23] gi|315918601|ref|ZP_07914841.1| conserved hypothetical protein [Bacteroides sp. D2] gi|156111793|gb|EDO13538.1| hypothetical protein BACOVA_00730 [Bacteroides ovatus ATCC 8483] gi|229449510|gb|EEO55301.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|292631067|gb|EFF49704.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|298516100|gb|EFI39981.1| BatB protein [Bacteroides sp. 3_1_23] gi|313692476|gb|EFS29311.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 342 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 66/192 (34%), Gaps = 41/192 (21%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 ++ Y ++F + +++ ++E +R + + +D+S SM Sbjct: 50 VSKYRPDVKFWIIFVAIGLFSVLLARPQFGSKLETVKRKGVEVIIALDISNSMLA----- 104 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 QD + +++ K + +D +D + V G+I + + + Sbjct: 105 ----------QDVQPSRLEKAKRLISRLVDELD-----NDKV--GMIVFAGDAFTQLPIT 147 Query: 271 WGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + ++ SLI K T A+ A + T P Sbjct: 148 SDYISAKMFLESISPSLISKQGTAIGEAINLAARSFT------------------PQEGV 189 Query: 330 QKFIIFLTDGEN 341 + II +TDGEN Sbjct: 190 GRAIIVITDGEN 201 >gi|15639238|ref|NP_218686.1| hypothetical protein TP0246 [Treponema pallidum subsp. pallidum str. Nichols] gi|189025479|ref|YP_001933251.1| hypothetical protein TPASS_0246 [Treponema pallidum subsp. pallidum SS14] gi|14285852|sp|O83274|Y246_TREPA RecName: Full=Uncharacterized protein TP_0246 gi|3322523|gb|AAC65240.1| predicted coding region TP0246 [Treponema pallidum subsp. pallidum str. Nichols] gi|189018054|gb|ACD70672.1| hypothetical protein TPASS_0246 [Treponema pallidum subsp. pallidum SS14] Length = 597 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 66/202 (32%), Gaps = 34/202 (16%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 +R + VVD SGSM + ++ M +LK+ + D Sbjct: 367 VRACTLRPAQIEVSFVVDNSGSM---------NKEKIASAREALAVSMLSLKD-FGEYSD 416 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI------LKPTDS 294 + + ++ + + + K G K + + + + T+ Sbjct: 417 MLAAGRRERTTIHSEVYYFGSSFIKVKSF--GKSKSKDFNSAQLIKASVNLDGRFGGTND 474 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K + +D +R R+ F K ++ +TDG ++ + TI Sbjct: 475 AEVLKH----ILADVER-------RRARVSSDTSFVKVVLVITDGCSSYPHESRRTI--- 520 Query: 355 DKAKENFIKI--VTISINASPN 374 ++ + + I I + + Sbjct: 521 EELRRRGVMIFGFQIGLMSPEE 542 >gi|162424746|gb|ABX90059.1| hedgling [Amphimedon queenslandica] Length = 2416 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 65/218 (29%), Gaps = 51/218 (23%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D SGS+ + + AL ++ Sbjct: 184 LDVVFVLDQSGSIGYYNH-----------------------QLALNFLSKVVEFFKIGAN 220 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDK 309 +GLI Y+T + + R T + + QA + + Sbjct: 221 KTQVGLITYSTHAYVQFDL-NDYHSKSTILNRISRIYYTGGWTATALGLFQA--GVILNP 277 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 ++ QGV + +I LTDG +N + + D I++ T+ + Sbjct: 278 QQMRGARPISQGVP-------RVVILLTDGRSNRVPIDEVAPSLHDF----GIQVYTVGV 326 Query: 370 N---ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 LK S P+ ++ L+ F + Sbjct: 327 GNIYLPE-----LKFIASDPDPYHIF-----LLDSFSD 354 >gi|291231970|ref|XP_002735935.1| PREDICTED: chloride channel accessory 2-like, partial [Saccoglossus kowalevskii] Length = 849 Score = 37.5 bits (85), Expect = 4.5, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 71/223 (31%), Gaps = 51/223 (22%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 L++D+SGSM + + K+ + + + E + Sbjct: 303 LLLDISGSMSS----------------NNRIEKLGQV-----ASIYILLTADDDDE---L 338 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--LILKPTDSTPAMKQAYQILTSDKKRS 312 G++ + + + +E R + + + I T + +A +L Sbjct: 339 GMVVFNDQPSTRSQMVTISESTRLDLLELIPTRDDIGDATGIGSGLSEAIDVL------E 392 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICDKAKENFIKIVTISIN 370 N G II ++DGE N +V + + + + TI++ Sbjct: 393 NGGNDAAGGC----------IILVSDGEENRSPYIDDVQS-----TIVDKGVCVHTIALG 437 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVN--ADSLIHVFQNISQLMVH 411 + + + Y N +++L F I++ + Sbjct: 438 VDASHNMEQLPLATDGKSFYYSENPYSNALNEAFITIAKQDTN 480 >gi|113971723|ref|YP_735516.1| putative outer membrane adhesin like protein [Shewanella sp. MR-4] gi|113886407|gb|ABI40459.1| putative outer membrane adhesin like protein [Shewanella sp. MR-4] Length = 1215 Score = 37.2 bits (84), Expect = 4.6, Method: Composition-based stats. Identities = 56/362 (15%), Positives = 115/362 (31%), Gaps = 65/362 (17%) Query: 48 YKKNSMESANNAAI---LAGASKMVSNLSRLGDRFESISNHAKRALID---DAKRFIKNH 101 YK N++ A NA + +A S M + + +++ + D A+ + Sbjct: 187 YKGNAVSGAANATLTTDVATDSIMSNQWQARNGDMKWLNHSTANNIGDVNRTAQGRVYGK 246 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 + + +R T +ANN D+SN +S +L+F+ Sbjct: 247 SAWEVLTQDVKDDPKSGRKTAQPTRTRYTTLANNAPDASNPVKTELPAAQSSCRDQLKFV 306 Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 +++V+D SGSM+ Sbjct: 307 ------------------------WVEGDIDMQIVMDRSGSMYG---------------- 326 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + A +D+ S MGL+ +++R + + +++ + T Sbjct: 327 ----SPINNAIQAAKTLVDATAEGSTA-----MGLVSFSSRSSVKQDFA--VQQIPKPDT 375 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 +L + + A L + Q P F+ L DG++ Sbjct: 376 GIKQALKAAIDNIYASGSTA---LFDGSSLALDNLITYQTAAASGAPGVVFV--LADGDD 430 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 N+ N +T+ + + I + +ASP G + + +Y + +I Sbjct: 431 NSSIKNESTVIT--AYQNANVPIFSFGYGSASPTGPLVTMANATGGKYFSSPTTLSEIID 488 Query: 401 VF 402 F Sbjct: 489 AF 490 >gi|296327481|ref|ZP_06870027.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] gi|296155307|gb|EFG96078.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] Length = 530 Score = 37.2 bits (84), Expect = 4.7, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 80/238 (33%), Gaps = 30/238 (12%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 ++ E +E+V+D SGSM + +T M K ++ L + Sbjct: 169 EKIEENMNVNVEIVLDASGSMVKKIG---------------DKTMMEIAKESIKKVLSEM 213 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK-QA 301 V G+ + + + + + + D + + ++ Sbjct: 214 ----PANAKV--GIRVFGHKGDNTASKKDESCGSNELIYPIGDLNVEGIEKALEPIQPTG 267 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + + G K ++ +II TDG + V K K + Sbjct: 268 WTSIAKSIEYGVEDLKALDGEKTLNIL---YII--TDGIETCGGNPVEIAKQL-KGENTN 321 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHY-NVVNADSLIHVFQNISQL-MVHRKYSVI 417 I + I N N RLLK + +Y +V +A+ L I++L K+ V+ Sbjct: 322 IVLGIIGFNVDANQNRLLKQIADAAGGYYSSVNDANKLTGELYRINELAFSDYKWEVL 379 >gi|255262383|ref|ZP_05341725.1| von Willebrand factor, type A [Thalassiobium sp. R2A62] gi|255104718|gb|EET47392.1| von Willebrand factor, type A [Thalassiobium sp. R2A62] Length = 634 Score = 37.2 bits (84), Expect = 4.7, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 53/170 (31%), Gaps = 44/170 (25%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 ++ E P + ++D SGSM + K+ L +A L L + Sbjct: 273 DVSEHPPLNLVFLIDTSGSM-------------------NQPDKLPLLISAFRLMLSELR 313 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAY 302 V Y G G + R + ++ L T ++QAY Sbjct: 314 PEDEVSIITYAGSAG--------QVLAPTPASDRATILAALNRLSAGGSTAGQAGLRQAY 365 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 I + + +I TDG+ N ++ +T+K Sbjct: 366 AIAAAMSEDGEIAR----------------VILATDGDFNVGLNDPDTLK 399 >gi|331006836|ref|ZP_08330094.1| hypothetical protein IMCC1989_793 [gamma proteobacterium IMCC1989] gi|330419347|gb|EGG93755.1| hypothetical protein IMCC1989_793 [gamma proteobacterium IMCC1989] Length = 693 Score = 37.2 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 86/244 (35%), Gaps = 59/244 (24%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 IP + LV+D+SGSM +DP++ L+ L Sbjct: 13 IPLSYAQAPEAAKPSDVRLVIDISGSMK---KNDPQN-----------------LRRPAL 52 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVE-----KNIEPSWG-TEKVRQYVTRDMDSLILK 290 L + + + G+ + V K ++ WG T + + Sbjct: 53 DMLVQL-----LPKGSKAGIWTFGQYVNMLVPHKPVDAQWGRTASAASSEIKSIAQF--- 104 Query: 291 PTDSTPAM-KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG----ENNNFK 345 T+ A+ K AY + +Q +I LTDG + +N Sbjct: 105 -TNIGAALEKAAYD---------------HKQQMKADQDYQTHVILLTDGMVDIDRDNRL 148 Query: 346 SNVNTIKICDKA----KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + +I + +++ I + TI+++ + + + L K +++ NA+ L++V Sbjct: 149 NKKERQRILNNVLPMYQQSGITLHTIALSDNADKKLLNKLALATDGKVSVAKNAEELMNV 208 Query: 402 FQNI 405 F + Sbjct: 209 FLRV 212 >gi|320169699|gb|EFW46598.1| hypothetical protein CAOG_04556 [Capsaspora owczarzaki ATCC 30864] Length = 501 Score = 37.2 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 27/129 (20%), Positives = 41/129 (31%), Gaps = 33/129 (25%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP I V+D SGSM TK+ +K L + + Sbjct: 40 RPTLDIVAVIDKSGSMAG--------------------TKLELVKKTLETLVAQLRACD- 78 Query: 248 VKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + L+ Y T V ++ S G K Q V D T+ + + + Sbjct: 79 -----RLALVTYDTEVTLDLALSPMDDKGRSKATQVVNGIRDG---SSTNLSGGLLEGLN 130 Query: 304 ILTSDKKRS 312 IL + S Sbjct: 131 ILRNRPTDS 139 >gi|307260995|ref|ZP_07542677.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306869297|gb|EFN01092.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 539 Score = 37.2 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 40/262 (15%), Positives = 85/262 (32%), Gaps = 13/262 (4%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 ++ I E ++++ L + L LI + + + + A+L+ + Sbjct: 6 LSQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTA- 64 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY-NTEIQNIVNSSR 126 N S D +S K D + + ++ + F T+ +N+ + Sbjct: 65 --ENNSGRKDNDYKLSGSNKENDSFDISSEVGKRDSQMVTKFVKAFLPQTDEKNMHLTPL 122 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + + + +S++ + + + L Q V+ L+ Sbjct: 123 CKTINNNSGKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPKQVNVASKSKALKKNTF 182 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 PI + +V DLSGSM +D + +K+ L+ L D Sbjct: 183 NIPI-DLMVVADLSGSM--------KDGIKGEKLEGGTNSKIYILREVLKELADKSLFTQ 233 Query: 247 HVKEDVYMGLIGYTTRVEKNIE 268 E +G+ + E E Sbjct: 234 ESNEYNRIGITAFAMGAEHPKE 255 >gi|32475537|ref|NP_868531.1| hypothetical protein RB8767 [Rhodopirellula baltica SH 1] gi|32446079|emb|CAD75908.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 291 Score = 37.2 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 55/157 (35%), Gaps = 25/157 (15%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E + L+VD S S+ S K L+ L + +S Sbjct: 73 EERELAVMLMVDCSASLDFGTQSQT--------------------KRELVTELGATLAMS 112 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAM----KQA 301 +K + +GL ++ VEK+ P G+ V + + + TD A+ + A Sbjct: 113 AIKNNDRVGLTLFSEDVEKSFPPRQGSRHVLRLIREMLTHPSSGSGTDVGAALEHLQRTA 172 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + F + + + +++ S + +TD Sbjct: 173 KRRTVVFLISDFQSKDYEKSLRVASRKHDLIPVVVTD 209 >gi|295132199|ref|YP_003582875.1| von Willebrand factor (vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980214|gb|ADF50679.1| von Willebrand factor (vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 347 Score = 37.2 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 59/194 (30%), Gaps = 45/194 (23%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 +E +R I +D+S SM D +++ K + L + Sbjct: 82 METVKREGVDIVFAIDVSKSMDAE---------------DIAPSRLEKSKQIVRQILGGL 126 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQA 301 +G+I Y + + + ++ D + + T A+ A Sbjct: 127 GSD-------RVGIIAYAGSAFPQLPITTDYAAAKMFLQALNTDMISSQGTAIGDAIDLA 179 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 D + + + + ++DGE N I D+A E Sbjct: 180 TTYYDDDNQTN------------------RVLFIISDGE----DHEGNVESIADEAAEKG 217 Query: 362 IKIVTISINASPNG 375 I+I TI + G Sbjct: 218 IRIYTIGVGTEKGG 231 >gi|229113208|ref|ZP_04242703.1| hypothetical protein bcere0018_54170 [Bacillus cereus Rock1-15] gi|228670234|gb|EEL25582.1| hypothetical protein bcere0018_54170 [Bacillus cereus Rock1-15] Length = 425 Score = 37.2 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 78/217 (35%), Gaps = 29/217 (13%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + +E+++D SGSM +N + KM A K A+ +LD I Sbjct: 121 KAKSLNVEILLDASGSMAGKVNGEV---------------KMEAAKKAIYNYLDKI---- 161 Query: 247 HVKEDVYMGLIGYT-TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +V + + G+ + E + S G+ +V + + + L Sbjct: 162 PDNANVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYNKEQFNAALSKFGP--KGWTPL 219 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S + G + ++ +I+ +DGE VN K +++ + + + Sbjct: 220 ASAIESVNDDFKEYTGEENLNVV---YIV--SDGEETCGGDPVNAAKNLNQSSTHAV-VN 273 Query: 366 TISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHV 401 I + + Q+ LK + + V +AD L Sbjct: 274 IIGFDVKNSEQQQLKNTAEAGKGNYATVSSADELHQT 310 >gi|156408868|ref|XP_001642078.1| predicted protein [Nematostella vectensis] gi|156229219|gb|EDO50015.1| predicted protein [Nematostella vectensis] Length = 257 Score = 37.2 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 16/81 (19%), Positives = 36/81 (44%), Gaps = 3/81 (3%) Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG-QRLLKTCVSSP 386 P ++ I+ TDG++N T+ + K+ ++I +++ G ++ SS Sbjct: 155 PGRRRILIFTDGQSN--VKEQMTLYRAFRLKKRGVEIYVVAVGKYLYGMHEIIGLATSSS 212 Query: 387 EYHYNVVNADSLIHVFQNISQ 407 + Y V + + + Q I + Sbjct: 213 HHLYRVRSMKDFVKIVQLIPR 233 >gi|307109844|gb|EFN58081.1| hypothetical protein CHLNCDRAFT_142385 [Chlorella variabilis] Length = 654 Score = 37.2 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 22/114 (19%), Positives = 41/114 (35%), Gaps = 12/114 (10%) Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI-------PSLPFQKFIIFLTDGENNN 343 T + A ++ + + + Q + + I+ +TDG+ + Sbjct: 526 ATGAEVAANNSFVSMVVNTTPDAHGTWMAQAITTCQGVLLEEDTQASRTIVLITDGKPTD 585 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 + T + D AK IK+V + A LK S P++ + N DS Sbjct: 586 PQP---TFEAADAAKARGIKMVVVG--AGEIDYATLKALASGPQFVFANTNLDS 634 >gi|228472734|ref|ZP_04057492.1| BatB protein [Capnocytophaga gingivalis ATCC 33624] gi|228275785|gb|EEK14551.1| BatB protein [Capnocytophaga gingivalis ATCC 33624] Length = 353 Score = 37.2 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 41/262 (15%), Positives = 82/262 (31%), Gaps = 72/262 (27%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++E +R I +D+S SM D K ++ K+ + ++ Sbjct: 81 KLETVKREGVDIVFAIDVSKSMLAE---------------DVKPNRIEKAKHIISQLIEV 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQ 300 + + I Y + + + + ++ + L + T A++ Sbjct: 126 L-------HGDRVAFIPYAAQAYPQLPLTSDYSSAKIFLEGINTNMLSSQGTAIGEAIQM 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + S K +I L+DGE + + V+T+ +AK+ Sbjct: 179 AINYFEESSQTS------------------KILIILSDGE--DHQQGVDTVIQ--EAKDK 216 Query: 361 FIKIVTIS--------INASPNG------------------QRLLKTCVSS-PEYHYNVV 393 I++ TI I S NG Q LL+ ++N Sbjct: 217 GIRLFTIGLGTAQGATIPVSENGQIVAKRDNNGQVVITKLNQALLEEIAQEGGGKYFNGA 276 Query: 394 NADSLIHVFQNISQLMVHRKYS 415 N ++ Q + +Y Sbjct: 277 NTKEVLDALQKALDTIEKNEYE 298 >gi|261883870|ref|ZP_06007909.1| hypothetical protein CfetvA_01086 [Campylobacter fetus subsp. venerealis str. Azul-94] Length = 216 Score = 37.2 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 78/220 (35%), Gaps = 39/220 (17%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + I +++D SGSM + +T+M A K+A+ F++S+ ++ Sbjct: 28 KESLNIMVILDASGSMA---------------NIEGDQTRMEAAKSAINSFMESLPQEAN 72 Query: 248 VKEDVY----MGLIGYTTRVEKNIEPSWGTEKV--RQYVTRDMDSLILKPTDSTPAMKQA 301 V +Y G R + E + +K ++ + + T A+K+A Sbjct: 73 VGLRIYGHEGSGSKTDKARSCSSSELVYSIDKYDPAKFNQALVKAKPAGWTPIGFALKEA 132 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + L + + + + ++DG + V + K + Sbjct: 133 QKDLAAFQGDANTN----------------IVYLVSDGISTCDDDPVGSAKALYDSDITP 176 Query: 362 IKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIH 400 I I I N GQ+ L+ + + NV A+ L Sbjct: 177 I-INVIGFNVDQEGQKQLQEVAKVTEGTYQNVQMAEGLYD 215 >gi|302670289|ref|YP_003830249.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] gi|302394762|gb|ADL33667.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] Length = 568 Score = 37.2 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 63/192 (32%), Gaps = 42/192 (21%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 I+M E P+ + ++D+SGSM R K+ L+ + Sbjct: 198 KTGEIDMEEAPVSNLVFLIDVSGSMS-------------------SRNKLPLLQKSFDEL 238 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 +DS+ + Y G +V + EP + +++ + + + T+ M Sbjct: 239 VDSLPDEGTISIVTYSG----EEKVVLSGEPMSNKKGIKKAIDKLHAN---GCTNGQAGM 291 Query: 299 KQAY---QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD-------------GENN 342 ++AY Q + + + + F+TD GE N Sbjct: 292 QKAYEIAQKYFIEGGNNRVIMATDGDLNVGISDLDDLEKFITDKKDEGVFLSILGFGEGN 351 Query: 343 NFKSNVNTIKIC 354 + T+ C Sbjct: 352 YKDDKMQTLADC 363 >gi|221111402|ref|XP_002161005.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 1100 Score = 37.2 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 65/222 (29%), Gaps = 42/222 (18%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + E+ I I ++D SGS+ K L L Sbjct: 57 KNEVCNEGIVDIGFIMDSSGSLG------------------KNYKNEKDLLKTLASLFSI 98 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMK 299 G+I ++ E +I+ + +D + L T ++ Sbjct: 99 ------KPNGSQAGVITFSFYTEHSIKL--NQFSDQDSFNDAVDRIPLMGHTTRIDKGLR 150 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + + + K ++ LTDG K ++ I D+ ++ Sbjct: 151 LAQKEMF-------------KVENGGRPGVSKLLVLLTDGSQTQGKGVIDPAIIADEIRK 197 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + I+ I I N L+K Y+ + + L Sbjct: 198 QGVPIIAIGIGKEINKNELIKI-GGGEANTYSADDFEKLKES 238 >gi|110556625|ref|NP_997091.3| calcium-activated chloride channel regulator 4 [Mus musculus] gi|148680067|gb|EDL12014.1| mCG119588 [Mus musculus] gi|148922513|gb|AAI46305.1| Chloride channel calcium activated 6 [synthetic construct] gi|151556758|gb|AAI48748.1| Chloride channel calcium activated 6 [synthetic construct] Length = 925 Score = 37.2 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 80/237 (33%), Gaps = 58/237 (24%) Query: 193 IELVVDLSGSMHC--AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LV+D+SGSM +N + + R+ + + + + V E Sbjct: 309 MCLVLDVSGSMTSYDRLNRMNQAAKYFLSQIIENRSWVGMVHFSSQATI--------VHE 360 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + + + +E+N + + + T +K A+Q+ + + Sbjct: 361 LIQI-----NSDIERNQLL------------QTLPTSANGGTSICSGIKAAFQVFKNGEY 403 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISI 369 ++ T I+ L+DGE++ T K C D+ K++ + I++ Sbjct: 404 QTDGTE----------------ILLLSDGEDS-------TAKDCIDEVKDSGSIVHFIAL 440 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADS---LIHVFQNISQ---LMVHRKYSVILKG 420 + + + H + LI F ++ + + + KG Sbjct: 441 -GPSADLAVTNMSILTGGNHKLATDEAQNNGLIDAFGALASENADITQKSLQLESKG 496 >gi|319949307|ref|ZP_08023384.1| hypothetical protein ES5_07781 [Dietzia cinnamea P4] gi|319437027|gb|EFV92070.1| hypothetical protein ES5_07781 [Dietzia cinnamea P4] Length = 326 Score = 37.2 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 74/240 (30%), Gaps = 53/240 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + LVVD+S SM D +++ A + A ++ Sbjct: 86 RNRATVMLVVDVSLSMES---------------TDVSPSRLEAAQQAATT------FANN 124 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILT 306 + V +GL+ Y R V R +D L L T + A+ A Q +T Sbjct: 125 LTPGVNLGLVSY----AGTASMLVAPTTDRGPVVRAVDRLSLDERTATGEAIYTATQAIT 180 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI---KICDKAKENFIK 363 + + + P + I+ L+DG+ ++A E I Sbjct: 181 TFTESLGGPD---------QAPPAR-IVLLSDGKETVPADPTEERGAFTAAERAAEAGIP 230 Query: 364 IVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + TIS + L S + + L V++ + + + Sbjct: 231 VSTISFGTLYGTVDIQGRPQPVPVDDASLRTIAELSGGDFFTASTLEELDSVYRTLEEQI 290 >gi|332879551|ref|ZP_08447246.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682517|gb|EGJ55419.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 352 Score = 37.2 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 66/194 (34%), Gaps = 45/194 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +++ +R + +D+S SM D +++ K + +D Sbjct: 81 KMDTRKRQGIEAIIAMDISNSMMAE---------------DVTPSRLEKSKMLVSNIVDK 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQ 300 + D +GLI Y + + + ++ S+I TD A+ Sbjct: 126 M-------TDDKIGLIVYAGEAYTQLPITSDYVSAKIFLETINPSMITTQGTDIKQAIDL 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + TS++ K I +TDGE+N + +++ A E Sbjct: 179 AMKSFTSNQ------------------DVSKAIFVITDGEDNEGGA----VEMAKAAAEK 216 Query: 361 FIKIVTISINASPN 374 IK+ + + + Sbjct: 217 GIKVYVLGVGSPQG 230 >gi|258627123|ref|ZP_05721919.1| hypothetical protein VMB_32200 [Vibrio mimicus VM603] gi|258580641|gb|EEW05594.1| hypothetical protein VMB_32200 [Vibrio mimicus VM603] Length = 335 Score = 37.2 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 51/273 (18%), Positives = 90/273 (32%), Gaps = 54/273 (19%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEM----------GERPIF--LIELVVDLSGSMHCAMNS 209 LLN + Q I + LL + R F + +VVDLSGSM Sbjct: 55 AGLLNASHWQHIALIVSWLLIVTALAKPSVLGEVQTREAFGRDVLMVVDLSGSME----- 109 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 ++ +++ A K L F+ ++ GLI + Sbjct: 110 ----EKDFATESGEQLSRLTAAKKVLRNFVTQ-------RQGDRFGLILFGDAAFIQTPF 158 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + + + + T+ A+ F Q PS Sbjct: 159 TADQNVWLNLLDEAETGMAGQSTNLGDAI-------------GLGIKVFEQ---SPSTSQ 202 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLL------KTC 382 + ++ LTDG N+ S V+ + A I+I I++ N G++ L + Sbjct: 203 DQIMLVLTDG--NDTGSFVSPVDAAKIAAAKGIRIYVIAMGDPENVGEQPLDMDVVNRVS 260 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + ++ L +Q I Q + +KYS Sbjct: 261 SLTQARSFVAIDQPQLNEAYQVIDQ-LEPQKYS 292 >gi|319783082|ref|YP_004142558.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168970|gb|ADV12508.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 704 Score = 37.2 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 66/204 (32%), Gaps = 45/204 (22%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + +++ ++ E+P + ++D+SGSM + K Sbjct: 315 NTHTRLMHVAIKGFDVKPTEQPKANLVFLIDVSGSMD-------------------EPDK 355 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + LK+A L + + + Y G + + + +D+L Sbjct: 356 LPLLKSAFRLLVSKLKADDTISIVTYAG--------DAGTVLEPTKASQKDKILSAIDNL 407 Query: 288 ILKPTDSTPA-MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFK 345 + + A +K+AY+ L N ++ TDG+ N Sbjct: 408 TPGGSTAGEAGIKEAYR-LAQKSFVKDGVNR---------------VMLATDGDFNVGQS 451 Query: 346 SNVNTIKICDKAKENFIKIVTISI 369 + + ++ +K ++ + + Sbjct: 452 DDDDLKRLIEKERKTGVFLSVFGF 475 >gi|239629502|ref|ZP_04672533.1| von Willebrand factor domain containing protein [Lactobacillus paracasei subsp. paracasei 8700:2] gi|239528188|gb|EEQ67189.1| von Willebrand factor domain containing protein [Lactobacillus paracasei subsp. paracasei 8700:2] Length = 909 Score = 37.2 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 69/234 (29%), Gaps = 25/234 (10%) Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 + + T QN++N +N + + + D + Sbjct: 56 TNSWQVTGQQNVINQRGGDQVSGWDNNTIWNGDATDTTNSYLKFGD-PNNPDYQIRKYAK 114 Query: 170 NQKIVSFIPALLRIEMGER---PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 L ++ + I LVVD+SGSM N + Sbjct: 115 ETNTPGLYDVYLNVKGNTQQNVKPVDIVLVVDMSGSMESKNNGGTD-------------- 160 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + A++ + FL SI + + V +GLIG+++ Y+ + Sbjct: 161 RAGAVRTGVKNFLTSIQNA-GLGDYVNVGLIGFSSPGYIGGG---NKTTGPGYIRVGL-- 214 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP-FQKFIIFLTDG 339 T A+ A + + + +K +I LTDG Sbjct: 215 GKAGNTSQQQAINSALSPTFNGGTYTQIGLRQGSAMLNADTSGNKKMMILLTDG 268 >gi|308510204|ref|XP_003117285.1| hypothetical protein CRE_01843 [Caenorhabditis remanei] gi|308242199|gb|EFO86151.1| hypothetical protein CRE_01843 [Caenorhabditis remanei] Length = 409 Score = 37.2 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 20/145 (13%), Positives = 51/145 (35%), Gaps = 11/145 (7%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 T++ + + I + + + +GL+ Y + + G + + Y D Sbjct: 71 TQVDGNIATVFGYDTRIGVRDYEPKTTRVGLVSYN----ADAKILAGLDTYQSY-----D 121 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 L DS ++ + + + F ++K +I + Sbjct: 122 DLANGVFDSLNSVSATDESYLAKGLSAAEKVFEEGKSTANRTQYKKVVIVYAS--SYKGT 179 Query: 346 SNVNTIKICDKAKENFIKIVTISIN 370 +N + + D+ K +KI+T++ + Sbjct: 180 GELNPVPVADRMKTAGVKIITVAFS 204 >gi|291225695|ref|XP_002732834.1| PREDICTED: MUscle Positioning family member (mup-4)-like [Saccoglossus kowalevskii] Length = 317 Score = 37.2 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 32/243 (13%), Positives = 73/243 (30%), Gaps = 59/243 (24%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + G P + V+D SGS+ K+ L+ Sbjct: 95 TGSFFAASNGNPPPIDLLFVIDKSGSIG-----------------QSDFNKIIEHIKELV 137 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTT--RVEKNIEP--------SWGTEKVRQYVTRDMDS 286 + + + I Y++ +V+ + S ++ + + Sbjct: 138 DLF----TVEISVDKTRVSAISYSSSNKVDLDFNFRRCLFESSSASKTCIKSEIDKI--D 191 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T +T A+ +A + + K + +TDG +N Sbjct: 192 FEGGSTHTTKALVKA-------------RDEAFKSFHGSRTNSHKVLFLVTDGRSNG--- 235 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS--PEYHYNVVNADSLIHVFQN 404 N ++ + K + ++I + + + L++ VS P++ + F Sbjct: 236 NGPLVETANSLKNDDVEIYALGVTSDVVEAE-LRSIVSDPIPDHLFY-------YDTFNA 287 Query: 405 ISQ 407 +SQ Sbjct: 288 VSQ 290 >gi|282858825|ref|ZP_06267970.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] gi|282588394|gb|EFB93554.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] Length = 340 Score = 37.2 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 65/195 (33%), Gaps = 47/195 (24%) Query: 178 PALLRIEMGERPIFLIELV--VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 P + + F IE V +D+S SM QD +++ K + Sbjct: 76 PQIGNRISTTKHSFGIETVIALDISNSMLA---------------QDVVPSRLDKSKLLI 120 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 D + V GLI + + + + ++ SLI + TD Sbjct: 121 EDLFRIFD-----NDKV--GLIVFAGDAFVQLPITSDFISAKMFLDNINPSLIGTQGTDI 173 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ A P+ K I+ +TDGE+N ++ + Sbjct: 174 GQAINLA------------------MHSFSPTSKSGKAIVVITDGEDNEGRAEE----MA 211 Query: 355 DKAKENFIKIVTISI 369 KA++ I++ + + Sbjct: 212 SKAQKAGIQVYILGV 226 >gi|302557483|ref|ZP_07309825.1| secreted protein [Streptomyces griseoflavus Tu4000] gi|302475101|gb|EFL38194.1| secreted protein [Streptomyces griseoflavus Tu4000] Length = 417 Score = 37.2 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 72/227 (31%), Gaps = 32/227 (14%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 F E + + L++D+SGSM +++MAA K A Sbjct: 17 FPAGAAAGEPTGQSAPKVNLLLDVSGSMRAK--------------DIDGQSRMAAAKQAF 62 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L E+V +G+ + + + Y + P + T Sbjct: 63 NEVL------DATPEEVELGIRTLGANYPGDDRKEGCKDTAQLY--------PVGPLNRT 108 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A K A L + + K I+ ++DGE+ + + Sbjct: 109 EA-KTAVATLAPTGWTPIGPALLKAADDLDGGDGSKRIVLISDGEDTCAPLDPCEVAREI 167 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 A+ + I T+ + + R L C++ + +V + D L Sbjct: 168 AARGIGLTIDTLGLVPNAKLSRQLS-CIAEATGGTYTSVEHQDELTD 213 >gi|226941651|ref|YP_002796725.1| outer membrane adhesin like proteiin [Laribacter hongkongensis HLHK9] gi|226716578|gb|ACO75716.1| putative outer membrane adhesin like proteiin [Laribacter hongkongensis HLHK9] Length = 2392 Score = 37.2 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 69/206 (33%), Gaps = 21/206 (10%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + LL + I + I + + +++D SGSM M++ Sbjct: 1647 DTLLGDSSGEIITGDVSGSGGISTPGMN-YNLCIILDSSGSMESYMSTGSY--------- 1696 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 ++ KNA+ LD++ + + ++ + + + + +R Sbjct: 1697 -SSSARIDVAKNAIQTLLDNMADFDG---TINLSIVDF----DSGSKMALQGVTLRDLCV 1748 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII---FLTD 338 +D T + A QA +T+ ++ + F + P + F+TD Sbjct: 1749 QDAQGNWHLDTSAGSAFMQAMGNITATGGTNYESAFMQAEQWFSQQPTDGYTNHTYFVTD 1808 Query: 339 GENNNFKSNVNTIKICDKAKENFIKI 364 G+ N KI + + + Sbjct: 1809 GKPTARYENSFADKISHTVSFDSVTV 1834 >gi|253582981|ref|ZP_04860199.1| magnesium chelatase [Fusobacterium varium ATCC 27725] gi|251835187|gb|EES63730.1| magnesium chelatase [Fusobacterium varium ATCC 27725] Length = 632 Score = 37.2 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 64/180 (35%), Gaps = 44/180 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I VVD SGSM + +M A+K A++ L ++ Sbjct: 451 SILFVVDSSGSMGV-------------------KKRMEAVKGAVMSLLK-----DAYEKR 486 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + R +K E T + + T + +AY I+ ++ ++ Sbjct: 487 DRVGMVSF--RRDKAEELLPITRSIDLAQKKLEKLATGGKTPLAEGIAKAYTIIKNEMRK 544 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN----VNTIKICDKAKENFIKIVTI 367 I+FL+DG+ N S ++++ +K K I+ + I Sbjct: 545 DK--------------EVVPLIVFLSDGKGNFSASGKDPVKESLEMAEKIKNEGIRAIVI 590 >gi|209521403|ref|ZP_03270114.1| von Willebrand factor type A [Burkholderia sp. H160] gi|209498150|gb|EDZ98294.1| von Willebrand factor type A [Burkholderia sp. H160] Length = 275 Score = 37.2 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 39/285 (13%), Positives = 81/285 (28%), Gaps = 45/285 (15%) Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 +V ++ + + A T+ + + F L V Sbjct: 13 LVTATACAASLQAVAVSQPEPGTVTVTVRAPGAAPSADAFTLQLPGADTTPTRVPAQSVE 72 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 E+ + L VD SGSMH + + A+K AL L Sbjct: 73 AANELSPDLATAVLLCVDRSGSMH---------------------SAVPAIKAALKDVLA 111 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMK 299 + D+ + L+ + + + V + V T A+ Sbjct: 112 R------PRPDLRIALMSFGSDTPAPTPFYSESAPVIEAVDAIRAETGRDGKTRLYDALN 165 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 +L + R ++ I+ +TDG++ ++ + + + + + Sbjct: 166 IGMSMLANVPLRG----------------PKRLIV-ITDGKDEGSQTRFDVLSVLLQGRG 208 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + +A L S N SL+ +N Sbjct: 209 QPMDAIAFGQSAQKTSSGLATLANKSSGAFVLATNPSSLVEALRN 253 >gi|121595491|ref|YP_987387.1| type 4 fimbrial biogenesis protein PilY1 [Acidovorax sp. JS42] gi|120607571|gb|ABM43311.1| type 4 fimbrial biogenesis protein PilY1 [Acidovorax sp. JS42] Length = 1154 Score = 37.2 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 70/226 (30%), Gaps = 24/226 (10%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF---LIELVVDLSGSMHCA 206 +R + L+ + + + G + + + +D SGSM+ Sbjct: 5 ASARRHFRQTPLAVLIGAALSSQSALAALDFAQAPPGTVEPYVAPNVIISIDDSGSMNWR 64 Query: 207 MNSDPEDVNS---APICQDKKRT--KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 ++ + + + P T +M LK AL + LL K + + Sbjct: 65 LDRENDRGATDNVTPTNGSWLSTDRRMNVLKYALKQVFNDTSLLPDGK--IRLAWQVMHN 122 Query: 262 R--------VEKNIEPSWGTEKVR-----QYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 V+ + + ++ +++ T + KQA + + Sbjct: 123 NGDAPDAKNVDSDTMKTNSMRVLKGSHRDNFISFINSLRPNNGTPTHKMFKQADDYMRRE 182 Query: 309 KKRSFFTNFFRQGVKIPSLPFQK-FIIFLTDGENNNFKSNVNTIKI 353 + P L ++ + I +TDG N++ T + Sbjct: 183 LGINSPWASEPGVKDKPYLGCRRNYHIVMTDGRWNSYSDTYQTGNL 228 >gi|4139906|pdb|1AOX|A Chain A, I Domain From Integrin Alpha2-Beta1 gi|4139907|pdb|1AOX|B Chain B, I Domain From Integrin Alpha2-Beta1 Length = 203 Score = 37.2 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 64/191 (33%), Gaps = 32/191 (16%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILK 290 +A+ FL+ + L +GLI Y T K ++ + + Sbjct: 24 DAVKNFLEKFVQGLDIGPTKTQVGLIQYANNPRVVFNL--NTYKTKEEMIVATSQTSQYG 81 Query: 291 P--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T++ A++ A + S K ++ +TDGE+++ Sbjct: 82 GDLTNTFGAIQYARKYAYSAA-------------SGGRRSATKVMVVVTDGESHDGSMLK 128 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNADSL 398 I C+ + I I++ N L +K S P Y +NV + +L Sbjct: 129 AVIDQCN---HDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAAL 185 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 186 LEKAGTLGEQI 196 >gi|255513821|gb|EET90086.1| von Willebrand factor type A [Candidatus Micrarchaeum acidiphilum ARMAN-2] Length = 705 Score = 37.2 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 41/218 (18%), Positives = 76/218 (34%), Gaps = 53/218 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + I +++D+SGSM K+ A K L DS+D Sbjct: 522 KDAGAEIWMLLDISGSMGGQ--------------------KINAAKRILGSIHDSLD--- 558 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + V++ + G+ S GT M+ + T + A+ A ++ Sbjct: 559 -GSKYVHLRMFGFYG--------SDGTHVFEFDRKMLMNLAAMGDTPTDIAIYYAMDLM- 608 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K F K + +TDG+ NN + N + A +N + + T Sbjct: 609 ----------------KKDKSNFDKTLFIITDGDPNNGQETKNALNSLKNAMKN-VNVFT 651 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I I+ + S ++++NV + D + V + Sbjct: 652 IFISREAARAVEI---FSPSDWYFNVSSMDEVEKVLEK 686 >gi|327540682|gb|EGF27254.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 887 Score = 37.2 bits (84), Expect = 5.6, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 48/140 (34%), Gaps = 27/140 (19%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 N+ + I A I+ +RP + ++D SGSM K+ Sbjct: 480 PWNENNRLVRVGIQAK-DIDRKKRPRCNLVFLIDTSGSM-------------------KR 519 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K+ + + + LD ++ V VY G G ++ ++ ++ + Sbjct: 520 PNKLPLVIEGMKVLLDQLNKKDRVAIVVYAGSSG--LVLDSTP-----VKQKKKIIRALS 572 Query: 285 DSLILKPTDSTPAMKQAYQI 304 T+ ++ AYQ Sbjct: 573 ALSAGGSTNGGAGLQLAYQT 592 >gi|332140408|ref|YP_004426146.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii str. 'Deep ecotype'] gi|327550430|gb|AEA97148.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii str. 'Deep ecotype'] Length = 1355 Score = 37.2 bits (84), Expect = 5.6, Method: Composition-based stats. Identities = 21/105 (20%), Positives = 33/105 (31%), Gaps = 25/105 (23%) Query: 330 QKFIIFLTDGE--NNNFKSNVNTI-------------------KICDKAKE---NFIKIV 365 I+ L+DGE NN+ S + T+ I D + + Sbjct: 580 NNHIVLLSDGEANNNHSVSKIQTLLGQSCTGNGGEKCGLDLVRNISDTSTSVIGPRVITH 639 Query: 366 TISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLM 409 TI A+ L S Y N+ L+ F I + + Sbjct: 640 TIGFAANNTANNFLNQLALQSGGGFYQADNSTDLLEAFNTILRSV 684 >gi|297380203|gb|ADI35090.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori v225d] Length = 217 Score = 37.2 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 30/240 (12%), Positives = 73/240 (30%), Gaps = 43/240 (17%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 L E + L++D SGSM ++ T++ L + ++ Sbjct: 5 LSEYTMEERFIPVFLLLDTSGSMSHSLG---------------NGTRIEVLNLCIQKMIE 49 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ + + M +I + P + + T A + Sbjct: 50 TLKQEAKKELFSKMAIITFGENGAVLHTPFDDIKNI-----NFKPLSASGGTPLDQAFRL 104 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A ++ P+ ++ + I ++DGE N+ K + Sbjct: 105 AKDLIEDKD-------------TFPTKFYKPYSILVSDGEPNDDKWQKALSDFHHYGRSA 151 Query: 361 FIKIVTISI----NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 +I I + + K + + + + L+ +F+ ++Q + S+ Sbjct: 152 KSVCWSIFIGNRNDNPQVNKEFGK------DGVFYADDVEKLVGLFEIMTQTISKGSTSI 205 >gi|150005796|ref|YP_001300540.1| aerotolerance-related membrane protein [Bacteroides vulgatus ATCC 8482] gi|294776175|ref|ZP_06741664.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|149934220|gb|ABR40918.1| aerotolerance-related membrane protein [Bacteroides vulgatus ATCC 8482] gi|294449998|gb|EFG18509.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] Length = 340 Score = 37.2 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 33/229 (14%), Positives = 75/229 (32%), Gaps = 45/229 (19%) Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM 207 M ++ Y ++F +++ ++E +R + +D+S SM Sbjct: 47 MPTISKYRPDIKFWLTFAALTLVILMLARPQFGSKMETVKRSGVEAVIALDISNSMLAE- 105 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 D +++ K + +D+ + + V GLI + + Sbjct: 106 --------------DVTPSRLDKSKKLISRLVDTFN-----NDKV--GLIVFAGDAFTQL 144 Query: 268 EPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + ++ SLI + TD A++ A + T P Sbjct: 145 PITSDYVSAKMFLETINPSLITTQGTDIGAAIRLAMKSFT------------------PQ 186 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 + II +TDGEN+ + + E +++ + + + Sbjct: 187 EGVGRAIIVITDGENHEGGAVEAAQEA----AEKGMQVFVLGVGSPDGS 231 >gi|29346316|ref|NP_809819.1| hypothetical protein BT_0906 [Bacteroides thetaiotaomicron VPI-5482] gi|253568263|ref|ZP_04845674.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298385670|ref|ZP_06995228.1| BatB protein [Bacteroides sp. 1_1_14] gi|29338211|gb|AAO76013.1| BatB, conserved hypothetical protein [Bacteroides thetaiotaomicron VPI-5482] gi|251842336|gb|EES70416.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298261811|gb|EFI04677.1| BatB protein [Bacteroides sp. 1_1_14] Length = 342 Score = 37.2 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 64/192 (33%), Gaps = 41/192 (21%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 ++ Y ++F +++ + E +R + + +D+S SM Sbjct: 50 VSKYRPDVKFWLIFAAIGLFSVLLARPQFGSKQETVKRKGVEVIIALDISNSMLA----- 104 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 QD + +++ K + +D +D + V G+I + + + Sbjct: 105 ----------QDVQPSRLEKAKRLISRLVDELD-----NDKV--GMIVFAGDAFTQLPIT 147 Query: 271 WGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + ++ SLI K T A+ A + T P Sbjct: 148 SDYISAKMFLESISPSLISKQGTAIGEAINLATRSFT------------------PQEGV 189 Query: 330 QKFIIFLTDGEN 341 + II +TDGEN Sbjct: 190 GRAIIVITDGEN 201 >gi|332141921|ref|YP_004427659.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii str. 'Deep ecotype'] gi|327551943|gb|AEA98661.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii str. 'Deep ecotype'] Length = 1355 Score = 37.2 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 21/105 (20%), Positives = 33/105 (31%), Gaps = 25/105 (23%) Query: 330 QKFIIFLTDGE--NNNFKSNVNTI-------------------KICDKAKE---NFIKIV 365 I+ L+DGE NN+ S + T+ I D + + Sbjct: 580 NNHIVLLSDGEANNNHSVSKIQTLLGQSCTGNGGEKCGLDLVRNISDTSTSVIGPRVITH 639 Query: 366 TISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLM 409 TI A+ L S Y N+ L+ F I + + Sbjct: 640 TIGFAANNTANNFLNQLALQSGGGFYQADNSTDLLEAFNTILRSV 684 >gi|114046070|ref|YP_736620.1| putative outer membrane adhesin like protein [Shewanella sp. MR-7] gi|113887512|gb|ABI41563.1| putative outer membrane adhesin like protein [Shewanella sp. MR-7] Length = 1215 Score = 37.2 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 56/362 (15%), Positives = 114/362 (31%), Gaps = 65/362 (17%) Query: 48 YKKNSMESANNAAI---LAGASKMVSNLSRLGDRFESISNHAKRALID---DAKRFIKNH 101 YK N++ A NA + +A S M + + +++ + D A+ + Sbjct: 187 YKGNAVSGAANATLTTDVATDSIMSNQWQARNGDMKWLNHSTANNIGDVNRTAQGRVYGK 246 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 + + +R T +ANN D+SN +S +L+F+ Sbjct: 247 SAWEVLTQDVKDDPKSGRKTAQPTRTRYTTLANNAPDASNPVKTELPAAQSSCRDQLKFV 306 Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 +++V+D SGSM+ Sbjct: 307 ------------------------WVEGDIDMQIVMDRSGSMYG---------------- 326 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + A +D+ S MGL+ +++R + + +++ + T Sbjct: 327 ----SPINNAIQAAKTLVDATAEGSTA-----MGLVSFSSRSSVKQDFA--VQQIPKPDT 375 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 +L + + A L + Q P F+ L DG++ Sbjct: 376 GIKQALKAAIDNIYASGSTA---LFDGSSLALDNLITYQTAAASGAPGVVFV--LADGDD 430 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 N+ N T+ + + I + +ASP G + + +Y + +I Sbjct: 431 NSSIKNEATVIT--AYQNANVPIFSFGYGSASPTGPLVTMANATGGKYFSSPTTLSEIID 488 Query: 401 VF 402 F Sbjct: 489 AF 490 >gi|254882022|ref|ZP_05254732.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|319640970|ref|ZP_07995678.1| aerotolerance-like membrane protein [Bacteroides sp. 3_1_40A] gi|254834815|gb|EET15124.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|317387415|gb|EFV68286.1| aerotolerance-like membrane protein [Bacteroides sp. 3_1_40A] Length = 340 Score = 37.2 bits (84), Expect = 5.8, Method: Composition-based stats. Identities = 33/229 (14%), Positives = 75/229 (32%), Gaps = 45/229 (19%) Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM 207 M ++ Y ++F +++ ++E +R + +D+S SM Sbjct: 47 MPTISKYRPDIKFWLTFAALTLVILMLARPQFGSKMETVKRSGVEAVIALDISNSMLAE- 105 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 D +++ K + +D+ + + V GLI + + Sbjct: 106 --------------DVTPSRLDKSKKLISRLVDTFN-----NDKV--GLIVFAGDAFTQL 144 Query: 268 EPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + ++ SLI + TD A++ A + T P Sbjct: 145 PITSDYVSAKMFLETINPSLITTQGTDIGAAIRLAMKSFT------------------PQ 186 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 + II +TDGEN+ + + E +++ + + + Sbjct: 187 EGVGRAIIVITDGENHEGGAVEAAQEA----AEKGMQVFVLGVGSPDGS 231 >gi|323498921|ref|ZP_08103904.1| hypothetical protein VISI1226_07138 [Vibrio sinaloensis DSM 21326] gi|323316033|gb|EGA69061.1| hypothetical protein VISI1226_07138 [Vibrio sinaloensis DSM 21326] Length = 418 Score = 37.2 bits (84), Expect = 5.8, Method: Composition-based stats. Identities = 23/147 (15%), Positives = 54/147 (36%), Gaps = 5/147 (3%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 +++ ++ +S+ + + I KN +++A + A LAGA+ + + Sbjct: 11 KAQQGLTLVVMTVSMAVIVGVAALSIDANHLMVSKNRLQNALDTAALAGAT-VANRTYEE 69 Query: 76 GDRFESISNHAKR---ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 D E+I + A +D + SL + + + ++ S ++ Sbjct: 70 DDAKEAIVEAYNKVTSAAGNDELVLAASDDGTSLKSLTIEYSDNANSGFSSNFPSSADYI 129 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 RL SN + + + Y + Sbjct: 130 YV-RLQVSNVELSEYLAGLLGYSKSIN 155 >gi|302391027|ref|YP_003826847.1| hypothetical protein Acear_0232 [Acetohalobium arabaticum DSM 5501] gi|302203104|gb|ADL11782.1| Protein of unknown function DUF2134, membrane [Acetohalobium arabaticum DSM 5501] Length = 307 Score = 37.2 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 21/125 (16%), Positives = 52/125 (41%), Gaps = 7/125 (5%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA------SK 67 I S+K ++ AL + F+ + +I + + ++ + + +AA LAG S+ Sbjct: 3 LINSQKGTVIVVVALMMTVFISFLALVIDIGSLYLERIRLVNTLDAAALAGVQDLPDDSQ 62 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 ++ + N+ + DD + + K+ ++ +F + + SS+ Sbjct: 63 QAETVALDYASRNGLDNNVTVEITDDDHQIGLSGSKQVGMNFAVIFGI-DQVEVAASSKA 121 Query: 128 SMTHM 132 + H+ Sbjct: 122 RVGHV 126 >gi|257887439|ref|ZP_05667092.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,141,733] gi|257823493|gb|EEV50425.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,141,733] Length = 1107 Score = 36.8 bits (83), Expect = 6.0, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 53/152 (34%), Gaps = 41/152 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM ++ +K + F+D++ S + + Sbjct: 269 LDLVLVVDWSGSM-------------------NNNDRIGEVKIGVDRFVDTLAD-SGITD 308 Query: 251 DVYMGLIGYTTR--VEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + MG +GY++ N G + V+ V S T + ++ A +L+ Sbjct: 309 KINMGYVGYSSEGHNYSNGTVQMGSFDSVKNQVKSITPSWTNGGTFTQKGLRDAGDMLSV 368 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 369 PNGH------------------KKVIVLLTDG 382 >gi|299116460|emb|CBN76178.1| similar to integrin alpha Hr1 precursor-like [Ectocarpus siliculosus] Length = 353 Score = 36.8 bits (83), Expect = 6.1, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 74/216 (34%), Gaps = 16/216 (7%) Query: 199 LSGSMHCAMNSDPEDVNSAPICQD-------KKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++G C + P + P D + +T + A + + +DS + + D Sbjct: 19 VTGQDSCVQVASPTWAATCPCDSDGQYSFTMEGKTTVTANTVNVAVIIDSSGSVDDDEWD 78 Query: 252 VYMGLI--GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + M ++ ++N+ + G+ + Q+ + + + A +S Sbjct: 79 MSMAFAKDAVSSFADQNLFTNGGSASIAQFSSSASEGGTFYSLEDFNAFVDGNTKYSSGG 138 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + S F+I TDG++++ K+ D A++ + + + Sbjct: 139 TDIIDGIAKGRELLKASPATTSFMIVTTDGQSSSPKAEA------DAARDEGTIVYAVGV 192 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 P Q +L ++V D L I Sbjct: 193 GTGPT-QEILLDIGGEEANVFDVDGFDELDVALAGI 227 >gi|116494017|ref|YP_805751.1| von Willebrand factor domain-containing protein [Lactobacillus casei ATCC 334] gi|116104167|gb|ABJ69309.1| Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [Lactobacillus casei ATCC 334] Length = 909 Score = 36.8 bits (83), Expect = 6.1, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 69/234 (29%), Gaps = 25/234 (10%) Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 + + T QN++N +N + + + D + Sbjct: 56 TNSWQVTGQQNVINQRGGDQVSGWDNNTIWNGDATDTTNSYLKFGD-PNNPDYQIRKYAK 114 Query: 170 NQKIVSFIPALLRIEMGER---PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 L ++ + I LVVD+SGSM N + Sbjct: 115 ETNTPGLYDVYLNVKGNTQQNVKPVDIVLVVDMSGSMESKNNGGTD-------------- 160 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + A++ + FL SI + + V +GLIG+++ Y+ + Sbjct: 161 RAGAVRTGVKNFLTSIQNA-GLGDYVNVGLIGFSSPGYIGGG---NKTTGPGYIRVGL-- 214 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP-FQKFIIFLTDG 339 T A+ A + + + +K +I LTDG Sbjct: 215 GKAGNTSQQQAINSALSPTFNGGTYTQIGLRQGSAMLNADTSGNKKMMILLTDG 268 >gi|327542239|gb|EGF28728.1| protein containing von Willebrand factor [Rhodopirellula baltica WH47] Length = 291 Score = 36.8 bits (83), Expect = 6.1, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 55/157 (35%), Gaps = 25/157 (15%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E + L+VD S S+ + K L+ L + +S Sbjct: 73 EERELAVMLMVDCSASLDFGTQTQT--------------------KRELVTELGATLAMS 112 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAM----KQA 301 +K + +GL ++ VEK+ P G+ V + + + TD A+ + A Sbjct: 113 AIKNNDRVGLTLFSEDVEKSFPPRQGSRHVLRLIREMLTHPCSGSGTDVGAALEHLQRTA 172 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + F + + + +++ S + +TD Sbjct: 173 KRRTVVFLISDFQSKDYEKSLRVASRKHDLIPVVVTD 209 >gi|321460553|gb|EFX71594.1| hypothetical protein DAPPUDRAFT_326977 [Daphnia pulex] Length = 1000 Score = 36.8 bits (83), Expect = 6.1, Method: Composition-based stats. Identities = 46/328 (14%), Positives = 114/328 (34%), Gaps = 55/328 (16%) Query: 82 ISNHAKRALIDDAKRFIKNHIKE----SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 ISN+ ID A + + S +++ + ++V+ + H + Sbjct: 217 ISNNNAACKIDPATNIYDQNCRFEFHPSFKPDTSLASYHLLDSVVHFCKDGDFHSHRSET 276 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 + +NT+ + T F + +PA +I F++ V+ Sbjct: 277 PNKHNTMCGGVSTWTVIMRNPDFA----FYHNRPVDLDQLPAEFKIVKPTGSRFVV--VM 330 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D+S SM K+ ++ L ++ ++ + V +G++ Sbjct: 331 DVSDSM-------------------KQCNRIDKLGESVRAWIK-----NDVPTGSQLGMV 366 Query: 258 GYTTRVEKNIEPS-WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 +++ E K+RQ + + + + T + A Q+L Sbjct: 367 MFSSTAHIVSELQVISDMKIRQEMMKKVPKDLYSITCIGCGLDLAVQMLQEKGNNKTGG- 425 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 I+ +TDG N+ +++ ++ + + I++VT++ + + Sbjct: 426 ---------------IIVLVTDGRNSAGYLDISDVE--EDIVKAGIRVVTVAFGSEADSN 468 Query: 377 RLLKTCVSSPEYHY--NVVNADSLIHVF 402 V+ + +Y + ++++L F Sbjct: 469 IERLADVTGGKSYYIKDGDSSEALQRAF 496 >gi|312133570|ref|YP_004000909.1| protein [Bifidobacterium longum subsp. longum BBMN68] gi|311772822|gb|ADQ02310.1| Hypothetical protein BBMN68_1309 [Bifidobacterium longum subsp. longum BBMN68] Length = 362 Score = 36.8 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 68/208 (32%), Gaps = 38/208 (18%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VVD SGSM + +A + D+ + + V + Sbjct: 187 VVDYSGSMSGEGKNGVVKGLNAALDPDQAKKSY----------------IEPASGGVNI- 229 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + T + ++ + GT + + TD + A L S+ + S +T Sbjct: 230 LIPFETEAHRPVKAT-GTS-TSDLLHEADATDASGGTDIYEVLLSALDELPSESEASQYT 287 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 I+ +TDG +N+ + K++ + I +I + Sbjct: 288 TA---------------IVLMTDGRSNSDHQDEFESAY--KSRGRDLPIFSIMFGDADPS 330 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 Q LK+ + + L VF+ Sbjct: 331 Q--LKSLATLSNAKVFDGRSGDLAAVFR 356 >gi|330995093|ref|ZP_08319010.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576669|gb|EGG58172.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 340 Score = 36.8 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 64/194 (32%), Gaps = 45/194 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +++ +R + +D+S SM D +++ K + +D Sbjct: 81 KMDTRKRQGIEAIIAMDISNSMMAE---------------DVTPSRLEKSKMLVSNIVDK 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQ 300 + D +GLI Y + + + ++ S+I TD A+ Sbjct: 126 M-------TDDKIGLIVYAGEAYTQLPITSDYVSAKMFLETINPSMITTQGTDIKQAIDL 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + T P+ K I +TDGE+N +++ A E Sbjct: 179 AMKSFT------------------PNQDVSKAIFVITDGEDNE----GGVVEMAKAAAEK 216 Query: 361 FIKIVTISINASPN 374 IK+ + + + Sbjct: 217 GIKVYVLGVGSPQG 230 >gi|221135528|ref|XP_002156134.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 194 Score = 36.8 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 68/217 (31%), Gaps = 42/217 (19%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 F + ++D SGS+ + LK F S Sbjct: 5 ETPDCEGFFDVGFILDSSGSLKSQY-----------------WKEKDFLKKLANSFGIS- 46 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + + G++ ++ E +I + D S + T A+ +A Sbjct: 47 ------NKGSHAGVVTFSHYAELSIRLDAFYSSIDFNDAVDRISHMDSFTRIDLALAKAL 100 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++ + LTDG+ + + I D+ K+ I Sbjct: 101 ELFDIKNGARNDVPNL--------------LFLLTDGKQ---EPEMPLTHISDEIKQKGI 143 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 ++ + I A N + L+ V +PE + V + D L+ Sbjct: 144 QLFAVGIGAGAN-KTELEKIVGNPENVFMVDDFDKLL 179 >gi|332707319|ref|ZP_08427370.1| hypothetical protein LYNGBM3L_36250 [Lyngbya majuscula 3L] gi|332353913|gb|EGJ33402.1| hypothetical protein LYNGBM3L_36250 [Lyngbya majuscula 3L] Length = 464 Score = 36.8 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 51/162 (31%), Gaps = 12/162 (7%) Query: 162 EHLLNQRYNQKI-VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 E L +QK+ V + E P V+D SGSM+ + D + Sbjct: 15 EFLPADTPDQKLFVMLKLRPTKEVAVELPSTSFAFVIDTSGSMYEVVEGDTKPTGRVYTQ 74 Query: 221 QDKK-------RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 +TK+ + +LL + S L + +I + + + + T Sbjct: 75 DGNDYEEVIGGKTKIDIVIESLLNLVRSNQLGGSD----RIAIIQFDDQASTIVGLTPAT 130 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 E + + T M+Q +L+ S Sbjct: 131 ETSQLEAGIEKLRNYSGGTCMGEGMEQTLTMLSGQTMTSRHA 172 >gi|66576258|gb|AAY51689.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 350 Score = 36.8 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 72/184 (39%), Gaps = 9/184 (4%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 K + S++ +I+FAL +M + LI + + H K +++A +AA LAGA ++ Sbjct: 8 KRLHSQRGVVTILFALVLMVLVGLIALAVDLTRLHLVKAELQNAADAAALAGAGSLIDTS 67 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + + + + A ++ A ++ V N++ S S T + Sbjct: 68 LQTFNWSAATAKAQEFADVNSADGKTIGQHRQ--EQDVNVAIQPGYWNLITPSFTSNTGL 125 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQ-FIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 + D + + + + L+ F +L + I A + G +F Sbjct: 126 VTHTGDGNIPAVQVTITLS-----HLKFFFAPILGIPEGTVQATAI-AAVSPPTGGTGLF 179 Query: 192 LIEL 195 + + Sbjct: 180 PMAI 183 >gi|89100218|ref|ZP_01173085.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89085068|gb|EAR64202.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 456 Score = 36.8 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 57/153 (37%), Gaps = 20/153 (13%) Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 ++E S F + ++ + + ++ + D+S+ + + Y Y L + Sbjct: 66 LEEIFSYPEGSFSSEDLSIDTSDTEAALNAVPAVPEDASDE----EIKELFGYLYSLYKM 121 Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 E+ + + E P F +E+V+D SGSM + S Sbjct: 122 EYQDPRAIMESASEVEGPESEGGSQESPASFNVEIVLDASGSMANKLGS----------- 170 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 +T+M K ++ F S+ +++ VY Sbjct: 171 ----KTRMELAKESIKEFASSLPEEANISLRVY 199 >gi|326675078|ref|XP_692457.5| PREDICTED: collagen alpha-6(VI) chain [Danio rerio] Length = 1605 Score = 36.8 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 19/102 (18%), Positives = 35/102 (34%), Gaps = 9/102 (8%) Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 YQ + + K +P ++ +TDG +++ + D+ Sbjct: 620 FSSVYQKEFYMNEYKDADGEDKGSRKQSGVPQN--LVLITDGVSSDRVNEAA-----DQL 672 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + I + I I A Q L SP+ + V N + L Sbjct: 673 RNLGINVFAIGIGAVSLQQ--LSYIAGSPDRLFKVQNFNYLN 712 >gi|309266960|ref|XP_003086909.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha (globulin) inhibitor H5-like, pseudogene [Mus musculus] gi|309271570|ref|XP_003085348.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha (globulin) inhibitor H5-like, pseudogene [Mus musculus] Length = 1321 Score = 36.8 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 36/223 (16%), Positives = 71/223 (31%), Gaps = 39/223 (17%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D+SGSM TK+ K A+ L + Sbjct: 284 VVFVIDVSGSMFG--------------------TKLQQTKKAMDTILSDLQASDSFNIIT 323 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + + + YV+R TD A+ A +L + Sbjct: 324 FSDTVNIWKAEGSIQATVQNIHSAKNYVSRMEAD---GWTDINAALLAAASVL------N 374 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIVTISINA 371 +G + +P I+FLTDGE ++ + I +A + + + +++ Sbjct: 375 HSNQEPGKGRGVGQIPL---IMFLTDGEPTAGETTPSVILSNIRQALAHRVSLFSLAF-G 430 Query: 372 SPNGQRLLKTCVSSPEY-----HYNVVNADSLIHVFQNISQLM 409 LL+ + + + A L ++ IS+ + Sbjct: 431 DDADFSLLRRLSLENQGEARRIYEDADAALQLEGLYAEISRPL 473 >gi|308462090|ref|XP_003093331.1| hypothetical protein CRE_03458 [Caenorhabditis remanei] gi|308250342|gb|EFO94294.1| hypothetical protein CRE_03458 [Caenorhabditis remanei] Length = 1816 Score = 36.8 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 11/69 (15%), Positives = 24/69 (34%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 + + K K+ LK A+ ++ D + ++ + G ++ Sbjct: 561 NGEHQKAVANFKSYQKLEDLKAAVSDVFNATDKIDEAQKAIKQGAEEVQKKMSDLEIFYN 620 Query: 272 GTEKVRQYV 280 GT + YV Sbjct: 621 GTNNIATYV 629 >gi|39937341|ref|NP_949617.1| hypothetical protein RPA4281 [Rhodopseudomonas palustris CGA009] gi|192293121|ref|YP_001993726.1| hypothetical protein Rpal_4760 [Rhodopseudomonas palustris TIE-1] gi|39651199|emb|CAE29722.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] gi|192286870|gb|ACF03251.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 390 Score = 36.8 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 13/52 (25%), Positives = 30/52 (57%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 + + AN ++IFA++++ L +G + ++ M++A ++A+LAG Sbjct: 4 RFLRDRSANVAVIFAIALIPLLGAVGSAVDYTIASNQRMKMQTALDSAVLAG 55 >gi|327439430|dbj|BAK15795.1| uncharacterized protein containing a von Willebrand factor type A domain [Solibacillus silvestris StLB046] Length = 986 Score = 36.8 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 49/134 (36%), Gaps = 10/134 (7%) Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF----I 333 +V R + + TD + +LT + I + ++ I Sbjct: 747 SHVYRFNNRPNHEATDKADIVSSIDSLLTYKNENRSTNIVKALETAIGNFTTNQYTSKAI 806 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY--HYN 391 + +TDG + ++ ++ AK I I T+S+ + L +SS + N Sbjct: 807 VLVTDG----YSNSNGLEQVLRDAKLKGIAIHTVSVGSYTTVNEKLLKDISSETNGTYQN 862 Query: 392 VVNADSLIHVFQNI 405 + + ++L Q I Sbjct: 863 ITSIENLHGSLQAI 876 >gi|255693879|ref|ZP_05417554.1| BatB protein [Bacteroides finegoldii DSM 17565] gi|260620308|gb|EEX43179.1| BatB protein [Bacteroides finegoldii DSM 17565] Length = 342 Score = 36.8 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 65/192 (33%), Gaps = 41/192 (21%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 ++ Y ++F +++ ++E +R + + +D+S SM Sbjct: 50 VSKYRPDVKFWILFAAIGLFSVLLARPQFGSKLETVKRKGVEVIIALDISNSMLA----- 104 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 QD + +++ K + +D +D + V G+I + + + Sbjct: 105 ----------QDVQPSRLEKAKRLISRLVDELD-----NDKV--GMIVFAGDAFTQLPIT 147 Query: 271 WGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + ++ SLI K T A+ A + T P Sbjct: 148 SDYISAKMFLESINPSLISKQGTAIGEAINLAARSFT------------------PQEGV 189 Query: 330 QKFIIFLTDGEN 341 + II +TDGEN Sbjct: 190 GRAIIVITDGEN 201 >gi|327403933|ref|YP_004344771.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327319441|gb|AEA43933.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 375 Score = 36.8 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 68/204 (33%), Gaps = 44/204 (21%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + ++ G + + + + +D+S SM+ + +++ A K Sbjct: 111 MAQPVAGSRKVN-GSKRVLDLVICLDISNSMNTQDMGGND------------VSRLTAAK 157 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 A+ L+ + + ++ + + + + ++ S+I Sbjct: 158 QAIGELLNQLK-------GERIAVVIFANDAYTQLPLTMDYGAAKLFIPDIETSMISDQG 210 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T+ A++ A Q D + K I+ +TDGE Sbjct: 211 TNVGRALEIA-QEQFKDTESG------------------KAILVITDGE----DHEALWK 247 Query: 352 KICDKAKENFIKIVTISINASPNG 375 + + K+ +++ + + +S G Sbjct: 248 EQIAELKKKNVELTYLGLGSSKGG 271 >gi|304393605|ref|ZP_07375533.1| conserved hypothetical protein [Ahrensia sp. R2A130] gi|303294612|gb|EFL88984.1| conserved hypothetical protein [Ahrensia sp. R2A130] Length = 170 Score = 36.8 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 10/61 (16%), Positives = 30/61 (49%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + ++ +E+ N IIFA + + + + + + + K+ ++ A++ A L+ + + Sbjct: 17 FLRRFANNERGNVMIIFAAAALPMAIGVAGALEISQYSQLKSQLQEASDRAALSAMAALR 76 Query: 70 S 70 Sbjct: 77 E 77 >gi|329888194|ref|ZP_08266792.1| von Willebrand factor type A domain protein [Brevundimonas diminuta ATCC 11568] gi|328846750|gb|EGF96312.1| von Willebrand factor type A domain protein [Brevundimonas diminuta ATCC 11568] Length = 655 Score = 36.8 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 51/177 (28%), Gaps = 43/177 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +R + +VD+SGSM K+ K ++ L +D + Sbjct: 286 QRRPLNLTFLVDVSGSM-------------------NSPDKLDLAKQSMNLIIDRLRPQD 326 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + Y + P+ GT+K++ T M AY Sbjct: 327 ------RVAVAYYAEGAGTTLAPTAGTQKLKLR-CAVASLRASGGTAGATGMTNAYDQ-- 377 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENFI 362 + SF N + I+ TDG+ N + + I Sbjct: 378 --AQASFGRNKVNR------------ILMFTDGDFNVGVTDDKRLEDYVADKRRTGI 420 >gi|291520528|emb|CBK75749.1| Gram positive anchor./von Willebrand factor type A domain [Butyrivibrio fibrisolvens 16/4] Length = 605 Score = 36.8 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 31/128 (24%), Positives = 51/128 (39%), Gaps = 18/128 (14%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV---- 280 +T++ LK+A+ +D+I V D +I + TR T V+QYV Sbjct: 8 KTRIQLLKSAVDNMIDNIAEKEDV--DAKWEVIDFATRAAVRGGGWLNTSNVKQYVTTAI 65 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 D + I + T+ M A + + S N +K ++FLTDG+ Sbjct: 66 NEDNNVDIGRGTNYQAGMDLAQKEFEKKQPESDRPN------------AKKIVLFLTDGQ 113 Query: 341 NNNFKSNV 348 + S V Sbjct: 114 PTYYGSGV 121 >gi|301772358|ref|XP_002921601.1| PREDICTED: integrin alpha-2-like [Ailuropoda melanoleuca] Length = 1369 Score = 36.8 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 63/189 (33%), Gaps = 30/189 (15%) Query: 234 ALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP- 291 A+ FL+ + L + +GLI Y + + + + + Sbjct: 378 AVKNFLEKFVQSLDIGPKKTQVGLIQYANKPRVIFNL-NTFKTKAEMIEATSQTYQYGGD 436 Query: 292 -TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T++ A++ A +F L K ++ +TDGE+++ Sbjct: 437 LTNTFKAIQYAK-------------DFAYAAGAGGRLGAAKVMVVVTDGESHDGSMLKAV 483 Query: 351 IKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPEY--HYNVVNADSLIH 400 I CD + I I++ N L +K S P +NV + +L+ Sbjct: 484 IDQCD---NDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTETFFFNVSDEAALLE 540 Query: 401 VFQNISQLM 409 + + + Sbjct: 541 KAGTLGEQI 549 >gi|227112734|ref|ZP_03826390.1| putative hemagglutinin/hemolysin-related protein [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 1159 Score = 36.8 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 42/272 (15%), Positives = 81/272 (29%), Gaps = 48/272 (17%) Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLDSSNN 142 N+ K + DD IKN V++ + ++ A + + + + Sbjct: 533 NNVKYTVGDDGSYIIKNANNAQTLAGKITLDVP-----VDAGKFNVVAQATSTEIANHDT 587 Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 Y+ + Y + I + V +++ E + I +VD SGS Sbjct: 588 ATGYSAGGVEQYGMSIGTIGDDTMSGTHSHDVMIADVSG-LQIIEGQNYNIAFMVDSSGS 646 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI--DLLSHVKEDVYMGLIGYT 260 M T + + +L S+ V + L + Sbjct: 647 MSS--------------------TDIDNARTSLSNVFKSLINSAGGANSGTVNVFLADFD 686 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSL-----ILKPTDSTPAMKQAYQILTSDKKRSFFT 315 T+V K + + +T ++S+ T+ K SD + Sbjct: 687 TQVGKTVSVNLNDTNALNKLTAVLNSMVGGSSAGGGTNYEDVFKTTANWFQSDVVKKNVG 746 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 N F+TDGE +++N Sbjct: 747 NNLTY--------------FITDGEPTYYQTN 764 >gi|115749084|ref|XP_001197592.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] gi|115959785|ref|XP_001193076.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] Length = 966 Score = 36.8 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 71/219 (32%), Gaps = 46/219 (21%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 I LV+D SGSM ++ KM + + I + Sbjct: 312 GSLRIVLVLDTSGSMD-----------------GERFDKM------IRGAKNFIQSIVPN 348 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V ++ + + + T + R+ + + +L T + A Q+ Sbjct: 349 NSYV--AIVEFNYESIVDSYMTELTSVISRKDLASLLPTLADGATCIGCGIVTAIQVAQY 406 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + S ++I L+DGE N+ +T+ D + + + + +I Sbjct: 407 NDMDSRGV----------------YLILLSDGEENHGTPIADTM---DDIEGSGVIVHSI 447 Query: 368 SI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + A + L + + +A +I F +I Sbjct: 448 AFYEADTQLEDLAQMTGGISATCADGGSAQCVISAFVSI 486 >gi|317012798|gb|ADU83406.1| hypothetical protein HPLT_05065 [Helicobacter pylori Lithuania75] Length = 219 Score = 36.8 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 31/234 (13%), Positives = 73/234 (31%), Gaps = 38/234 (16%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E L++D SGSM+ ++ D RT++ L + ++++ + Sbjct: 11 EERFIPAFLLLDTSGSMNESLGDRTRD----------DRTRIGVLNLCIQKMIETLKQEA 60 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + M ++ + P + + T A K A ++ Sbjct: 61 KKELFNKMAIVTFGENGAVLHTPFDDIKNI-----NFKPLSASGGTPLDQAFKLAKDLI- 114 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 P+ ++ + I ++DGE N+ K + + Sbjct: 115 ------------EYKDTFPTKFYKPYSILVSDGEPNDDKWQEPLFNFHHDGRSAKSVCWS 162 Query: 367 ISI----NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I I + K + + + + L+ +F+ ++Q + S+ Sbjct: 163 IFIGDREANPQVNKDFGK------DGVFYADDVEKLVKLFEIMTQTISKGSASI 210 >gi|51245384|ref|YP_065268.1| hypothetical protein DP1532 [Desulfotalea psychrophila LSv54] gi|50876421|emb|CAG36261.1| unknown protein [Desulfotalea psychrophila LSv54] Length = 420 Score = 36.8 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 68/180 (37%), Gaps = 20/180 (11%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 E+ +++ A+ + ++ + + + +N +++ +A LAGA +++ + + Sbjct: 14 EEGAVAVLTAILMAVLIMFAALAVDLGYLYGVRNELQNGADAGALAGAHELLDVENGILT 73 Query: 78 RFESISNHAK----RALIDDAKRFI---KNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 R ++I+ + + +DA +F H + ++ F + + Sbjct: 74 RDDAIAEAERVVSLNSTGNDAVQFKPIETGHWSFT----TSTFSPNPT-DTQGEWQEKSF 128 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR---YNQKIVSFIPALLRIEMGE 187 + L+ N + V++ D F + N + +++I + GE Sbjct: 129 AELDADLNFIN-----AVRVVSFRDDTPAFFARIFGFASFLVNTEAIAYIGFAGDLYPGE 183 >gi|281202341|gb|EFA76546.1| Ubiquitin-conjugating enzyme [Polysphondylium pallidum PN500] Length = 561 Score = 36.8 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 72/212 (33%), Gaps = 31/212 (14%) Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 NNT V ++ + I N N + I + + + I +V+D+S Sbjct: 91 NNTSSITNIVNSTRRLSIGDITDSNNNNNNTEPFIPIKPTVSVFSEKIKQIDI-IVLDVS 149 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY------M 254 GSM A + + + + T++ + F+D + V + Sbjct: 150 GSMKAA------AYAGSKVPGELEMTRIEVAQALFQTFID---------KYVQQEIPACV 194 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 GL+ + R++ +P+ + S L D+ A + Y+ + + Sbjct: 195 GLVCFGERIDLTFQPTRNFDSF---------STELGDVDANQAKTRLYEAIKLAAETIVS 245 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + S + LTDG++N+ Sbjct: 246 YKNKHPADILLSDDLNCRVFALTDGQDNSGSD 277 >gi|198438279|ref|XP_002126427.1| PREDICTED: similar to LOC779593 protein [Ciona intestinalis] Length = 1012 Score = 36.8 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 57/211 (27%), Gaps = 64/211 (30%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 L+ VVD SGSM K+ +K AL L S++ H Sbjct: 328 LVVFVVDTSGSMFGY--------------------KLKQVKQALADSLRSLNNEDH---- 363 Query: 252 VYMGLIGYTTRVEKNIEPSW--------GTEKVRQYVTRDMDSLILKPTDSTPAMKQAY- 302 + V + W T + +T T+ A++ A+ Sbjct: 364 -------FNIVVFGDTAEPWISGVLSTASTRSINDAITYVDAVSARGGTNMLVALQTAFA 416 Query: 303 -----------------------QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + + F + + K I+FLTDG Sbjct: 417 IMEPYLPSLPENETMVEDTTPFPTPVPLQPETNHFIRKRATETQTELSNYAKMIVFLTDG 476 Query: 340 ENNNFKSNVNTIK-ICDKAKENFIKIVTISI 369 + I +K + + TI Sbjct: 477 RPTKDDVGTDDIASRIEKINGGRVNLHTIGF 507 >gi|55670411|pdb|1V7P|C Chain C, Structure Of Ems16-Alpha2-I Domain Complex Length = 200 Score = 36.8 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 64/191 (33%), Gaps = 32/191 (16%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILK 290 +A+ FL+ + L +GLI Y T K ++ + + Sbjct: 23 DAVKNFLEKFVQGLDIGPTKTQVGLIQYANNPRVVFNL--NTYKTKEEMIVATSQTSQYG 80 Query: 291 P--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T++ A++ A + S K ++ +TDGE+++ Sbjct: 81 GDLTNTFGAIQYARKYAYSAA-------------SGGRRSATKVMVVVTDGESHDGSMLK 127 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNADSL 398 I C+ + I I++ N L +K S P Y +NV + +L Sbjct: 128 AVIDQCN---HDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAAL 184 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 185 LEKAGTLGEQI 195 >gi|41386751|ref|NP_958822.1| calcium-activated chloride channel regulator 4 [Rattus norvegicus] gi|37703077|gb|AAR01113.1| parturition-related protein PRP3 [Rattus norvegicus] Length = 923 Score = 36.8 bits (83), Expect = 7.1, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 81/234 (34%), Gaps = 56/234 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D+SGSM ++ + A FL I ++ + Sbjct: 310 LVLDVSGSMG-------------------SYDRLNRMNQAAKFFLQQI-----LESRSWA 345 Query: 255 GLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + E ++ R + + + T ++ A+Q+ + ++ Sbjct: 346 GMVHFHSSATVKSELIQINSDVERNQLLETLPTSASGGTSICSGIRTAFQVFKNKGYQTG 405 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISI-NA 371 + I+ L+DGE++ T K C D+ K++ + I++ A Sbjct: 406 GND----------------ILLLSDGEDS-------TAKDCLDEVKDSGAVVHFIALGKA 442 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNA--DSLIHVFQNISQ---LMVHRKYSVILKG 420 + V+ + + A + LI F ++ + + + KG Sbjct: 443 FDQSISNMAN-VTGGKQLFATDEAQNNGLIDAFGALASENADVTEKSLQLESKG 495 >gi|260559816|ref|ZP_05831995.1| von Willebrand factor [Enterococcus faecium C68] gi|260074040|gb|EEW62363.1| von Willebrand factor [Enterococcus faecium C68] Length = 857 Score = 36.8 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 28/151 (18%), Positives = 52/151 (34%), Gaps = 41/151 (27%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM ++ +K + F+D++ S + + Sbjct: 19 LDLVLVVDWSGSM-------------------NDNNRIGEVKIGVDRFVDTLAD-SGITD 58 Query: 251 DVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + MG +GY++ + + V+ V S T + A++ A +L+ Sbjct: 59 KINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSV 118 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +K I+ LTD Sbjct: 119 PNGH------------------KKVIVLLTD 131 >gi|239993926|ref|ZP_04714450.1| inter-alpha-trypsin inhibitor domain-containing protein [Alteromonas macleodii ATCC 27126] Length = 586 Score = 36.8 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 75/230 (32%), Gaps = 55/230 (23%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 +V +P ++ + + I V+D SGSM + K Sbjct: 296 LVMLMPPQVKSQDLQDFDRDITFVIDTSGSMGGR--------------------PIVDAK 335 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDS-LILK 290 +L L +D + ++ + + E S GT + +QY + Sbjct: 336 ESLQLAIDRLSEKD------RFNVVAFNNDTTRLFETSVEGTTRNKQYARDFVKHLNAGG 389 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T+ PA+ A + T+ F K ++F+TDG + N Sbjct: 390 GTEMAPALNAALKRTTTK-------------------DFIKQVVFITDG------AVGNE 424 Query: 351 IKICDKAKEN--FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + + K ++ T+ I ++PN + + + V N + Sbjct: 425 AALFSQIKNELGDARLFTVGIGSAPNSYFMTRAAQFGLGSYVFVRNTADI 474 >gi|194291603|ref|YP_002007510.1| hypothetical protein RALTA_B0837 [Cupriavidus taiwanensis LMG 19424] gi|193225507|emb|CAQ71453.1| conserved hypothetical protein, Von Willebrand factor type A domain (vwa), putative membrane protein [Cupriavidus taiwanensis LMG 19424] Length = 353 Score = 36.8 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 37/262 (14%), Positives = 82/262 (31%), Gaps = 71/262 (27%) Query: 189 PIFLIELVV--DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 P + LV+ D S SM A D +++A + A DL+ Sbjct: 82 PADTVTLVLAMDTSRSMEAA---------------DVPPNRISAAQQAAR------DLVV 120 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + V +G++ + + P+ + + + R + + T + + QA +L Sbjct: 121 GLPASVRLGIVSFAGTAAVVLPPTDNRQDMLDAIERFQ---LQRGTATGSGLFQALAVLF 177 Query: 307 SD---------------------------KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + Q P +I L+DG Sbjct: 178 PEDGIDLEVILFGSRSDRAGRGTSLDEAAAADAARRREQGQQAAQPGSYRHGAVILLSDG 237 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISI------NASPNGQR--------LLKTCVS- 384 ++ ++ A + +++ T+ +A + L+ S Sbjct: 238 RRTTGPDPLDAARM---AAQRGVRVYTVGFGSQQVTSAPESSLSYFMQLDEPALRAVASI 294 Query: 385 SPEYHYNVVNADSLIHVFQNIS 406 + +Y+ +A L V++ +S Sbjct: 295 TGGEYYHAGSAADLSQVYRQLS 316 >gi|326426493|gb|EGD72063.1| hypothetical protein PTSG_00082 [Salpingoeca sp. ATCC 50818] Length = 571 Score = 36.8 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 71/241 (29%), Gaps = 42/241 (17%) Query: 103 KESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF--YNMDVMTSYDYRLQF 160 + S F + Q + ++ T ++ N S N + +S YR+ Sbjct: 98 SSVMRTASTPFDDDVRQEACWTRQLDNTFISINGGTSENTDTATKWQYVGTSSGFYRIYP 157 Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + Q N P + G + I +V+D SGSM Sbjct: 158 --GVPQQDCNAYDPRLRPWYVAATSGPK---DIVIVLDRSGSMA---------------- 196 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY-MGLIGYTTRVEKNIEPSWGTEKVRQY 279 + + +A L+++ + V V+ T Sbjct: 197 TNNRW---ETAMDAAETVLETLTIADFVAIVVFDTSASQVCGTTIPCGSLVQATADNVGT 253 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + + T+ A + A+ +L +R+ + I+F+TD Sbjct: 254 LRTLLANFNPDGSTNFESAFQVAFSVLKQTGERTSNCHTA--------------ILFMTD 299 Query: 339 G 339 G Sbjct: 300 G 300 >gi|119512000|ref|ZP_01631096.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] gi|119463351|gb|EAW44292.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] Length = 202 Score = 36.8 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 27/149 (18%), Positives = 50/149 (33%), Gaps = 35/149 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 L++D SGSM P D+ + M AL D +D + ++ Sbjct: 11 LIIDKSGSMES-----PGDLPGK-----TRWETMQENTLALARKCDQLDP-DGINVYLFS 59 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 G + + V KV+Q ++ T+ ++ A+ +++ Sbjct: 60 GRHKFYSEVTA--------SKVQQ---IFQENHPAGSTNLGGVLQAAFDDFFKRREQGLT 108 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 N II +TDGE ++ Sbjct: 109 PNGET-------------IIVVTDGEPDD 124 >gi|224054053|ref|XP_002190891.1| PREDICTED: collagen, type VI, alpha 2 [Taeniopygia guttata] Length = 1016 Score = 36.8 bits (83), Expect = 7.3, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 64/207 (30%), Gaps = 54/207 (26%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 I V+D S S+ T KN ++ + + ++ Sbjct: 606 GALDIMFVIDSSESIG--------------------YTNFTLEKNFVINVVSRLGSIAKD 645 Query: 249 KEDV---YMGLIGYTTR-VEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + + +G++ Y+ + I+ ++ V R I T + A++ Sbjct: 646 PKSLTGARVGVVQYSHEGTFEAIKLDDERIDSLSSFKEAVKRL--EWIAGGTWTPSALQF 703 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AY L + +R Q F + +TDG + + N +C Sbjct: 704 AYNKLIKESRREKA---------------QVFAVVITDGRYDPRDDDKNLGALC----GR 744 Query: 361 FIKIVTIS----INASPNGQRLLK-TC 382 + + TI + + L+ C Sbjct: 745 DVVVNTIGIGDMFDQPEQSETLVSIAC 771 >gi|262171974|ref|ZP_06039652.1| protein BatA [Vibrio mimicus MB-451] gi|261893050|gb|EEY39036.1| protein BatA [Vibrio mimicus MB-451] Length = 335 Score = 36.8 bits (83), Expect = 7.4, Method: Composition-based stats. Identities = 51/273 (18%), Positives = 91/273 (33%), Gaps = 54/273 (19%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEM----------GERPIF--LIELVVDLSGSMHCAMNS 209 LLN + Q I + LL + R F + +VVDLSGSM Sbjct: 55 AGLLNASHWQHIALIVSWLLIVTALAKPSVLGEVQTREAFGRDVLMVVDLSGSME----- 109 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 ++ +++ A K L F+ ++ GLI + Sbjct: 110 ----EKDFATKSGEQLSRLTAAKKVLRDFVTQ-------RQGDRFGLILFGDAAFIQTPF 158 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + + + + + T+ A+ F Q PS Sbjct: 159 TADQDVWLNLLDEAETGMAGQSTNLGDAI-------------GLGIKVFEQ---SPSTSQ 202 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLL------KTC 382 + ++ LTDG N+ S V+ + A I+I I++ N G++ L + Sbjct: 203 DQIMLVLTDG--NDTGSFVSPVDAAKIAAAKGIRIYVIAMGDPENVGEQPLDMDVVSRVS 260 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + ++ L +Q I Q + +KYS Sbjct: 261 SLTQARSFVAIDQSQLNEAYQVIDQ-LEPQKYS 292 >gi|332535277|ref|ZP_08411079.1| hypothetical protein PH505_ck00200 [Pseudoalteromonas haloplanktis ANT/505] gi|332035308|gb|EGI71812.1| hypothetical protein PH505_ck00200 [Pseudoalteromonas haloplanktis ANT/505] Length = 203 Score = 36.8 bits (83), Expect = 7.4, Method: Composition-based stats. Identities = 21/130 (16%), Positives = 43/130 (33%), Gaps = 13/130 (10%) Query: 183 IEMGERPIFLIELVVDLSGSM--HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 E+P + L+ D SGSM + D +++ A + A+ Sbjct: 34 ANSDEKPR--VLLIFDTSGSMAFSSSTGDDCGQKYYNGPYILCSDSRLEAAQEAMS---- 87 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMK 299 ++ K+ + GL+ + + G + + SL T + Sbjct: 88 --KVVDDNKDLIDFGLMRFNGTDGGYVL--NGIGSPASDIDASIASLPASGGTPLAETLW 143 Query: 300 QAYQILTSDK 309 +AY +T + Sbjct: 144 EAYLYITGKE 153 >gi|305665950|ref|YP_003862237.1| aerotolerance-related membrane protein [Maribacter sp. HTCC2170] gi|88710725|gb|EAR02957.1| aerotolerance-related membrane protein [Maribacter sp. HTCC2170] Length = 349 Score = 36.8 bits (83), Expect = 7.4, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 65/189 (34%), Gaps = 44/189 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++E +R I VD+S SM D ++ K + ++ Sbjct: 81 KLETVKREGVDIVFAVDVSKSMLAE---------------DIAPNRLEKAKRLVSEIINQ 125 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQ 300 + +G+I Y + + + + ++ + + L + T A++ Sbjct: 126 LASD-------RIGIIAYAGQAFPQLPITTDYGAAKMFLQNMNTNMLTSQGTAINEAIEL 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A D++++ + + ++DGE S T+K D A E Sbjct: 179 A-TTYYDDEEQTN-----------------RVLFIISDGE---DHSEGTTLKAVDDAIEE 217 Query: 361 FIKIVTISI 369 I+I TI + Sbjct: 218 GIQIFTIGV 226 >gi|145491137|ref|XP_001431568.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398673|emb|CAK64170.1| unnamed protein product [Paramecium tetraurelia] Length = 591 Score = 36.8 bits (83), Expect = 7.5, Method: Composition-based stats. Identities = 46/308 (14%), Positives = 94/308 (30%), Gaps = 69/308 (22%) Query: 112 VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 + Q+I + N + + ++ L + LLN R Sbjct: 84 GMNTVKNQDIHTNFNDDEKIEPKKEEAKQNANKYNLNEKLSFEVKALYKMGKLLNSRT-- 141 Query: 172 KIVSFIPALLRIEMGERPIF----------LIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 ++P ++ I+ ++ + + ++D SGSM Sbjct: 142 ---QYLPGIVSIKAQDQAVIQNQENQRVGVDLICLIDKSGSM------------------ 180 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 K+ +K L L LD ++ LI + ++ ++ T+ +QY Sbjct: 181 --NGQKIEMVKQTLALLLDFLNEND------RYQLITFESQAQRLTPLKRVTDGNKQYFK 232 Query: 282 RDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + + T A + A++ L K R+ T+ I L+DG+ Sbjct: 233 QVIQQINSGGGTTIGTATEIAFKQLQERKYRNNVTS----------------IFLLSDGQ 276 Query: 341 ----NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 N + + T+ + T + Q + + C Y V + Sbjct: 277 DGQANQRIQEQIKTVN-------EVFTLHTFGFGEDHDAQMMTQLCNLKSGSFYFVQDVT 329 Query: 397 SLIHVFQN 404 L F + Sbjct: 330 LLDEFFAD 337 >gi|212693196|ref|ZP_03301324.1| hypothetical protein BACDOR_02706 [Bacteroides dorei DSM 17855] gi|237709938|ref|ZP_04540419.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725395|ref|ZP_04555876.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753590|ref|ZP_06088945.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664301|gb|EEB24873.1| hypothetical protein BACDOR_02706 [Bacteroides dorei DSM 17855] gi|229436082|gb|EEO46159.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456031|gb|EEO61752.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235304|gb|EEZ20828.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 340 Score = 36.8 bits (83), Expect = 7.6, Method: Composition-based stats. Identities = 33/229 (14%), Positives = 75/229 (32%), Gaps = 45/229 (19%) Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM 207 M ++ Y ++F +++ ++E +R + +D+S SM Sbjct: 47 MPTISKYRPDIKFWLTFAALTLVILMLARPQFGSKMETVKRSGVEAVIALDISNSMLAE- 105 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 D +++ K + +D+ + + V GLI + + Sbjct: 106 --------------DVTPSRLDKSKKLISRLVDTFN-----NDKV--GLIVFAGDAFTQL 144 Query: 268 EPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + ++ SLI + TD A++ A + T P Sbjct: 145 PITSDYVSAKMFLETINPSLITTQGTDIGTAIRLAMKSFT------------------PQ 186 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 + II +TDGEN+ + + E +++ + + + Sbjct: 187 EGVGRAIIVITDGENHEGGAVEAAQEA----AEKGMQVFVLGVGSPDGS 231 >gi|145485516|ref|XP_001428766.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124395854|emb|CAK61368.1| unnamed protein product [Paramecium tetraurelia] Length = 947 Score = 36.8 bits (83), Expect = 7.6, Method: Composition-based stats. Identities = 60/374 (16%), Positives = 119/374 (31%), Gaps = 67/374 (17%) Query: 43 VLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHI 102 + K + ++ + LA +SN +L D++E+ + A ++D+ + Sbjct: 622 ISHILQCKVLLNLEDSNSALAE----ISNAEQLSDKYENSYD-RSDAQVNDSFPIPPGIL 676 Query: 103 KESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 K+ + +F I I ++ + + + F Sbjct: 677 KQRILYEKGLF-------IKRYDSIKKAAFIFTECLETSKFYDPEIRINCLKQLKEIFQS 729 Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 L + + ++ I V+D SGSM Sbjct: 730 QNLLYKVPKIEQLLELNEIKKNN------DIVFVIDHSGSM------------------- 764 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP---SWGTEKVRQY 279 + K + A+ L D +++++ + + + +E + S T +R Sbjct: 765 ENIKK----ELAINGILKIFD--NYLQDQDRISYMRFNQNIEVIFDLTSKSENTAYLRSA 818 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + R + T A+ AY I K+ Q++I+ L DG Sbjct: 819 IERSKNIRAEGMTAMLSAVLHAYSIHEKAVKKDN----------------QQWIVVLCDG 862 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISI---NASPNGQRLLKTCVSSPEYHY-NVVNA 395 E+N +K +K I ++ I I L C S + V + Sbjct: 863 EDNLSNITYERMKK-FTSKRPQISLIVIGIGLSLKPDCLDELYDLCRLSQKGFLIESVYS 921 Query: 396 DSLIHVFQNISQLM 409 + L FQ+IS L+ Sbjct: 922 EDLDIAFQSISNLI 935 >gi|145482427|ref|XP_001427236.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124394316|emb|CAK59838.1| unnamed protein product [Paramecium tetraurelia] Length = 1189 Score = 36.8 bits (83), Expect = 7.6, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 75/211 (35%), Gaps = 35/211 (16%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I +++D+SGSM + T+M +K F D Sbjct: 742 IVVLLDISGSMDELY------------YDSEDLTRMGVVKAFFSTFADRTMAYDLKHV-- 787 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + L+ + R+ + + + V + + T+ A+K A L Sbjct: 788 -ISLVYFDNRIIEKCSFTELFILFKDLVNK---AQPTGRTNLYRALKYAENQLL------ 837 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 F+Q L II LTDG++N+ ++ IK+ + +N +I+ S S Sbjct: 838 ----KFKQTYPKCLLR----IIALTDGQDND-NHPLDPIKVAESILKN--EILLDSFVVS 886 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + L K ++ ++ + +F+ Sbjct: 887 DDCTDLKKITKATGGQCFSPQTIQEGLKLFE 917 >gi|291221810|ref|XP_002730913.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 858 Score = 36.8 bits (83), Expect = 7.7, Method: Composition-based stats. Identities = 38/256 (14%), Positives = 89/256 (34%), Gaps = 54/256 (21%) Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 +S++ LQ + N + ++ P ++ G R + LV+D SGSM Sbjct: 321 SSWEVMLQHEDFKNNVNLPRLMIDTTPTFNIVKTGRRR---VVLVLDTSGSMAGN----- 372 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 ++ L F+ L+ + +D ++G+I +++ E Sbjct: 373 ---------------RIERLHRDTTHFI-----LNVIDDDSFVGIIQFSSDATVLSEMKK 412 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 R + + T+ ++ A Q L + + Sbjct: 413 IDYASRPQIAASVPYSANGGTNFGAGIRAALQELKESQLSLKGAS--------------- 457 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHY 390 ++ +TDG+ + + D+ + +++ TI+ A+ + R+L + Y+Y Sbjct: 458 -LLIITDGQFSYTSD------VTDEVYASGVRVDTIAYTQAAEDSLRVLSD-RTGGSYYY 509 Query: 391 --NVVNADSLIHVFQN 404 + + L+ + Sbjct: 510 VSDDETSTELLDSLTS 525 >gi|125975609|ref|YP_001039519.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|256003656|ref|ZP_05428645.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|281416621|ref|ZP_06247641.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|125715834|gb|ABN54326.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|255992447|gb|EEU02540.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|281408023|gb|EFB38281.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|316939730|gb|ADU73764.1| von Willebrand factor type A [Clostridium thermocellum DSM 1313] Length = 565 Score = 36.8 bits (83), Expect = 7.7, Method: Composition-based stats. Identities = 34/135 (25%), Positives = 55/135 (40%), Gaps = 20/135 (14%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILT 306 + DV +GL+ Y+T V N+ + R ++SL T + A+ A ++L Sbjct: 422 ISSDVSIGLVSYSTDVNINLPIAKFDLNQRSLFVGAVESLAAGGNTATFDAIIVATKMLK 481 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIF-LTDGENNNFKSNVNTIKICDKAKENFIKIV 365 +K ++ K ++F L+DG N ++N IK D K I I Sbjct: 482 EEKAKNPNA---------------KLMLFVLSDGVTNYG-HSLNDIK--DMMKTFGIPIY 523 Query: 366 TISINASPNGQRLLK 380 TI NA+ L Sbjct: 524 TIGYNANIKALETLS 538 >gi|125625081|ref|YP_001033564.1| hypothetical protein llmg_2320 [Lactococcus lactis subsp. cremoris MG1363] gi|124493889|emb|CAL98883.1| hypothetical protein predicted by Glimmer/Critica [Lactococcus lactis subsp. cremoris MG1363] Length = 1444 Score = 36.4 bits (82), Expect = 7.7, Method: Composition-based stats. Identities = 39/149 (26%), Positives = 58/149 (38%), Gaps = 33/149 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LVVD+SGSM A + A+K + FL SI+ + + Sbjct: 305 DIVLVVDMSGSMEGA--------------------REGAIKQGVKSFLSSIENTA-YAQY 343 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT-SDKK 310 V +GL+GY++ G Y+T M+SL S AM +A + Sbjct: 344 VNVGLVGYSSP---------GYISNSGYITVPMESLATDGHVS--AMNKALERQFVGGTF 392 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 Q +K + +K II +TDG Sbjct: 393 TQLGIRQGAQMLKEDASGNEKMIILMTDG 421 >gi|300071886|gb|ADJ61286.1| hypothetical protein LLNZ_11975 [Lactococcus lactis subsp. cremoris NZ9000] Length = 1438 Score = 36.4 bits (82), Expect = 7.8, Method: Composition-based stats. Identities = 39/149 (26%), Positives = 58/149 (38%), Gaps = 33/149 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LVVD+SGSM A + A+K + FL SI+ + + Sbjct: 299 DIVLVVDMSGSMEGA--------------------REGAIKQGVKSFLSSIENTA-YAQY 337 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT-SDKK 310 V +GL+GY++ G Y+T M+SL S AM +A + Sbjct: 338 VNVGLVGYSSP---------GYISNSGYITVPMESLATDGHVS--AMNKALERQFVGGTF 386 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 Q +K + +K II +TDG Sbjct: 387 TQLGIRQGAQMLKEDASGNEKMIILMTDG 415 >gi|59713412|ref|YP_206187.1| TadG-like protein [Vibrio fischeri ES114] gi|59481660|gb|AAW87299.1| TadG-like protein [Vibrio fischeri ES114] Length = 423 Score = 36.4 bits (82), Expect = 7.8, Method: Composition-based stats. Identities = 36/257 (14%), Positives = 83/257 (32%), Gaps = 28/257 (10%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + ++ + +I+FA+ + + + K +E A+ A LA ++ + Sbjct: 2 RNLRKHQQGHAAILFAMMIPALFGIFALASDGARAIQTKARIEDASEVAALAISAHNDPD 61 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 G S N ++ + +++ +S AV T+I+ + ++ Sbjct: 62 QPDNGSYTPSTRN----------RQIVVDYVNAYISDIDAV---TDIKVAKRRCEL-ISG 107 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 + + +DV T + E + + S L + Sbjct: 108 CVAGLYKGDARYLEHEIDVTTRQNSWFPGNEAIEG---MGETFSTRGKSLARKYQSE-AV 163 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + D SGSM + +S P D + + + + L + L ++ Sbjct: 164 DVMFAADFSGSMLDTWSG-----SSNPKYID-----LIEIIRNISVELQKFNDLPENRDK 213 Query: 252 VYMGLIGYTTRVEKNIE 268 MG+ ++T Sbjct: 214 STMGISAFSTFTNSFTS 230 >gi|308472975|ref|XP_003098714.1| hypothetical protein CRE_04178 [Caenorhabditis remanei] gi|308268314|gb|EFP12267.1| hypothetical protein CRE_04178 [Caenorhabditis remanei] Length = 412 Score = 36.4 bits (82), Expect = 8.1, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 65/181 (35%), Gaps = 15/181 (8%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK--MAALKNALLLFLDSIDLLSHV 248 + VVD S M +N + TK +A+ ++ F I L + Sbjct: 38 LDVIAVVDNSRGM--TVNGLNYVSIFSLTMPSSLFTKFQIASNIASVFGFGTRIGLNASE 95 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 +GL+ Y + + + + QY D+ D + + L++ Sbjct: 96 PRTTRLGLVTYNSVATQMADLN-------QY-QSLHDAFNRIFDDLSNTVDTTESYLSTG 147 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + F Q V +QK +I N +SN +I D+ K + +KI+T++ Sbjct: 148 LTLAEKM-FNDQSVNSTRAHYQKVVIVYASKYQTNGESNPESIA--DRLKLSGVKIITVA 204 Query: 369 I 369 Sbjct: 205 Y 205 >gi|307256565|ref|ZP_07538346.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|306864975|gb|EFM96877.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 10 str. D13039] Length = 531 Score = 36.4 bits (82), Expect = 8.1, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 80/255 (31%), Gaps = 27/255 (10%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 ++ I E ++++ L + L LI + + + + A+L+ + Sbjct: 6 LSQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTA- 64 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 N S D +S K D + + ++ + F + +N I Sbjct: 65 --ENNSGRKDNDYKLSGSNKENDSFDISSEVGKRDSQMVTKFVKAFLPQTNDDKMNLIPI 122 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 T N +T + S IEH + IP + + Sbjct: 123 CKTV---NNTSGKGHTSSSEVTCTVSGT-----IEHKSWFPLKVGTLEVIPQQVNVASQS 174 Query: 188 RPI--------FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 R + +V DLSGSM + + E + + +K+ LK+ L+ Sbjct: 175 RAFKKNTFNIPIDLMVVADLSGSMRYDITNKYE--------TNNETSKLGILKDVLIELA 226 Query: 240 DSIDLLSHVKEDVYM 254 + L + + Sbjct: 227 EKTLLSEDANQHNRI 241 >gi|148975971|ref|ZP_01812760.1| hypothetical protein VSWAT3_06656 [Vibrionales bacterium SWAT-3] gi|145964716|gb|EDK29969.1| hypothetical protein VSWAT3_06656 [Vibrionales bacterium SWAT-3] Length = 520 Score = 36.4 bits (82), Expect = 8.1, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 74/272 (27%), Gaps = 54/272 (19%) Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 + +++S + + ++ + S+N + L Sbjct: 51 TSRLRDSAEAAAIAVTIEDQPDLARGLATQYVENYVRDIKSTNLSAQRFHQTEDEGAGIL 110 Query: 159 QFIEHLLNQRYNQKI---VSFIPA-------LLRIEMGERPIF------LIELVVDLSGS 202 ++I++ +N + SFIP+ R + P++ I V D SGS Sbjct: 111 EYIQYTVNAKTTHDSWFASSFIPSFDEQQDLAGRSLARKYPVYLGDNNIDIVFVSDFSGS 170 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV---------- 252 M S + K+ LK A+ I S E V Sbjct: 171 MDDRWGS-------------SRHKKIDDLKTAIDQISSKILCTSTDLEYVDGEWKEVCDE 217 Query: 253 ------------YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 +G + + R + S G ++ + S + Sbjct: 218 PGEDTTGDKLLNRVGFVPFNVRTREI--VSGGQANATSQLSYKPNYKPNVSPYSYNDVNW 275 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 Y S + N+ P QK+ Sbjct: 276 DYWRAYSQNEVLNCANWQSY-CPNPKSDNQKY 306 >gi|308063814|gb|ADO05701.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Sat464] Length = 217 Score = 36.4 bits (82), Expect = 8.1, Method: Composition-based stats. Identities = 31/234 (13%), Positives = 73/234 (31%), Gaps = 43/234 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E + L+VD SGSM+ ++ T++ AL + ++++ + Sbjct: 11 EERFIPVFLLVDTSGSMNESLG---------------NCTRIEALNLCIQKMIETLKQEA 55 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + M +I + P + + T A + A ++ Sbjct: 56 KKELFSKMAIITFGENGAVLHTPFDDVKNI-----NFKPLSASGGTPLDQAFRLAKNLIE 110 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 P+ ++ + I ++DGE N+ K + + Sbjct: 111 DKD-------------TFPTKFYKPYSILVSDGELNDGKWQKALSDFHHDGRSTKSVCWS 157 Query: 367 ISI----NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I I + K + + + + L+ +F+ ++Q + S+ Sbjct: 158 IFIGDREANPQVNKDFGK------DGVFYADDVEKLVGLFEIMTQTISKGSTSI 205 >gi|242078053|ref|XP_002443795.1| hypothetical protein SORBIDRAFT_07g002215 [Sorghum bicolor] gi|241940145|gb|EES13290.1| hypothetical protein SORBIDRAFT_07g002215 [Sorghum bicolor] Length = 423 Score = 36.4 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 52/158 (32%), Gaps = 49/158 (31%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D+SGSM KM +K A+ +D++ Sbjct: 125 LDLVTVLDVSGSMAGK--------------------KMERVKRAMGFLIDNLGSDD---- 160 Query: 251 DVYMGLIGYTTRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + ++ ++T + I + G ++ V S T+ + A +L Sbjct: 161 --RLSVVAFSTDARRIIRLTRMSDDGKAAAKRAVESLAAS---GSTNIRGGLDVAAMVLD 215 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ + +I L+DG++N Sbjct: 216 GRRHKNAVAS----------------VILLSDGQDNQS 237 >gi|124007374|ref|ZP_01692081.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123987207|gb|EAY26947.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 1088 Score = 36.4 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 60/212 (28%), Gaps = 31/212 (14%) Query: 124 SSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + M A + RL D I+ L + K + + Sbjct: 844 NITYKMVREAQSKRLPGKRIRKSTTTSTKKPCDCDSYKIKQLPPPKIVIKTDTNYQGVNM 903 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 M + L++D+SGSM + K+ LK + + + Sbjct: 904 TSMKGFAHNNLMLLLDVSGSMS-------------------SKDKLPLLKESFKYLISIM 944 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQA 301 V +Y G + I + ++ + +D L T+ K A Sbjct: 945 RPQDDVSIVIYAG--------DAAIVLKPTSASNQEQINAVIDKLRSRGKTNVKAGFKLA 996 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 Y+ ++ + K P K+I Sbjct: 997 YKWMSKNFKEGGNNRIIL--ATDGEFPISKYI 1026 >gi|308472813|ref|XP_003098633.1| hypothetical protein CRE_04227 [Caenorhabditis remanei] gi|308268233|gb|EFP12186.1| hypothetical protein CRE_04227 [Caenorhabditis remanei] Length = 395 Score = 36.4 bits (82), Expect = 8.5, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 40/126 (31%), Gaps = 13/126 (10%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL--TSDK 309 +G + Y ++ N + K + Y L +S M + + Sbjct: 78 TRVGFVTYNSQATTNADL----NKFKSY-----GDLQQGVYNSYNDMNLSPEKTPYIGTG 128 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + QG + K II N + ++ + K I I+TI++ Sbjct: 129 LIAAGELLQIQGSADGHVNHPKVIIAYATALNGTGLLDPLSVA--NTLKSAGITIITIAV 186 Query: 370 NASPNG 375 + NG Sbjct: 187 DTDDNG 192 >gi|325287596|ref|YP_004263386.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324323050|gb|ADY30515.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 696 Score = 36.4 bits (82), Expect = 8.6, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 67/220 (30%), Gaps = 44/220 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E P + ++D+SGSM + K+ L +A L + + Sbjct: 334 ELPASNLTFLIDVSGSM-------------------EDHNKLPLLISAFKLLVHQLIEKD 374 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V VY G G + P+ G +K + + T +K AY L Sbjct: 375 KVSIVVYAGAAG------VVLPPTNGDQK-EKIINALQKLEAGGSTAGGQGIKLAY-KLA 426 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENFIKIV 365 + N +I TDG+ N S+ K+ +K + + + + Sbjct: 427 EKNFKKNGNNR---------------VILATDGDFNVGASSDTAMEKLIEKKRASGVFLS 471 Query: 366 TISI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + +L H + VF + Sbjct: 472 VLGFGMGNYKDSKLETLADKGNGNHAYIDTMQEAQKVFGD 511 >gi|257454382|ref|ZP_05619644.1| von Willebrand factor, type A [Enhydrobacter aerosaccus SK60] gi|257448148|gb|EEV23129.1| von Willebrand factor, type A [Enhydrobacter aerosaccus SK60] Length = 550 Score = 36.4 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 42/258 (16%), Positives = 88/258 (34%), Gaps = 56/258 (21%) Query: 165 LNQRYNQKIVSFIPA---LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + N+ I I A + + P L+ LV D+SGSM Sbjct: 163 PWRTANRIIKVAIKADDPTITKQSTLPPANLVFLV-DVSGSMS----------------- 204 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 K+ +K++L + + + Y G T+V T+K+ + Sbjct: 205 --DNDKLPLVKSSLKMLTKQLRPQDTISIVTYAG----RTQVTLPATRGSDTDKILAAID 258 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S T+ A+K AYQ + K + I+ +TDG+ Sbjct: 259 SLDAS---GSTNGEAAIKLAYQQ-------------AKIHYKKDGINR---ILMMTDGDF 299 Query: 342 NNFKSNV-NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA----- 395 N S+V + I + +++ + + T +++ + +Y+ +++ Sbjct: 300 NVGVSDVDEMLDIIRRERDSGVSLSTFGFGEGNLNDHMMEQVADNGNGNYSYIDSLSEAK 359 Query: 396 ----DSLIHVFQNISQLM 409 D + F +++ + Sbjct: 360 KALVDEMSATFNTVAKDV 377 >gi|219847012|ref|YP_002461445.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541271|gb|ACL23009.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 847 Score = 36.4 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 49/330 (14%), Positives = 90/330 (27%), Gaps = 109/330 (33%) Query: 161 IEHLLNQRYNQKIVSFIPAL-----LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 N +V ++ +I + ER +++D+SGSM+ Sbjct: 374 CSATPPPPENTSVVRLTGSVELWPEYQIVVNERRPVQYVVILDVSGSMNANFIGQGIVNG 433 Query: 216 SAPICQDKKR--------------------TKMAALKNALLLFLDSIDL-----LSHVKE 250 C + ++ K AL L + ++ + Sbjct: 434 RVTQCTNGPPGSPPAQSCGQPQYAWNPVQERRIYVAKKALELLIRQTNMPGNPGYDPTQP 493 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ---------- 300 M L+ +T V PS + + ++ K Sbjct: 494 IDSMALVWFTHNV-----PSTNVVPFKSNPNELIQAVNSAGAYQGDPYKTSGGTNGTGGL 548 Query: 301 --AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF----KSNVN----- 349 A Q+L + + + +++ IIF+TDG N F SNVN Sbjct: 549 YRASQLLANAPRTTNQLGKEW--------IYRRAIIFVTDGVTNTFFNANNSNVNGGSSN 600 Query: 350 -----TIKICDKA---------------KENFI----------------------KIVTI 367 T +C KA K N + I + Sbjct: 601 QTTYPTGHVCRKAEVLEDALCQTTEVGGKYNGMDRPITQMVNMTNTIKSNQSIQTDIYAL 660 Query: 368 SINA-SPNGQRLLKTCVSSPEYHYNVVNAD 396 ++++ G L S+P + Y + Sbjct: 661 ALSSIPATG--LRDGVASTPRHFYTAETLE 688 >gi|120437735|ref|YP_863421.1| von Willebrand factor (vWA) type A domain-containing protein [Gramella forsetii KT0803] gi|117579885|emb|CAL68354.1| membrane protein containing von Willebrand factor (vWA) type A domain [Gramella forsetii KT0803] Length = 354 Score = 36.4 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 35/230 (15%), Positives = 73/230 (31%), Gaps = 46/230 (20%) Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL-RIEMGERPIFLIELVVDLSGSMHCA 206 + S + + +L + I P + ++E +R I +D+S SM Sbjct: 46 LAPNRSRSKPVIKLVLILLALASLVIALVNPKMGTKMETVKREGVDIVFAIDVSKSMDAE 105 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 D ++ K + L S+ +G+I Y Sbjct: 106 ---------------DIAPNRLEKSKQLVSQILSSLGSD-------RVGIIAYAGGAYPQ 143 Query: 267 IEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + + + ++ D + + T + A++ A D++ + Sbjct: 144 LPITTDFSAAKMFLQALNTDMISSQGTAISDAIELATTYYDDDQQTN------------- 190 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 + + ++DGE N I ++A E I+I TI + G Sbjct: 191 -----RVLFIISDGE----DHEGNVEDIAEQAAEKGIRIFTIGVGTEKGG 231 >gi|262402640|ref|ZP_06079201.1| protein BatA [Vibrio sp. RC586] gi|262351422|gb|EEZ00555.1| protein BatA [Vibrio sp. RC586] Length = 335 Score = 36.4 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 53/303 (17%), Positives = 101/303 (33%), Gaps = 54/303 (17%) Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM------ 185 + R + + + ++ + D + LLN + Q I + LL + Sbjct: 25 VPAYRSRENAIKVPFFTQILEALDETPKVAAGLLNASHWQHIALIVSWLLMVTALAKPSV 84 Query: 186 ----GERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 R F + +VVDLSGSM ++ +++ A K L F+ Sbjct: 85 LGEVQTREAFGRDVLMVVDLSGSME---------EKDFATESGEQLSRLTAAKKVLRDFV 135 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 ++ GLI + + E + + + T+ A+ Sbjct: 136 TQ-------RQGDRFGLILFGDAAFIQTPFTADQEVWLNLLDEAETGMAGQSTNLGDAI- 187 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 F Q P+ + ++ LTDG N+ S V+ + A Sbjct: 188 ------------GLGIKVFEQ---SPATSQDQIMLVLTDG--NDTGSFVSPVDAAKIAAA 230 Query: 360 NFIKIVTISINASPN-GQRLL------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I+I I++ N G++ L + + + ++ L +Q I Q + + Sbjct: 231 KGIRIYVIAMGDPENVGEQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQVIDQ-LEPQ 289 Query: 413 KYS 415 KYS Sbjct: 290 KYS 292 >gi|221111394|ref|XP_002160866.1| PREDICTED: similar to collagen type VI alpha 6 [Hydra magnipapillata] Length = 419 Score = 36.4 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 36/246 (14%), Positives = 79/246 (32%), Gaps = 42/246 (17%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + + +L ++ + + ++D S S+ + Sbjct: 199 SCEIVSSVLQSTPTIATTQASSTDGHVQPRCEAVVDVAFILDSSHSLEASYQ-------- 250 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT--E 274 ++K K K A + + S +G+I ++ R E +++ + T Sbjct: 251 ----KEKNFLK----KLAAVFGISS--------NGSRVGVITFSYRAELSVKLNSFTDLS 294 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + V + L+ T A++ A + + + + K II Sbjct: 295 SFNEAVDKIP--LMNFTTRIDRALRLAQKDMFTSAN-------------GGRVGVSKLII 339 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 LTDG + +I D+ + + + I+ + I ++ N L + Y Sbjct: 340 LLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETE-LSHITGGKKNAYTAAT 398 Query: 395 ADSLIH 400 DSL Sbjct: 399 FDSLTD 404 >gi|239908012|ref|YP_002954753.1| hypothetical protein DMR_33760 [Desulfovibrio magneticus RS-1] gi|239797878|dbj|BAH76867.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 451 Score = 36.4 bits (82), Expect = 9.0, Method: Composition-based stats. Identities = 11/56 (19%), Positives = 27/56 (48%), Gaps = 5/56 (8%) Query: 362 IKIVTISI-NASPNGQRLLKTCVSS----PEYHYNVVNADSLIHVFQNISQLMVHR 412 I+I +I ++ + L+K SS +++++ + + +F+ I Q + R Sbjct: 384 IEIFSIRFGDSDSTDKNLMKQIASSKPGTEDHYFDAPDEAGIKEMFKKIGQQLGQR 439 >gi|306829469|ref|ZP_07462659.1| collagen adhesion protein [Streptococcus mitis ATCC 6249] gi|304428555|gb|EFM31645.1| collagen adhesion protein [Streptococcus mitis ATCC 6249] Length = 861 Score = 36.4 bits (82), Expect = 9.1, Method: Composition-based stats. Identities = 27/117 (23%), Positives = 47/117 (40%), Gaps = 13/117 (11%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + I + Q ++ I + L E P + LV DLSGSM + P Sbjct: 47 KIITPISGQEDKYELSLDITSKLGTETQSEP-LDVVLVADLSGSM---------NKRDVP 96 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHV--KEDVYMGLIGYTTRVEKNI-EPSWG 272 + T++ ALKN L D L+ + + + ++G+ +++ E SW Sbjct: 97 SSTGRTITRLDALKNTLKGTRDRQGLIDTILSNSNNRLSMVGFGGKIDNKFAEQSWN 153 >gi|301609304|ref|XP_002934186.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 934 Score = 36.4 bits (82), Expect = 9.1, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 70/216 (32%), Gaps = 52/216 (24%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D+SGSM + ++ A+ LF+ I ++ Sbjct: 305 VSLVLDVSGSMSSS-------------------NRIGRQLQAVELFVVQI-----IENGA 340 Query: 253 YMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++G++ +++ T+ R + + T+ ++ Sbjct: 341 HVGIVKFSSSASVVSSLVKINTQAQRDQLKSLIPRTAGGGTNICAGIRA----------- 389 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 N G + I+ LTDGE+N S +C ++ I I++ Sbjct: 390 GIALNKNFDGSSYGTE-----IVLLTDGEDNLDTS------LCFKDITDSGAIIHVIAL- 437 Query: 371 ASPNGQRLLKTCVSSPEYHYNV---VNADSLIHVFQ 403 + L + +N V A+ LI F Sbjct: 438 GPNAAKELETIANMTGGLRFNALDKVEANELIDAFS 473 >gi|293570439|ref|ZP_06681494.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291609385|gb|EFF38652.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1042 Score = 36.4 bits (82), Expect = 9.1, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 53/152 (34%), Gaps = 41/152 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM ++ +K + F+D++ S + + Sbjct: 291 LDLVLVVDWSGSM-------------------NDNDRIGEVKIGVDRFVDTL-SDSGITD 330 Query: 251 DVYMGLIGYTT--RVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + MG +GY++ N G + V+ V S T + ++ A +L+ Sbjct: 331 KINMGYVGYSSDGYNYSNGTVQMGSFDSVKNQVKSITPSWTNGGTFTQKGLRDAGDMLSV 390 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +K I+ LTDG Sbjct: 391 PNGH------------------KKVIVLLTDG 404 >gi|315126093|ref|YP_004068096.1| pilin biogenesis protein [Pseudoalteromonas sp. SM9913] gi|315014607|gb|ADT67945.1| pilin biogenesis protein [Pseudoalteromonas sp. SM9913] Length = 1057 Score = 36.4 bits (82), Expect = 9.2, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 63/211 (29%), Gaps = 47/211 (22%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSD-------------------------PEDVNSA 217 ++ E+P ++ + D SGSM +N E Sbjct: 34 VQTNEKPRVIM--IFDTSGSMAWDVNDGDACYKRSGNSYYEVDCFESKDSYKRNEQCYKR 91 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + + T + + + + +D+ GL+ + + G EK Sbjct: 92 TSYYNYEPTCSDSRLRVAQNAITQL---VNDNDDIEFGLMRFNGSNGGYVLARVGAEK-S 147 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK----RSFFTNFFRQGVKIPSLPFQKF- 332 + + + T T + +AY +T + +F + I PF Sbjct: 148 FLLNKVEELPASGSTPLTETLWEAYLYITGQNVYYGGNTNERDFTAESSGIYKSPFAPVT 207 Query: 333 -----------IIFLTDGENNNFKSNVNTIK 352 II +TDG+ N + N I Sbjct: 208 GEPLRCDNSINIILMTDGDPTNDSNRNNDIY 238 >gi|291242484|ref|XP_002741138.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 765 Score = 36.4 bits (82), Expect = 9.2, Method: Composition-based stats. Identities = 36/295 (12%), Positives = 88/295 (29%), Gaps = 59/295 (20%) Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + + N + + + + L + + I P Sbjct: 221 DDPSDEYSYHNRMAPNRQNRICSGQSAWGVM--------LNHTDFKDGNNPTRIIDDTTP 272 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + G R + LV+D SGSM ++ L + F Sbjct: 273 NFIIKRSGVRR---VVLVLDTSGSMDGD--------------------RIQRLHQSATYF 309 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 +++ + +G Y E +G + R + ++ + T + Sbjct: 310 IET--RIEDGSFVGIVGFSSYAVIHSGITEIKYGFQ--RGEIASNVPQVASGATSIGDGL 365 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKA 357 + A Q+L S + ++ +TDG EN + + + Sbjct: 366 RVALQVLQDGNVTSEGAS----------------LLLITDGIENTYP----LLMNVMQEV 405 Query: 358 KENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ +++ TI+ + Q L+ + ++ V + D+ +++ + Sbjct: 406 YDSGVRVDTIAY--TEAAQSTLQELSDNTGGLYFYVPDNDTSTAFIDSLAATISE 458 >gi|221110023|ref|XP_002170779.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 671 Score = 36.4 bits (82), Expect = 9.2, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 69/222 (31%), Gaps = 42/222 (18%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 L F + ++D SGS+ + LK Sbjct: 27 GTLHLETPDCEGFFDVGFILDSSGSLKSQY-----------------WKEKDFLKKLANS 69 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F S + + G++ ++ E +I + D S + T A Sbjct: 70 FGIS-------NKGSHAGVVTFSHYAELSIRLDAFYSSIDFNDAVDRISHMDSFTRIDLA 122 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + +A ++ + LTDG+ + + I D+ Sbjct: 123 LAKALELFDIKNGARNDVPNL--------------LFLLTDGKQ---EPEMPLTHISDEI 165 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 K+ I++ + I A N + L+ V +PE + V + D L+ Sbjct: 166 KQKGIQLFAVGIGAGAN-KTELEKIVGNPENVFMVDDFDKLL 206 >gi|148675553|gb|EDL07500.1| mCG120277 [Mus musculus] Length = 1031 Score = 36.4 bits (82), Expect = 9.2, Method: Composition-based stats. Identities = 36/223 (16%), Positives = 71/223 (31%), Gaps = 39/223 (17%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D+SGSM TK+ K A+ L + Sbjct: 250 VVFVIDVSGSMFG--------------------TKLQQTKKAMDTILSDLQASDSFNIIT 289 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + + + YV+R TD A+ A +L + Sbjct: 290 FSDTVNIWKAEGSIQATVQNIHSAKNYVSRMEAD---GWTDINAALLAAASVL------N 340 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIVTISINA 371 +G + +P I+FLTDGE ++ + I +A + + + +++ Sbjct: 341 HSNQEPGKGRGVGQIPL---IMFLTDGEPTAGETTPSVILSNIRQALAHRVSLFSLAF-G 396 Query: 372 SPNGQRLLKTCVSSPEY-----HYNVVNADSLIHVFQNISQLM 409 LL+ + + + A L ++ IS+ + Sbjct: 397 DDADFSLLRRLSLENQGEARRIYEDADAALQLEGLYAEISRPL 439 >gi|160892883|ref|ZP_02073672.1| hypothetical protein CLOL250_00414 [Clostridium sp. L2-50] gi|156865442|gb|EDO58873.1| hypothetical protein CLOL250_00414 [Clostridium sp. L2-50] Length = 596 Score = 36.4 bits (82), Expect = 9.3, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 78/231 (33%), Gaps = 47/231 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + K++ I+M E+ + ++D SGSM + K Sbjct: 192 NKDTKLMMVGLNTAAIDMSEKKASNLVFLIDTSGSM-------------------YEENK 232 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDS 286 + + A + +++D + Y G + G Y + +DS Sbjct: 233 LPLAQKAFKMLAENLDENDRISIVTYAG---------SDTVVLNGVAGSEAYTICEALDS 283 Query: 287 L-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-NNNF 344 L T+ + + AY+I +Q +K + +I TDG+ N Sbjct: 284 LEASGSTNGSAGLITAYEI------------AEQQFIKDGNNR----VILATDGDLNVGL 327 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 S + + + + K++ I + + + L+ +Y+ +++ Sbjct: 328 TSESDLVGLITEEKDSGIFLSVLGFGSDNLKDNKLEALADHGNGNYSYLDS 378 >gi|115496702|ref|NP_001068594.1| collagen alpha-2(VI) chain [Bos taurus] gi|94574217|gb|AAI16098.1| Collagen, type VI, alpha 2 [Bos taurus] gi|296490819|gb|DAA32932.1| collagen, type VI, alpha 2 [Bos taurus] Length = 917 Score = 36.4 bits (82), Expect = 9.3, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 58/182 (31%), Gaps = 59/182 (32%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL--S 246 + V+D S S+ T KN ++ ++ + + Sbjct: 611 GALDVVFVIDSSESIG--------------------YTNFTLEKNFVINVVNRLGAIAKD 650 Query: 247 HVKED-VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL------------ILKPTD 293 E +G++ Y+ GT + Q +DSL I T Sbjct: 651 PKSETGTRVGVVQYSHE---------GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTW 701 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+K AY L + +R + F + +TDG ++ ++N + Sbjct: 702 TPSALKFAYNKLIKESRRQKT---------------RVFAVVITDGRHDPRDDDLNLRAL 746 Query: 354 CD 355 C+ Sbjct: 747 CN 748 >gi|78186669|ref|YP_374712.1| hypothetical protein Plut_0797 [Chlorobium luteolum DSM 273] gi|78166571|gb|ABB23669.1| putative membrane protein [Chlorobium luteolum DSM 273] Length = 349 Score = 36.4 bits (82), Expect = 9.3, Method: Composition-based stats. Identities = 9/44 (20%), Positives = 22/44 (50%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES 55 +K + SE+ +I+FAL++ L + + H + +++ Sbjct: 10 QKRLQSERGGAAILFALTLPVLLGFAALAVDLARIHLTRVELQN 53 >gi|163815506|ref|ZP_02206879.1| hypothetical protein COPEUT_01671 [Coprococcus eutactus ATCC 27759] gi|158449143|gb|EDP26138.1| hypothetical protein COPEUT_01671 [Coprococcus eutactus ATCC 27759] Length = 550 Score = 36.4 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 68/187 (36%), Gaps = 46/187 (24%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V D SGSM DP M LKN+L + ++ ++ Y+G Sbjct: 379 VADCSGSMD----GDP----------------MNQLKNSLT------NGAQYINDNNYVG 412 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFF 314 L+ Y+ V + + R Y +++LI T S A+ A +++T K + Sbjct: 413 LVSYSNSVTIEVPIAQFDLNQRSYFQGAVNNLIASGGTASYDAVVVAVKMITEAKAQHPD 472 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 + + L+DG NN + + T + + + I + TI Sbjct: 473 A--------------KCMLFLLSDGYANNGYSMDEITSAL----RTSGIPVYTIGYGDDA 514 Query: 374 NGQRLLK 380 + L + Sbjct: 515 DTGELAR 521 >gi|116625273|ref|YP_827429.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228435|gb|ABJ87144.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 307 Score = 36.4 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 33/242 (13%), Positives = 82/242 (33%), Gaps = 52/242 (21%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P +++ E + L++D SGSM KR K+ A AL Sbjct: 53 PQSVKVFRREDVPVSMGLIIDNSGSM------------------RDKRGKVEAAALAL-- 92 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + E ++ + + + + ++L + A Sbjct: 93 ----VRDSNPDDEVF---IVNFNDEAFLDNPHG---KDFTTNIKEMEEALTRIDSRGGTA 142 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 M+ A ++ K + +K ++ +TDG N+ S V+ + + Sbjct: 143 MRDALRMSIDHVKEKAHKD-------------KKVLVVVTDG--NDNSSVVSLENLVKAS 187 Query: 358 KENFIKIVTISI------NASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMV 410 +++ + I + + + QR LK ++ + + + + V +++ + Sbjct: 188 QQSEVLIYGVGLLGDEERREAQRAQRALKALAEATGGEVFFPKDVNEVDKVAHQVARDIR 247 Query: 411 HR 412 ++ Sbjct: 248 NQ 249 >gi|221111396|ref|XP_002160892.1| PREDICTED: similar to proximal thread matrix protein 1 [Hydra magnipapillata] Length = 315 Score = 36.4 bits (82), Expect = 9.5, Method: Composition-based stats. Identities = 36/246 (14%), Positives = 79/246 (32%), Gaps = 42/246 (17%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + + +L ++ + + ++D S S+ + Sbjct: 93 SCEIVSSVLQSTPTIATTQASSTDGHVQPRCEAVVDVAFILDSSHSLEASYQ-------- 144 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT--E 274 ++K K K A + + S +G+I ++ R E +++ + T Sbjct: 145 ----KEKNFLK----KLAAVFGISS--------NGSRVGVITFSYRAELSVKLNSFTDLS 188 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + V + L+ T A++ A + + + + K II Sbjct: 189 SFNEAVDKIP--LMNFTTRIDRALRLAQKDMFTSAN-------------GGRVGVSKLII 233 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 LTDG + +I D+ + + + I+ + I ++ N L + Y Sbjct: 234 LLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETE-LSHITGGKKNAYTAAT 292 Query: 395 ADSLIH 400 DSL Sbjct: 293 FDSLTD 298 >gi|16080727|ref|NP_391555.1| hypothetical protein BSU36740 [Bacillus subtilis subsp. subtilis str. 168] gi|221311634|ref|ZP_03593481.1| hypothetical protein Bsubs1_19866 [Bacillus subtilis subsp. subtilis str. 168] gi|221315962|ref|ZP_03597767.1| hypothetical protein BsubsN3_19787 [Bacillus subtilis subsp. subtilis str. NCIB 3610] gi|221320873|ref|ZP_03602167.1| hypothetical protein BsubsJ_19730 [Bacillus subtilis subsp. subtilis str. JH642] gi|221325158|ref|ZP_03606452.1| hypothetical protein BsubsS_19896 [Bacillus subtilis subsp. subtilis str. SMY] gi|321313224|ref|YP_004205511.1| hypothetical protein BSn5_09320 [Bacillus subtilis BSn5] gi|8928525|sp|P70960|YWMC_BACSU RecName: Full=Uncharacterized protein ywmC; Flags: Precursor gi|1648853|emb|CAB03680.1| unknown [Bacillus subtilis subsp. subtilis str. 168] gi|2636199|emb|CAB15691.1| putative exported protein [Bacillus subtilis subsp. subtilis str. 168] gi|320019498|gb|ADV94484.1| hypothetical protein BSn5_09320 [Bacillus subtilis BSn5] Length = 227 Score = 36.4 bits (82), Expect = 9.5, Method: Composition-based stats. Identities = 30/243 (12%), Positives = 67/243 (27%), Gaps = 30/243 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 ++ + S A + E + + +++D SGSM ++ NSA Sbjct: 5 FSLIMMTGLLFGLTSPAFAAEKTETEAKAPANVAVLLDASGSMAKRID-GVSKFNSAKKE 63 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 K + + + S + + V ++ + ++ Sbjct: 64 ISKFASSLPEGTQVKMSVFGS-EGNNKNSGKVQSCEAIRNVYGFQSF-------NEQSFL 115 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T A+ +A + +K + LTDGE Sbjct: 116 NSLNTIGPTGWTPIAKALNEAKSSFDQLDAKG-----------------EKVVYLLTDGE 158 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLI 399 N IK + +++ I + I + + L ++ + Sbjct: 159 ETC---GGNPIKTAKELQKDNITVNVIGFDYKEGYKGQLNAIAKVGGGEYFPAYTQKDVE 215 Query: 400 HVF 402 +F Sbjct: 216 KIF 218 >gi|326430405|gb|EGD75975.1| hypothetical protein PTSG_00683 [Salpingoeca sp. ATCC 50818] Length = 762 Score = 36.4 bits (82), Expect = 9.5, Method: Composition-based stats. Identities = 27/209 (12%), Positives = 67/209 (32%), Gaps = 41/209 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + ++D SGS+ A +D +D ++ Sbjct: 501 DVLFILDNSGSVGP-----------------------DNFALAQEFVMDLVDQMTISSTA 537 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKK 310 + +G + ++V+ + ++ ++ T + A+ A + Sbjct: 538 INVGAFLFNSQVQMLTAFTDDKTAIQAAISGYSYPPSNTAGTATGAALNFAVDTMLQSGA 597 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 G + S+ ++I TDG + S V + ++V++ I Sbjct: 598 ----------GYRGGSVLV--YVI--TDGRSQEDSSFVASAA--ANLHATGAEVVSVGIT 641 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 +S + + L+T +S + + + L Sbjct: 642 SSVDETQ-LRTIATSDSNVFVLEDFSDLN 669 >gi|91228647|ref|ZP_01262563.1| putative calcium-binding outer membrane-like protein [Vibrio alginolyticus 12G01] gi|91187798|gb|EAS74114.1| putative calcium-binding outer membrane-like protein [Vibrio alginolyticus 12G01] Length = 2510 Score = 36.4 bits (82), Expect = 9.7, Method: Composition-based stats. Identities = 34/256 (13%), Positives = 79/256 (30%), Gaps = 39/256 (15%) Query: 98 IKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 + N+ + + + N + ++ S +T+ + V+ S Sbjct: 1975 LSNYDQMLIGSANGEEVIKISVNDTGVVSTELLGPIDHSNPSGEDTLNIEVPVLVS---N 2031 Query: 158 LQFIEHLLNQRYNQKIVSFIPALL--RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + + + + S A + + + ++L++D+SGSM Sbjct: 2032 ARGLTNSTTVNVIVEDDSPDSASIIHDVVAETKESANVQLIMDVSGSM------------ 2079 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTE 274 + ++ +K++ L+ + + + V + T +W E Sbjct: 2080 -------RTDNRLQIMKDSATQLLNQYESI--GQTRVQIITYSSTASTYAIGAATWLTVE 2130 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + + Y+ T+ A+ +A Q K + +N P FI Sbjct: 2131 EAKAYIETLTAG---GATNYNNALNEAKQSWDDVGKLTSASNVSYFLSDGQPNPASSFIN 2187 Query: 335 ---------FLTDGEN 341 LTD +N Sbjct: 2188 DAREQSWIDHLTDPDN 2203 >gi|258623679|ref|ZP_05718665.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258584045|gb|EEW08808.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 371 Score = 36.4 bits (82), Expect = 9.8, Method: Composition-based stats. Identities = 51/273 (18%), Positives = 91/273 (33%), Gaps = 54/273 (19%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEM----------GERPIF--LIELVVDLSGSMHCAMNS 209 LLN + Q I + LL + R F + +VVDLSGSM Sbjct: 91 AGLLNASHWQHIALIVSWLLIVTALAKPSVLGEVQTREAFGRDVLMVVDLSGSME----- 145 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 ++ +++ A K L F+ ++ GLI + Sbjct: 146 ----EKDFSTAAGEQLSRLTAAKRVLRNFVTQ-------RQGDRFGLILFGDAAFIQTPF 194 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + + + + + T+ A+ F Q PS Sbjct: 195 TADQDVWLNLLDEAETGMAGQSTNLGDAI-------------GLGIKVFEQ---SPSTSQ 238 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLL------KTC 382 + ++ LTDG N+ S V+ + A I+I I++ N G++ L + Sbjct: 239 DQIMLVLTDG--NDTGSFVSPVDAAKIAAAKGIRIYVIAMGDPENVGEQPLDMDVVNRVS 296 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + ++ L +Q I Q + +KYS Sbjct: 297 SLTQARSFVAIDQPQLNEAYQVIDQ-LEPQKYS 328 >gi|260592519|ref|ZP_05857977.1| BatB protein [Prevotella veroralis F0319] gi|260535565|gb|EEX18182.1| BatB protein [Prevotella veroralis F0319] Length = 331 Score = 36.4 bits (82), Expect = 9.9, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 60/193 (31%), Gaps = 45/193 (23%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 R + + +D+S SM D +++ K + ++ Sbjct: 83 ANNTSREGIEVIMALDISNSMLA---------------TDVVPSRLDKSKLMVEGLMNKF 127 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQA 301 +GLI + + + + ++ SLI + TD A+ A Sbjct: 128 TKN-------KLGLIVFAGDAFVQLPITSDYVSAKMFLDNINPSLIGTQGTDIGKAINLA 180 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 P+ K I+ +TDGE+N + + A+E Sbjct: 181 ------------------MHSFTPNTQTGKAIVVITDGEDNEGGAEAMAKQ----AQEKG 218 Query: 362 IKIVTISINASPN 374 IK+ + I ++ Sbjct: 219 IKVFILGIGSTQG 231 >gi|255039144|ref|YP_003089765.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254951900|gb|ACT96600.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 935 Score = 36.4 bits (82), Expect = 9.9, Method: Composition-based stats. Identities = 39/207 (18%), Positives = 74/207 (35%), Gaps = 48/207 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L++D+S SM KM LK ++ L + + + Sbjct: 759 MVLLLDVSSSM-------------------NSPYKMPLLKRSIKSLLTLVRPEDMISIVL 799 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKR 311 Y G + ++P+ G + ++R +D L TD +K AY+ Sbjct: 800 YSG------KARVVLKPTSGAK--ASEISRMIDLLQSDGDTDGNEGIKLAYKT------- 844 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN-FIKIVTISIN 370 +Q ++ + I+ TDGE S+ I A+++ ++ I T Sbjct: 845 -----ANKQYIRGGNNR----IVLATDGE--FPVSDEVMDMIRQNARQDVYLSIFTFG-R 892 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADS 397 GQ+L K + +V +A + Sbjct: 893 HEHTGQKLKKLSELGMGSYAHVTDASA 919 >gi|156409369|ref|XP_001642142.1| predicted protein [Nematostella vectensis] gi|156229283|gb|EDO50079.1| predicted protein [Nematostella vectensis] Length = 182 Score = 36.4 bits (82), Expect = 9.9, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 57/165 (34%), Gaps = 23/165 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + + I Y++R + + T R + T + A++ A L Sbjct: 41 RTSTRVAAIVYSSRPRVAFDFNRYTSARRAAHAVKRLRFLRGGTSTGRALRLASSRLFRR 100 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 R +K ++ +TDG KS+ + +K K ++I + Sbjct: 101 YGRKR----------------RKVLMLITDG-----KSSDDVLKPSKALKRKGVQIFAVG 139 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + S + L+ S P Y + SL + +++++ K Sbjct: 140 VGMSVSRNELI-LIASHPSQVYQA-SFTSLSAIVKSLARKTCESK 182 >gi|257878265|ref|ZP_05657918.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] gi|257812493|gb|EEV41251.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] Length = 1107 Score = 36.4 bits (82), Expect = 9.9, Method: Composition-based stats. Identities = 28/151 (18%), Positives = 52/151 (34%), Gaps = 41/151 (27%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + LVVD SGSM ++ +K + F+D++ S + + Sbjct: 269 LDLVLVVDWSGSM-------------------NDNNRIGEVKIGVDRFVDTLAD-SGITD 308 Query: 251 DVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + MG +GY++ + + V+ V S T + A++ A +L+ Sbjct: 309 KINMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSV 368 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +K I+ LTD Sbjct: 369 PNGH------------------KKVIVLLTD 381 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.312 0.117 0.278 Lambda K H 0.267 0.0359 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 5,444,743,759 Number of Sequences: 14124377 Number of extensions: 189240059 Number of successful extensions: 599609 Number of sequences better than 10.0: 3071 Number of HSP's better than 10.0 without gapping: 357 Number of HSP's successfully gapped in prelim test: 2823 Number of HSP's that attempted gapping in prelim test: 595849 Number of HSP's gapped (non-prelim): 4239 length of query: 420 length of database: 4,842,793,630 effective HSP length: 142 effective length of query: 278 effective length of database: 2,837,132,096 effective search space: 788722722688 effective search space used: 788722722688 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 82 (36.4 bits)